ID A0A2U1MNN9_ARTAN Unreviewed; 3606 AA.
AC A0A2U1MNN9;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:PWA62870.1};
GN ORFNames=CTI12_AA355920 {ECO:0000313|EMBL:PWA62870.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA62870.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA62870.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA62870.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA62870.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01004766; PWA62870.1; -; Genomic_DNA.
DR STRING; 35608.A0A2U1MNN9; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR CDD; cd06071; Beach; 1.
DR CDD; cd01201; PH_BEACH; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.10.1540.10; BEACH domain; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR Gene3D; 2.30.29.30; Pleckstrin-homology domain (PH domain)/Phosphotyrosine-binding domain (PTB); 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR000409; BEACH_dom.
DR InterPro; IPR036372; BEACH_dom_sf.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR023362; PH-BEACH_dom.
DR InterPro; IPR011993; PH-like_dom_sf.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR46108; BLUE CHEESE; 1.
DR PANTHER; PTHR46108:SF4; BLUE CHEESE; 1.
DR Pfam; PF02138; Beach; 1.
DR Pfam; PF14844; PH_BEACH; 1.
DR Pfam; PF00400; WD40; 3.
DR SMART; SM01026; Beach; 1.
DR SMART; SM00320; WD40; 4.
DR SUPFAM; SSF48371; ARM repeat; 2.
DR SUPFAM; SSF81837; BEACH domain; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF50729; PH domain-like; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50197; BEACH; 1.
DR PROSITE; PS51783; PH_BEACH; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 2.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT DOMAIN 2728..2894
FT /note="BEACH-type PH"
FT /evidence="ECO:0000259|PROSITE:PS51783"
FT DOMAIN 2915..3207
FT /note="BEACH"
FT /evidence="ECO:0000259|PROSITE:PS50197"
FT REPEAT 3340..3375
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 3561..3602
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 16..48
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 365..400
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 418..446
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2056..2075
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 24..48
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 365..382
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 418..443
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3606 AA; 401717 MW; 5C2C6784EA165FC9 CRC64;
MKWVTLLKDF KEKVGLASHS PPTAAESPPR SSYFDANTST SSSNAGYDHP LARENYELEL
DFKRSWEEFR TSTSEKEKEK ALNITIDIFC RLVKQHSNVA QLISIAFVAD IEKLKLSSKT
RSLEIESVIG FFSEVTEDGT RAGSNLLQAV SFLVTGDIDK QSLLDSGILC CLVHILNALL
VSDGGNVKNS LTLGEEPEVT ENVAPERWFE VVGSILHIMK ALASHPASAQ SLTDDNSLKL
LFEMVANGSL VLFSRYKEGL VPLHTIQLHR HAMQVLGLLL GNDNGSTAEY IRKHQLIKVL
LMAVKGFSPE SGDSAYTVGI VDLLLESVEL SYRPEAGGVR LREDIHNAHG YQFLIQFALV
LSRDQGSQTS QNSDPEGSNS TNDMDTQDEK GKGDSSSQSV SPTLSRLLDI FVNLSQTGST
NCTGSPESKG KKNQVNPGRS SDRYSNDVWE KDNYKVKDLE AVRMLQDIFL KSDSMELQVE
VLNRIFKIFS SHLENYTMCQ QLRTVPLLIL NMGCFPNSLQ EIILKILEYA VTVVNCIPEQ
ELLSLCCLLQ QQITSELKHT ILTFFVKLVS FDQQFKKFLR EVGVLEVLLD DLKQHKFLVG
PEEYNDDSGK VERKGSPNGF KNKDNKDAIL LSNNLLDSGS GKFSLFEDEG TIPVAWDCLV
SLLRKADYNQ VTFRSADGVN IALPFLASDV HRPGVLRVLS CLIIEDSAQA HSDELGALLE
VSKSGMVTSI SGCQYKLSDD AKCDIFGAIW RILGSSSSAQ RVFGETTGFS LILTTLHGFQ
GVKQSPLTVC LKVFTHLLRV VTAGACKNAA NRVKLHDIIS SQSFCDHLSE SGLICVEGEK
QVMQLLFELA LEIVLPPSFT PETTEPLDDI EIMSVFRIIT PSASVIPDKQ LIYNACAVRI
LIRLLLRFTP KLQLEFLKLI EELASAGSFN QESLTSAGCV ELLLEIIHPF LSGSSSLLSH
ALRIIEVLGA YRLSSAELRV IIRYILQVRR MKSVHTLVDM MERFTDSDTI PLAPFVEMNM
SKSGHACIQV PLGERSWPPA AGYSFVCWFQ YQNFIKSNVT TCPQVMRIFS VGATDGGNNN
TFCAELYLQE DGTLTLATSN SSSLSFSGLD IDEYRWHHLV VVHSKPNALA GLFQSSVSYV
YLNGKLRHTG RLGYSPSPAG KSLQVTIGTP DNCARVNDLS WKIRSCYLFE EVLTPGSICF
MYILGRGYKG LFQDTNLLQF VPNQACGGET MAILDSLEAD LALASNSQRP ESGNRQGSSK
TDGSGVVWDV ERLGNFSLHL SARKLIFAFD ATSTEASRVS GTLSMLNLVD PMSGAASSIG
GETMRPIGGL AMVLALVEAA ETRDMLHMVL TLLACALHQH PQNVKYMQSC RGYHLLALLL
HPKMSLFDMP SLEIFFQIAA CEASFPGPKK LEKTDNMPPV LTVQEVSFEE EPNLSKFRDD
VSSVGSQEDM DDISVNKDAF RHISELDNDG LPAETSNCIV LSNADMVEHV LLDWTVWVVA
SVPIQITLLG FLENLVSVRW YRNHNLTVLR NINLVQHLLV TLQRGDVEVP VLEKVVVLLG
AILKDGFVIS ELEHVVTFVI MTFDPPEPTT KSQILRESMG KHIIVRNMLL EMLTDLLVTI
KSEELLEQWH KTVSSRLLTY FFDEAVHPTS MRWVMTLLGV CLTSSSTFSL KFRTSGGYQG
LVRVLPSFYD SPDIYYILFC LIFGKPVYPR LPEVRLLDFH ALIPSDGSRT ELKSLELLDS
VIAMAKSTFD RLSIQTMIAC ESGDLNQVGA GLVAELVEGN TEMTGELQGE ALMHKTYAAR
LMGGDASAPA TATSVLRFMV DLVKMSPPFS AVCRRAEFLE NCVDLYFSCV RSAHAVKLAK
KLCVKSDDKN LNDGEDIVSS HNSLPLEQEQ CAKTSISLGS FPPVHASTSS EDIPVATTNL
DEYKADNLTS PPLESHDSVT DLSSHDLKAT PETVNQTSKL ELHNSVTDIS LHDLQAAPET
VQQISTLESD TSVTGLSAQH LNATSETACQ IITLESENLV KDLSSHDLKA SLETVRQTST
LGSSSLSKFD STLLSEKSTG VQQTPSSQAL TTSLTESKRK LAASPSMASS ISMTEFDTAS
DSKHPPLIPN TTDPIFTINP QMLLDADDLG YGGGPCSAGA TAILDFMAGV LSDFVTDQIK
AAPVIEIILK SVPLYVDAET LLVFQGLCLG RLMNFLERRI LRDDEEDAKK LDKTRWSSNL
DPLCWMIVDR IYMGALPEPV SVLKTLEFLL SMLQLANKDG RIEQAPPPGK GLLSIGRAKQ
LDAYVLSMFK NMNRTLMYCF LPSFLISIGE DELLSRMGLQ IEPKKGFVLN GSQEDGVIDI
CNVLQVLVAH KRLIFCPSNL DNDLNCCLCI NLISLLHDQR QNAQNSAVEI LKHLLLYRKT
ALEDLLVSKP NQGAVLDAFH GGFDKLVTES MQSFLEWLYK SEDLVDKTLE QCAAIMWAQY
IAGSLKFPGV RIKGMDSRRK KDISKKSKDS RKFDRRHWEQ IKERRVALES VRDSMCTELR
VIRQDKYGWV LHAESEWQTH LQQLVHERGI FPLPKSLSVE EPEWQLCPIE GPYRMRKKIE
RCKLKVDTIG SILNVEFEGR ELSPEKNELG LITFDCGSDS FSNLLPYDEL YDDSDGVKDV
GSNRVRWNDD KDSSIFEASV QSAAEFNVKP SSAIVQKLES IIEHSEVGSL RRYASLRSEN
VRVTDDKMDK MDKESSDSGE YLIRPYMERN EKIKFKYNCE RVVSLDKHDG IFLIGELCLY
VIENFYIDDS GYICEKKCED DISVIDQALG VKKEVSLSMN SHTKSTSSWG VNVNKYTGGR
AWAYTSGAWG NENAVTSCNV PHFWRMWKLH SVHELLKRDY QLRPVAVEIF SMDGCNDLLV
FHKKEREEVF RNLLAMNLPR NSTLLDPTIS GSLKQESTFK LTTKPFSKRW QNGEISNFQY
LMHLNTLAGR GYSDLTQYPV FPWILSDYES ENLDLTKAET FRKLDKPMGC QTKEGELEFR
KRFDSWDDPE IPKFHYGSHY SSAGIVLFYL IRLPPFSSEN QKLQGGQFDH ADRLFSNLRD
TWSSASGKAN TSDVKELIPE FFYMPEFLEN GFDLQLGEKQ SGEKVGDVRL PPWAKGSTRE
FIKKHREALE SDYVSENLHH WIDLIFGYRQ RGKAAEEAGN VFYHYTYEGS VDIDAVDDPA
MKAAILAQIN HFGQTPRQLF LRPHAKRRTD KKRPSNPLKL SSRLIPHEIR KISSSVAQIV
TFNDRILMVG KNNVMKPRTY TKYVAWGFPD HSLRFVTYDQ DRLLSTHENL HGGHEIQCAS
ASHDGQFIVT GADDGLVCVW RIGSYGGHRG PRKLSLKKTL CAHTDKITQI YVCQPYMMIV
SGSDDCTVII WDLRSLVFVK QLPEFPFPIS ALFMNDMTGE IVTAAGVLLA IWSVNGDCLA
VVNTSQLPSD IILSLVFVKQ LPEFPFPISA LFMNDMTGEI VTAAGVLLAI WSVNGDCLAV
VNTSQLPSDI ILSVTTCTFS DWLDTNWYAS GHQSGAIKVW QMVHNSSEMA QTHKQNSANL
QSGSGLGGKI PEYTLVLHKI LKGHKHPVTA LHISSDLKLL FSGDSGGHLF TWTLPDETLR
SSIKRG
//