ID C1N6U9_MICPC Unreviewed; 1550 AA.
AC C1N6U9;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 68.
DE RecName: Full=HELP domain-containing protein {ECO:0000259|Pfam:PF03451};
GN ORFNames=MICPUCDRAFT_42955 {ECO:0000313|EMBL:EEH52167.1};
OS Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Mamiellaceae; Micromonas.
OX NCBI_TaxID=564608 {ECO:0000313|Proteomes:UP000001876};
RN [1] {ECO:0000313|EMBL:EEH52167.1, ECO:0000313|Proteomes:UP000001876}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1545 {ECO:0000313|EMBL:EEH52167.1,
RC ECO:0000313|Proteomes:UP000001876};
RX PubMed=19359590; DOI=10.1126/science.1167222;
RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L.,
RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E.,
RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M.,
RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H.,
RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., Gready J.E.,
RA John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., Moreau H.,
RA Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., Piegu B.,
RA Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., Zelensky A.,
RA Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., Van de Peer Y.,
RA Grigoriev I.V.;
RT "Green evolution and dynamic adaptations revealed by genomes of the marine
RT picoeukaryotes Micromonas.";
RL Science 324:268-272(2009).
CC -!- SIMILARITY: Belongs to the WD repeat EMAP family.
CC {ECO:0000256|ARBA:ARBA00006489}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG663749; EEH52167.1; -; Genomic_DNA.
DR RefSeq; XP_003063794.1; XM_003063748.1.
DR STRING; 564608.C1N6U9; -.
DR GeneID; 9689235; -.
DR KEGG; mpp:MICPUCDRAFT_42955; -.
DR eggNOG; KOG2106; Eukaryota.
DR OMA; HATEYNI; -.
DR OrthoDB; 294256at2759; -.
DR Proteomes; UP000001876; Unassembled WGS sequence.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 4.
DR InterPro; IPR011048; Haem_d1_sf.
DR InterPro; IPR005108; HELP.
DR InterPro; IPR011047; Quinoprotein_ADH-like_supfam.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR13720:SF33; HELP DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR13720; WD-40 REPEAT PROTEIN; 1.
DR Pfam; PF03451; HELP; 2.
DR Pfam; PF00400; WD40; 8.
DR SMART; SM00320; WD40; 17.
DR SUPFAM; SSF51004; C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase; 1.
DR SUPFAM; SSF50998; Quinoprotein alcohol dehydrogenase-like; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 2.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 7.
DR PROSITE; PS50294; WD_REPEATS_REGION; 2.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000001876};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT DOMAIN 75..139
FT /note="HELP"
FT /evidence="ECO:0000259|Pfam:PF03451"
FT REPEAT 342..373
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 461..502
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 724..756
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 869..936
FT /note="HELP"
FT /evidence="ECO:0000259|Pfam:PF03451"
FT REPEAT 1142..1183
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 1250..1272
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 1392..1433
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 1509..1543
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 1..71
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 32..46
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1550 AA; 168158 MW; 3235A667B9AA8C6F CRC64;
MGAGASTEFN PEAVSLGDAE SSEPVDAEGY DSDLADDFRD LKAPRAKPAQ GRPTVETSRG
RLGPEPSGDA KRALPCAKKI VPPFQYPKDH PEGAEKLPDN KLRLEFVHGY RSHDARHNLH
YDKKGLLVYH AAALGIVLNA KTRKQTFFDA HSDDVTCIAK HPGGDFFATG QVGDFPAIHV
WRSSDAKKVC TIRSTELHKN AILCAAFSKC GKYLASVGAD EAHLVAVHEW GPPGWDPLKG
EATPKLIATE KFGRARPYVC AFNPVDGRLV VGGKKSLKFF TIDDGTLRVA PAQYSHGASK
GFAACSVLSI AFLPDGSTFA GTMKGDAYKY EEGGCRAVRK FAAIHHGPIH DMAFTGKVLA
TAGKDGKIKL WSVFMQPVFE VDTAKVAEGL LDAHAQPRSY AAGKAPSIRA LAPSADGRKL
AYGTAASEIF EIDITDEKAA QDKTKAKLLM NGHAGAIDPK TGADRGDVWG LAMHPREPRF
VTVSEDRSIR LWSLKGKTQD RMLRLPSKGR CVDWHPKTEH VAVGTYRGDV IVVDVEKGTI
VTRTKLSNVR VNAVNYSPCG CFLGCASQDG VFRVLGVYAG QSGYQIVDKT DDVKPFFVID
AAHEDDRGPQ AMTHVDWSED SKFVQINTAG GDLLFFTAPQ CDPVEAAFAP VRDAEWNSWT
IPMGWATQGL WSEDAKPGDL NAVARSNRGD WEDGERVLAC GDDYGSVKLA KYPANVGVSD
AHEYRGHSAH VTNVAFSASD KWLVSTGGGD RCVMVWRHKD VDGPPDGLTP EAEAMLEDKK
LTHEEKEERV LAIVHEAVGP DTDSESDEED VVSYDQNAAS SKPVMMLQTG MEHGLGVFTP
VNATDGVPNP YKNHSLAELG CRRGYKSPMT GKPVLSQTHV PSWWKKDSTS YDVPTSRLKL
TWAYGFRGHD ARQNAWYNTK GEVCYHTAAC GVVYNPASET QRLITDDPES DDVAEGNSDD
VLCMTRHPER RIFATGEIGG KPKIIVWDSH DVKALAMLQG FHRTAVLCLC FSPCGDYLAS
VGADAEHSVA VYEWKTETLV ATYKGDRNKI LGINWSPFDG SLVTTGVKHV KFVKCAWKEG
EKIAKGTVFR PRRATLGKKG KWQNFYACSF IDVEGKPPRT VVGCKGGQLY VFEGSTLTQV
IPGAHDGKVN AVYTLHEHNV LLTGGDDGVV CFWKASDLSP LHKVRIESAA TGRPAPISSL
TTDAKAKALV GTKVGEIWEI SDQAHLLVES HERGEIWGLA PHPSAGRAKV ATGGDDGTIR
IWNVKPGKKR HCVARARVIE NAKPLKSGGG RYTTAAVRSL AWHPSGGQLA AGTVSGAVSL
FKVDLDDPET TRIDKELDLW RPKLRTGWIA DIKYSPATPD FPSGGHYIAV GSHDNVVDVY
DTMRDHALVG TCKGHASFIT HLGWSQDARY LWTNSGDYEL MFWKMPTATQ VKKGKDVQDV
SWDGWTGVLG FQTTGVWYKG SDGTDVNACE VAIVPGDHGV EERVVATGDD RGIVSLFKAP
ALGGKPRTYG GHSSHVANVR FGPDGSQMFS AGGGDTSMLQ WDVYMPAPVE
//