ID F0Y4F1_AURAN Unreviewed; 2287 AA.
AC F0Y4F1;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE RecName: Full=C-type lectin domain-containing protein {ECO:0000259|PROSITE:PS50041};
GN ORFNames=AURANDRAFT_62669 {ECO:0000313|EMBL:EGB10405.1};
OS Aureococcus anophagefferens (Harmful bloom alga).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Pelagophyceae; Pelagomonadales;
OC Aureococcus.
OX NCBI_TaxID=44056 {ECO:0000313|Proteomes:UP000002729};
RN [1] {ECO:0000313|EMBL:EGB10405.1, ECO:0000313|Proteomes:UP000002729}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP 1984 {ECO:0000313|Proteomes:UP000002729};
RX PubMed=21368207; DOI=10.1073/pnas.1016106108;
RA Gobler C.J., Berry D.L., Dyhrman S.T., Wilhelm S.W., Salamov A.,
RA Lobanov A.V., Zhang Y., Collier J.L., Wurch L.L., Kustka A.B., Dill B.D.,
RA Shah M., VerBerkmoes N.C., Kuo A., Terry A., Pangilinan J., Lindquist E.A.,
RA Lucas S., Paulsen I.T., Hattenrath-Lehmann T.K., Talmage S.C., Walker E.A.,
RA Koch F., Burson A.M., Marcoval M.A., Tang Y.Z., Lecleir G.R., Coyne K.J.,
RA Berg G.M., Bertrand E.M., Saito M.A., Gladyshev V.N., Grigoriev I.V.;
RT "Niche of harmful alga Aureococcus anophagefferens revealed through
RT ecogenomics.";
RL Proc. Natl. Acad. Sci. U.S.A. 108:4352-4357(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL833124; EGB10405.1; -; Genomic_DNA.
DR RefSeq; XP_009035205.1; XM_009036957.1.
DR EnsemblProtists; EGB10405; EGB10405; AURANDRAFT_62669.
DR GeneID; 20224001; -.
DR KEGG; aaf:AURANDRAFT_62669; -.
DR eggNOG; KOG4297; Eukaryota.
DR InParanoid; F0Y4F1; -.
DR OMA; PWHASAR; -.
DR OrthoDB; 2879388at2759; -.
DR Proteomes; UP000002729; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00037; CLECT; 4.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 6.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR018378; C-type_lectin_CS.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR22803:SF124; C-TYPE LECTIN DOMAIN FAMILY 19 MEMBER A-RELATED; 1.
DR PANTHER; PTHR22803; MANNOSE, PHOSPHOLIPASE, LECTIN RECEPTOR RELATED; 1.
DR Pfam; PF00059; Lectin_C; 4.
DR SMART; SM00034; CLECT; 6.
DR SUPFAM; SSF56436; C-type lectin-like; 6.
DR PROSITE; PS00615; C_TYPE_LECTIN_1; 2.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 6.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002729};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1019..1042
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1088..1111
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 17..156
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 200..321
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 365..473
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 641..742
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 871..999
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 1174..1350
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT REGION 840..869
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1888..1909
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2287 AA; 234153 MW; 5796274813A124A1 CRC64;
MARPRVVAAA AATLGVAGAH NFILHEAPLS WPDAAVACRA AGGRLAKVED AAEQAVLATT
MGSQAVWIGG SDGATEGVWA WDAEDGGVLP PSSGATQAFE NWLTPDRAAA FPGCGGHAGE
SEPNDWGAGE DCMGAYEHCG LWFDANCADP RPYVCEHPEA AARAACPEGW RGVGGRCFGL
AGSGPRDECD ARCAAALPPT WPAAAPACPA NEAEFAAVST DGRFAAAYEC CNWTPGTKDT
SCCAWLGLRQ DPDADDGWSG DGAAGWDAGC ASADRYGYDG WAAGEPNQFL GLEEDCAAFG
VWGSMRWYDA PCGQSFRCLC EVELAGAAAG AAGAAAAPTA APTPRAVEAA PDASCPLAFV
PAAGTWADAE ATCVSLGGHL ATLPDAAASA YASAHFVRAR GGCRPAWIGY NDREEEGVWA
WADGSVGGYE RWWPGEPDDS ANGARGADCA SMGVDCVHGW RGVADEWVDS GCDAAGADYT
PRGALCQLRP LAAGDCAGAG GAGGGVCSAN PCPAGVCAGT FTCDDAARFG YTCGMLEAAG
YGCGGCACAA DAAAAAAEAA WAAPDADDGE AAVVESQCHA AACGEYGSGG VDDDCCAAPD
DAFCAEGYVY AAGVVGCGRG FADGRVSTCC IDPAAPAPGV DADAHCVALG GHLATVPDAN
VARWIDERFA RPGGECVPLW IGLSDATREG AWQWADGSPA SWQHWWANEP DDAGGSENCA
SLGADCFYHV EGVGDRWTDS GCGPGDFEYR ARASVCQLRP LVDGDCVPEA VADAEAAAPE
AAAPEAGGCP FNACPLPDLC GEAFTCDAAA VSGFSCETLE AEGFDCAGCA CPLDGVDPAD
RAAPDPGRGG GGARGGRDRD RGPEARSASA SGGKTLVFFG RPLAWADARA ACRGDGGDLA
GLRDADDQAA LERVAPAAPV WVGGAGAGDA WRWSAGGAVD GFERWATADR AAATPGCGAW
AGADADEPDE GAAACMAVHG GCGLWFDHPC DGKKAYVCEY ADDAGGGEKG PEAAAGGGMM
AVVVAVVIVL ALLGGGAAAY AAKRRRGHAA RGYGDVARGY EPDSDLPSYG DYVLNPISLK
QQTMTMRLLV AALPLADAAT MTWTAGAVGY FDVANNWDGG EVPGSSDTMV LADGTVYLAG
DATINNLKMS GGELVVGSSE CPDGWSTTLT FDGCVRAYAT PKTWLEAEEV CHAAGGTSNG
GYRGHLALIG DDETNQLASL LCAAATNATS LRLENSDAES RGEEFAPGAE PCWIGITDSY
ANASGAAPPR EQVSRPDAGG AAADDGAWAW ADEAAVLYDP LYRSWARREP LFRGGANCGA
LLHRGYDPYS PPRPRWRTHL CSERLPFVCT RRGATTSYDL TVANTFTFTG GAVSGGGVVT
PKKMSFTAGM TPELVNAGLV VAQSISMGSD ARLYGSLGAW IEVTSAGSWG VSGDALLGGY
GARVLSEGAM RITSNGVSEW AIHASGTLRC NGYGNANGGG DLSRATIVLA SSSSRFVANG
RSLAFRTPPV DLVDVSADAP VAREDVLYAK ELTDEPGEAV FGTYAFGQGA LESAFGALDD
AMPLTVSAYK IDGFSDHKNG DGELLAPARG SYRLAAGADE SVCIPWHASA REFQDAVNGL
PDVAAVGGAT VTRHGDGGRR WNFGFRYEVA YDRAGTATVG TLSLSCAGSA NGCGCFDAVS
RQYAKPTGWC KHGIAEYAKA ETGRLCLTDV AVTVDRVVVG GETTFSDFGT ASVELAEGFH
ALPATLVPPL RVTGARARGV TASAATAYQK LYMEAGSLTF AGPGYAGDDA AALLFGAPDK
FFRGALGLAR WSENRDRDFA ATVALKSEIN GGDVRIANPR LADQAAAGGG SSASLSFQGV
VTWTGNGGFR GNGKVALQST TTITTCVSRS GDGDECPRGS RGDGAPDRFQ PAHLRDAVVV
EIDDGSWSQG DLLMGEGATL TVAQGLAVDD DASDRSPRIF ATARDATPGR EDDARNGWYS
NPTCGDLCQA SPTLRVDAAG EVDVAASAAI FEATLFNRGE FKVASSGAAT LGDGGGGDGD
FVVAGTLTQR GGVLDLERGA LSGSGSVGVS GGRVLLPGTV APSVSVSGGE AAIRGRRADL
SSVAISNGTF KVLADAANVT LSSDLIISGG ALRFPERDSY AATHRNNLHS GTRNLDRGAM
VVHGHLNWTD GAISGNGDVN LMASSSVRSG FIERFARVVN HAAMFLEDGA VVAETEGYME
NRGTVEMREP RSTYAGMFGR RPNPREAQPV LQYDWEDNMY FYNGKRVMVV GDDSIRDASL
GVSGNYD
//