Database: UniProt
Entry: R7TI05_CAPTE
ID   R7TI05_CAPTE            Unreviewed;      1096 AA.
AC   R7TI05;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   24-JAN-2024, entry version 48.
DE   RecName: Full=C-type lectin domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=CAPTEDRAFT_220791 {ECO:0000313|EMBL:ELT93458.1};
OS   Capitella teleta (Polychaete worm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC   Sedentaria; Scolecida; Capitellidae; Capitella.
OX   NCBI_TaxID=283909 {ECO:0000313|EMBL:ELT93458.1};
RN   [1] {ECO:0000313|Proteomes:UP000014760}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA   Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA   Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA   Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA   Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA   Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL   Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:ELT93458.1, ECO:0000313|Proteomes:UP000014760}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELT93458.1,
RC   ECO:0000313|Proteomes:UP000014760};
RX   PubMed=23254933; DOI=10.1038/nature11696;
RA   Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA   Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA   Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA   Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA   Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT   "Insights into bilaterian evolution from three spiralian genomes.";
RL   Nature 493:526-531(2013).
RN   [3] {ECO:0000313|EnsemblMetazoa:CapteP220791}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMQN01012764; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMQN01012765; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KB309733; ELT93458.1; -; Genomic_DNA.
DR   AlphaFoldDB; R7TI05; -.
DR   STRING; 283909.R7TI05; -.
DR   EnsemblMetazoa; CapteT220791; CapteP220791; CapteG220791.
DR   HOGENOM; CLU_283896_0_0_1; -.
DR   Proteomes; UP000014760; Unassembled WGS sequence.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   CDD; cd00037; CLECT; 1.
DR   CDD; cd00057; FA58C; 2.
DR   Gene3D; 2.60.120.260; Galactose-binding domain-like; 3.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 4.
DR   InterPro; IPR001304; C-type_lectin-like.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR018378; C-type_lectin_CS.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR000421; FA58C.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR002889; WSC_carb-bd.
DR   PANTHER; PTHR24543; MULTICOPPER OXIDASE-RELATED; 1.
DR   Pfam; PF00754; F5_F8_type_C; 3.
DR   Pfam; PF00059; Lectin_C; 2.
DR   Pfam; PF01822; WSC; 1.
DR   SMART; SM00034; CLECT; 2.
DR   SMART; SM00231; FA58C; 3.
DR   SUPFAM; SSF56436; C-type lectin-like; 4.
DR   SUPFAM; SSF49785; Galactose-binding domain-like; 3.
DR   PROSITE; PS00615; C_TYPE_LECTIN_1; 1.
DR   PROSITE; PS50041; C_TYPE_LECTIN_2; 3.
DR   PROSITE; PS01285; FA58C_1; 1.
DR   PROSITE; PS50022; FA58C_3; 3.
DR   PROSITE; PS51212; WSC; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000014760};
KW   Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..1096
FT                   /note="C-type lectin domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5008787038"
FT   TRANSMEM        1064..1087
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          117..234
FT                   /note="C-type lectin"
FT                   /evidence="ECO:0000259|PROSITE:PS50041"
FT   DOMAIN          239..381
FT                   /note="F5/8 type C"
FT                   /evidence="ECO:0000259|PROSITE:PS50022"
FT   DOMAIN          385..540
FT                   /note="F5/8 type C"
FT                   /evidence="ECO:0000259|PROSITE:PS50022"
FT   DOMAIN          545..699
FT                   /note="F5/8 type C"
FT                   /evidence="ECO:0000259|PROSITE:PS50022"
FT   DOMAIN          693..782
FT                   /note="WSC"
FT                   /evidence="ECO:0000259|PROSITE:PS51212"
FT   DOMAIN          795..894
FT                   /note="C-type lectin"
FT                   /evidence="ECO:0000259|PROSITE:PS50041"
FT   DOMAIN          934..1024
FT                   /note="C-type lectin"
FT                   /evidence="ECO:0000259|PROSITE:PS50041"
SQ   SEQUENCE   1096 AA;  124038 MW;  2E3DDF94935C661C CRC64;
     MKLLLALLMS LGFSRTLQAA RCPAEWTPIA DHCYQLSENG HSLEAAKKKC EERGGNLTDS
     KDEMELASII VQASKTTKKK LNSVNRSTEH REKQKYFCKK KTTVDVKCEA GWHYDRIGRR
     CYMLNEQLLS WDDARTQCMA INGDLASVTS TTDQEFIREL ITNATSSGIW IGGNEIDQTA
     GWTWTDGSPF DFFYWLDGKP DDLTGPEWCI EVVKDSNTKW DDAECSIPLQ SICKKRVTCH
     EELISGEHYV MDWAFYATSV SEDDEAHKSR LDSTYESGGG GWTATGDDFN PYLSFAFTSI
     MKVISVVTQG HASRDEWVTK FIVSYSLTHE DLFEWEFEGN TDRNGKVENT LSESGILAKV
     IRIHTIEHHE AASMRVELLG CLHGVYSNKE ALVSGDIFVH DEKITASSQF SEYHGPERAR
     FDTVREGSFR DSWYAREQAV GEWIQVEFSS LMIVEAVRTR GTSEGSASYW VGSFEIHFSD
     DGQNWEAYQE PYGTVKVHRN NNSTDQVEHY LQDPKRAKFV RLLPLTWNIY IAISFEVFGS
     HYSECSETEL MGGNDRYWSN GEISASSQLD DYHGPHRSRL NEVANGDLSG GWVPLHTNDQ
     QWIQVNLGSV IAIKGVAIQG QHHTVNFVTE FYVSYSEDMD TWSNHMEPED NATTLFHGNM
     DSSHIRRTYF MEGFSARGVR LHPTAWNNNI GIRWDLIGCR GRSISYYGLP CLAKTVIRQH
     NGSKVKFNLI GYPYAGVQYS SHCYCGTSYG TYGPTIGCNF PCTGDVSQRC GGFRANGVMT
     TGLFYKKCPD HWTAMGDNCY RVFTDQRTWE EAGNNCTMHG VGAQLASVNS IEEQQFVEEQ
     LHIFKRDLWI GLTCHNIPYL HSSCEWSNGE RVIHTNWAQG EPNLKNRNDH CVLLNYSEES
     ESKENGVPVH AMINKMLTFV KRRRKRVILY LGLLLMMDVQ QFEQSFVVFS HGERSVSDSF
     WTDLSNMDDI STFKFSNGLL PEELYWAEDQ PSSNVYPTPV ITNASDYGHW ENVDENQNHP
     HICEHERVGF HRRTTVFTAA SDIVDNEMGS FGQKAGNAGM SSGAVAGVVL GMVFVAIIVA
     FVGTFVYKRR LSSNSS
//
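
The FT lines in the record above give each feature as a type, a 1-based inclusive residue range, and indented qualifier lines. A minimal sketch of reading them, assuming only simple `start..end` locations as they appear in this entry: the function name `parse_features`, the dictionary layout, and the regular expressions are illustrative choices rather than part of any UniProt tool, and the excerpt holds just three of the entry's DOMAIN features.

```python
import re

# A few feature-table lines copied from the entry above (excerpt only).
FT_EXCERPT = """\
FT   DOMAIN          117..234
FT                   /note="C-type lectin"
FT                   /evidence="ECO:0000259|PROSITE:PS50041"
FT   DOMAIN          239..381
FT                   /note="F5/8 type C"
FT                   /evidence="ECO:0000259|PROSITE:PS50022"
FT   DOMAIN          693..782
FT                   /note="WSC"
FT                   /evidence="ECO:0000259|PROSITE:PS51212"
"""

# Feature header: "FT", three spaces, the feature key, then "start..end".
# Assumption: only plain ranges are handled (no "<", ">" or "?" positions).
FEATURE_RE = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
# Qualifier line carrying the human-readable note for the preceding feature.
NOTE_RE = re.compile(r'^FT\s+/note="(.*)"\s*$')


def parse_features(text):
    """Return a list of dicts with type, start, end, and note (hypothetical helper)."""
    features = []
    for line in text.splitlines():
        header = FEATURE_RE.match(line)
        if header:
            features.append({
                "type": header.group(1),
                "start": int(header.group(2)),
                "end": int(header.group(3)),
                "note": None,
            })
            continue
        note = NOTE_RE.match(line)
        if note and features:
            # Attach the note to the most recently seen feature header.
            features[-1]["note"] = note.group(1)
    return features


def feature_length(feature):
    """Length in residues; ranges are inclusive on both ends."""
    return feature["end"] - feature["start"] + 1


if __name__ == "__main__":
    for feat in parse_features(FT_EXCERPT):
        print(feat["type"], feat["note"], feat["start"], feat["end"], feature_length(feat))
```

Because both ends of a range are inclusive, a feature's length is `end - start + 1`. For production use, an established parser such as Biopython's `Bio.SwissProt` module is likely a better fit than hand-written regular expressions.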