GenomeNet

Database: UniProt
Entry: G0M6S1_CAEBE
ID   G0M6S1_CAEBE            Unreviewed;      2102 AA.
AC   G0M6S1;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   19-OCT-2011, sequence version 1.
DT   24-JAN-2024, entry version 54.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGT29907.1};
GN   ORFNames=CAEBREN_13087 {ECO:0000313|EMBL:EGT29907.1};
OS   Caenorhabditis brenneri (Nematode worm).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC   Caenorhabditis.
OX   NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068};
RN   [1] {ECO:0000313|Proteomes:UP000008068}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068};
RG   Caenorhabditis brenneri Sequencing and Analysis Consortium;
RA   Wilson R.K.;
RL   Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GL379786; EGT29907.1; -; Genomic_DNA.
DR   STRING; 135651.G0M6S1; -.
DR   EnsemblMetazoa; CBN13087.1; CBN13087.1; WBGene00151812.
DR   eggNOG; ENOG502SD0F; Eukaryota.
DR   HOGENOM; CLU_231162_0_0_1; -.
DR   InParanoid; G0M6S1; -.
DR   OMA; RSEYDLF; -.
DR   Proteomes; UP000008068; Unassembled WGS sequence.
DR   CDD; cd00037; CLECT; 1.
DR   CDD; cd00054; EGF_CA; 1.
DR   CDD; cd01450; vWFA_subfamily_ECM; 1.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR001304; C-type_lectin-like.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR006582; MD_domain.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR47324; PROTEIN IRG-7-RELATED; 1.
DR   PANTHER; PTHR47324:SF2; VWFA DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF00059; Lectin_C; 1.
DR   Pfam; PF00092; VWA; 1.
DR   SMART; SM00034; CLECT; 1.
DR   SMART; SM00181; EGF; 3.
DR   SMART; SM00604; MD; 2.
DR   SMART; SM00327; VWA; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 1.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR   PROSITE; PS00022; EGF_1; 2.
DR   PROSITE; PS01186; EGF_2; 2.
DR   PROSITE; PS50026; EGF_3; 2.
DR   PROSITE; PS50234; VWFA; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008068};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..16
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           17..2102
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003403414"
FT   DOMAIN          334..368
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          781..819
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          1132..1234
FT                   /note="C-type lectin"
FT                   /evidence="ECO:0000259|PROSITE:PS50041"
FT   DOMAIN          1913..2089
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DISULFID        358..367
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        809..818
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   2102 AA;  231473 MW;  DFB497539F123096 CRC64;
     MRLLLALFLL VTAVAAAPSY DSAFQHLPNY YPELKHQDAI HIRAKRAIFA ALGIANSCED
     GWTGDGCKNP ICTDPRPVPT TGQTALIELL FLKGGCGGSY YIPVDSDVGK TTQTIQIHIS
     AAGIPYVNLT DSKGVVFTPT YANNGDGYSL SIYGDLPSGG YSLTIDNQGV PTTECIVEVN
     SDTVLKVTEG FVNSPQSDDT PYGESAVDGV PMYVVAHVDS QQAPAQVHSI TIRQGNSLTP
     VYRAPLTRRY QCGYEYYAGQ WQCQLGNSYY YHVDGIDSNG FAFRRTGKFA CMQHLTSPAP
     PTTTRAPLTT CFNNGTLLNV VNQGQTCFCT ELFTGTQCEK VNCMNAGFPD PDGNECACAA
     GYHGTNCQDV TCPLNWEPYL TDYKTLVVVI RSVTSMNQNL AAISNAVYKE LTSNAANNYE
     VYKGFVLVKF GNGVYTNTYY PAYDQTKFLT DINLASAAAG QCSDATFDSI ASIFTEVAIY
     QKSPIYFFTD AIASDVEKWQ TVIEMNTRQK FPIYTHFFVQ DNCLFDDMSQ GFQAIEYASY
     YSGGLILRPT PDTLQHIFQN VIKATAYKMN SVLIDDLGSC ATPTRVFFVD TSTTEIMILA
     IGESLIVSVT DPNGATTTAL QIVNSGTTQM YEIANPVVGE HLITVVSNVK NTPCSYRVQA
     RSEYDLFIGT STGVNDDASD SEPVVGQSAH IVAQLTGLKK NVADPFRLFS EISITSNVNL
     DNTYQKPMYY SSGKYRDGCG FHMYFGASDF CDFMSQPFYA TVYADDGKGF TIQRTTTGFC
     SGTPTTPYPP NTCQNGGVSD PTNNATCICP PGFYGKYCEN IQCVNGGTAR GGSCVCPVGT
     AGTFCEQYMC TTFNNNPDVS FDGQSIAFVI STRSTMKNAV ATIATNVQTM TRDMQQASDK
     WIDKWILIAV NSNTSYLLVN SNRPADFVAG VNNLNGNFTN YAADETSCQI QIEQAMLGAA
     LLSERRSSVW VFTDSDGPND LNYIQLFDTA QEYQISLNLV GVGSSICTTP ENNGQFPYYL
     KSLSETTLGE VYMTDKLDQI MFFIISLYKS AVSHRYYVPD CKAGTSYYMP VDGWTQSLTL
     AVTGTDLRSV QITFPDGTLG QNSDYELVAI NDPELKLNQY VAACEGSFWN HRKQNCFEFT
     AEKYDWLDGW NMCHSQKAYL LHIDSQDIND FVFSQITGYR VWIGLAFLNG KWYWDVPDNN
     FEQPLSGYTN WADGVDPANP KFNHAVMNAN GKWEPADPAE MNFGGCMKHR YGQGYYPGEG
     ANIVPAGLWK VTVQSNSGSC EIQARSQSEI QVFFGFVTDP RVDKPNTYAN IQSSNNYLIA
     YPTGVYPYTP DTKPSMEGKL NYAVLSSNRT ITNSLPLGNR VCTYATISAP FSCPDTDGSI
     SDFSIKFTGL DQYGYAFERY ADALCTKTVI SCGNGGFINN GVCVCRAGWV GTTCNTPVCQ
     NGGIEKNGKC DCSSVPQFSG QFCELAHCEP PYPTAFNDKD RTLAIVLETS YNMGSSIFQL
     RKNLKASLDS INNDPTLQGW FTNFVLYPFD STSNQASWYP VTVSRNSDDI VAAVKNISTM
     SCPGNAPCSS QCPRPIVSVL QNVLSMDALA SPNSVVLVIT RSSPEDYEQV GRIAQVLQDK
     KAYINFVFPA IDSPCGEGWN NPNVDALYRI ISYSQGNTFT MNPVELSKNF LTQYIPTLYS
     SGGIAASSGN CMNDEIIFQV EHEMYEFSID FYHPLMETIK VFDPSGDQIT IPDNIITSDS
     NYIGIFPVNE TGATRAGTYR ILLTGTGGNN CFATVRGRSN LELFLGFVDS NSDSNNGATN
     DAAHHAPVNL QQNTVVVHAT GLGQGIVRYV QIVMPGFGLL HTTEMRKRDT ECSYEWYATT
     PFQFDYDSYW IVIYGSSEFG SNWKRNFYVS TVGSRPPLPP PPANCDLQSV KQDTLFLIDS
     SLKDTNVTFT ILKQFAVTAM QPYNYVNSLA QVAAGSVADK GYWGFSYNAG ENSFDRVSEL
     LYDMQYIGVN GQNVTAGLQL VLDFFDLPAQ GYRSDPDVRH LLVYVTQTNP TDADPSELVR
     AIKRSGRYEI IVVALDMQPS DQLTNMVAPR CFYYAQDFHD LMNYGVNFVQ GQSCLRWNFC
     NY
//
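
The record above follows the UniProtKB flat-file layout: each line starts with a two-letter line code (ID, DR, FT, SQ, ...), feature qualifiers continue on indented FT lines, the residues follow the SQ header, and `//` ends the entry. As a minimal sketch of how such a record could be read, the Python below pulls out the FT features and checks the sequence against the length declared on the SQ line. The function name `parse_record` and the tiny `SAMPLE` record are hypothetical, made up for illustration; this is not an official UniProt or GenomeNet parser, and it only handles simple `start..end` locations like the ones in this entry.

```python
import re

# Made-up miniature record in the same layout (illustration only, not real data).
SAMPLE = """\
ID   TEST_EXAMPLE            Unreviewed;        12 AA.
AC   X00000;
FT   SIGNAL          1..3
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   DOMAIN          5..10
FT                   /note="EGF-like"
SQ   SEQUENCE   12 AA;  1234 MW;  0000000000000000 CRC64;
     MRLLLALFLL VT
//
"""


def parse_record(text):
    """Return (features, declared_length, sequence) for one flat-file record.

    Assumes simple 'start..end' feature locations and a single record.
    """
    features = []
    declared = None
    seq_parts = []
    in_seq = False
    for line in text.splitlines():
        if line.startswith("//"):  # record terminator
            break
        if in_seq:
            # Sequence lines are blocks of 10 residues separated by spaces.
            seq_parts.append("".join(line.split()))
            continue
        code, body = line[:2], line[5:]
        if code == "FT":
            key = re.match(r"([A-Z_]+)\s+(\d+)\.\.(\d+)\s*$", body)
            qual = re.match(r'\s+/(\w+)="(.*)"\s*$', body)
            if key:
                # A new feature: key name plus 1-based inclusive coordinates.
                features.append({
                    "key": key.group(1),
                    "start": int(key.group(2)),
                    "end": int(key.group(3)),
                    "qualifiers": {},
                })
            elif qual and features:
                # Indented qualifier line belongs to the most recent feature.
                features[-1]["qualifiers"][qual.group(1)] = qual.group(2)
        elif code == "SQ":
            m = re.match(r"SEQUENCE\s+(\d+) AA;", body)
            if m:
                declared = int(m.group(1))
            in_seq = True
    return features, declared, "".join(seq_parts)


if __name__ == "__main__":
    feats, declared, seq = parse_record(SAMPLE)
    for f in feats:
        print(f["key"], f["start"], f["end"], f["qualifiers"])
    # A mismatch here would suggest a truncated or damaged record.
    print("length matches SQ header:", len(seq) == declared)
```

Run against the full entry text, the same function would be expected to report the length declared on the SQ line (2102 AA) next to the number of residues actually present, which is a quick way to spot truncation introduced by copying or extraction.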