ID G0M6S1_CAEBE Unreviewed; 2102 AA.
AC G0M6S1;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 24-JAN-2024, entry version 54.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGT29907.1};
GN ORFNames=CAEBREN_13087 {ECO:0000313|EMBL:EGT29907.1};
OS Caenorhabditis brenneri (Nematode worm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068};
RN [1] {ECO:0000313|Proteomes:UP000008068}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068};
RG Caenorhabditis brenneri Sequencing and Analysis Consortium;
RA Wilson R.K.;
RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL379786; EGT29907.1; -; Genomic_DNA.
DR STRING; 135651.G0M6S1; -.
DR EnsemblMetazoa; CBN13087.1; CBN13087.1; WBGene00151812.
DR eggNOG; ENOG502SD0F; Eukaryota.
DR HOGENOM; CLU_231162_0_0_1; -.
DR InParanoid; G0M6S1; -.
DR OMA; RSEYDLF; -.
DR Proteomes; UP000008068; Unassembled WGS sequence.
DR CDD; cd00037; CLECT; 1.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR006582; MD_domain.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR47324; PROTEIN IRG-7-RELATED; 1.
DR PANTHER; PTHR47324:SF2; VWFA DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR Pfam; PF00092; VWA; 1.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00604; MD; 2.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000008068};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..2102
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003403414"
FT DOMAIN 334..368
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 781..819
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1132..1234
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 1913..2089
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DISULFID 358..367
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 809..818
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2102 AA; 231473 MW; DFB497539F123096 CRC64;
MRLLLALFLL VTAVAAAPSY DSAFQHLPNY YPELKHQDAI HIRAKRAIFA ALGIANSCED
GWTGDGCKNP ICTDPRPVPT TGQTALIELL FLKGGCGGSY YIPVDSDVGK TTQTIQIHIS
AAGIPYVNLT DSKGVVFTPT YANNGDGYSL SIYGDLPSGG YSLTIDNQGV PTTECIVEVN
SDTVLKVTEG FVNSPQSDDT PYGESAVDGV PMYVVAHVDS QQAPAQVHSI TIRQGNSLTP
VYRAPLTRRY QCGYEYYAGQ WQCQLGNSYY YHVDGIDSNG FAFRRTGKFA CMQHLTSPAP
PTTTRAPLTT CFNNGTLLNV VNQGQTCFCT ELFTGTQCEK VNCMNAGFPD PDGNECACAA
GYHGTNCQDV TCPLNWEPYL TDYKTLVVVI RSVTSMNQNL AAISNAVYKE LTSNAANNYE
VYKGFVLVKF GNGVYTNTYY PAYDQTKFLT DINLASAAAG QCSDATFDSI ASIFTEVAIY
QKSPIYFFTD AIASDVEKWQ TVIEMNTRQK FPIYTHFFVQ DNCLFDDMSQ GFQAIEYASY
YSGGLILRPT PDTLQHIFQN VIKATAYKMN SVLIDDLGSC ATPTRVFFVD TSTTEIMILA
IGESLIVSVT DPNGATTTAL QIVNSGTTQM YEIANPVVGE HLITVVSNVK NTPCSYRVQA
RSEYDLFIGT STGVNDDASD SEPVVGQSAH IVAQLTGLKK NVADPFRLFS EISITSNVNL
DNTYQKPMYY SSGKYRDGCG FHMYFGASDF CDFMSQPFYA TVYADDGKGF TIQRTTTGFC
SGTPTTPYPP NTCQNGGVSD PTNNATCICP PGFYGKYCEN IQCVNGGTAR GGSCVCPVGT
AGTFCEQYMC TTFNNNPDVS FDGQSIAFVI STRSTMKNAV ATIATNVQTM TRDMQQASDK
WIDKWILIAV NSNTSYLLVN SNRPADFVAG VNNLNGNFTN YAADETSCQI QIEQAMLGAA
LLSERRSSVW VFTDSDGPND LNYIQLFDTA QEYQISLNLV GVGSSICTTP ENNGQFPYYL
KSLSETTLGE VYMTDKLDQI MFFIISLYKS AVSHRYYVPD CKAGTSYYMP VDGWTQSLTL
AVTGTDLRSV QITFPDGTLG QNSDYELVAI NDPELKLNQY VAACEGSFWN HRKQNCFEFT
AEKYDWLDGW NMCHSQKAYL LHIDSQDIND FVFSQITGYR VWIGLAFLNG KWYWDVPDNN
FEQPLSGYTN WADGVDPANP KFNHAVMNAN GKWEPADPAE MNFGGCMKHR YGQGYYPGEG
ANIVPAGLWK VTVQSNSGSC EIQARSQSEI QVFFGFVTDP RVDKPNTYAN IQSSNNYLIA
YPTGVYPYTP DTKPSMEGKL NYAVLSSNRT ITNSLPLGNR VCTYATISAP FSCPDTDGSI
SDFSIKFTGL DQYGYAFERY ADALCTKTVI SCGNGGFINN GVCVCRAGWV GTTCNTPVCQ
NGGIEKNGKC DCSSVPQFSG QFCELAHCEP PYPTAFNDKD RTLAIVLETS YNMGSSIFQL
RKNLKASLDS INNDPTLQGW FTNFVLYPFD STSNQASWYP VTVSRNSDDI VAAVKNISTM
SCPGNAPCSS QCPRPIVSVL QNVLSMDALA SPNSVVLVIT RSSPEDYEQV GRIAQVLQDK
KAYINFVFPA IDSPCGEGWN NPNVDALYRI ISYSQGNTFT MNPVELSKNF LTQYIPTLYS
SGGIAASSGN CMNDEIIFQV EHEMYEFSID FYHPLMETIK VFDPSGDQIT IPDNIITSDS
NYIGIFPVNE TGATRAGTYR ILLTGTGGNN CFATVRGRSN LELFLGFVDS NSDSNNGATN
DAAHHAPVNL QQNTVVVHAT GLGQGIVRYV QIVMPGFGLL HTTEMRKRDT ECSYEWYATT
PFQFDYDSYW IVIYGSSEFG SNWKRNFYVS TVGSRPPLPP PPANCDLQSV KQDTLFLIDS
SLKDTNVTFT ILKQFAVTAM QPYNYVNSLA QVAAGSVADK GYWGFSYNAG ENSFDRVSEL
LYDMQYIGVN GQNVTAGLQL VLDFFDLPAQ GYRSDPDVRH LLVYVTQTNP TDADPSELVR
AIKRSGRYEI IVVALDMQPS DQLTNMVAPR CFYYAQDFHD LMNYGVNFVQ GQSCLRWNFC
NY
//