ID A0A2C9JE73_BIOGL Unreviewed; 2241 AA.
AC A0A2C9JE73;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:BGLB001284-PB};
GN Name=106054935 {ECO:0000313|EnsemblMetazoa:BGLB001284-PB};
OS Biomphalaria glabrata (Bloodfluke planorb) (Freshwater snail).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Gastropoda;
OC Heterobranchia; Euthyneura; Panpulmonata; Hygrophila; Lymnaeoidea;
OC Planorbidae; Biomphalaria.
OX NCBI_TaxID=6526 {ECO:0000313|EnsemblMetazoa:BGLB001284-PB, ECO:0000313|Proteomes:UP000076420};
RN [1] {ECO:0000313|EnsemblMetazoa:BGLB001284-PB}
RP IDENTIFICATION.
RC STRAIN=BB02 {ECO:0000313|EnsemblMetazoa:BGLB001284-PB};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_013066475.1; XM_013211021.1.
DR EnsemblMetazoa; BGLB001284-RB; BGLB001284-PB; BGLB001284.
DR VEuPathDB; VectorBase:BGLB001284; -.
DR OrthoDB; 2969310at2759; -.
DR Proteomes; UP000076420; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 6.
DR Gene3D; 2.10.25.140; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 18.
DR Gene3D; 2.170.300.10; Tie2 ligand-binding domain superfamily; 3.
DR InterPro; IPR005533; AMOP_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR003886; NIDO_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR24034; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24034:SF194; NIDOGEN (BASEMENT MEMBRANE PROTEIN); 1.
DR Pfam; PF07645; EGF_CA; 15.
DR Pfam; PF06119; NIDO; 1.
DR PRINTS; PR00011; EGFLAMININ.
DR SMART; SM00181; EGF; 30.
DR SMART; SM00179; EGF_CA; 16.
DR SMART; SM00180; EGF_Lam; 8.
DR SMART; SM00539; NIDO; 1.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 4.
DR SUPFAM; SSF57184; Growth factor receptor domain; 6.
DR PROSITE; PS50856; AMOP; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 9.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 10.
DR PROSITE; PS50026; EGF_3; 8.
DR PROSITE; PS01187; EGF_CA; 6.
DR PROSITE; PS51220; NIDO; 1.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 249..396
FT /note="NIDO"
FT /evidence="ECO:0000259|PROSITE:PS51220"
FT DOMAIN 395..563
FT /note="AMOP"
FT /evidence="ECO:0000259|PROSITE:PS50856"
FT DOMAIN 574..782
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1034..1072
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1243..1283
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1325..1365
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1496..1536
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1662..1702
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1748..1787
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1833..1875
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1876..1917
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
SQ SEQUENCE 2241 AA; 246715 MW; 8F6F0ABFB7A23DFB CRC64;
MDYQYLVVSE ESLFHRARRQ ALEYISGYST YRSKFTLGNR EMTSTESQDG TGEFLRLRSS
ADDYISSTLQ QAFPNRPIQI GITKLSTPHT TVEFEIQIHD DVDLNDTRKQ LAKALDALDK
ADSFQLDSRT YSVEEVMLGK EGGDDSTLEY TICILCSDYQ TCTQTNGVWT CQLSDPRLYS
FGSVQQDSSM VKNVDYATVQ LKVPDAITWG EEEVSNIWVS VNGFISLDSQ FVSYIPRRLP
MNSQKLLAVY WSDLELKSGD VGEVYYQIYS KYGRKYDPNI FKKANEDVKS YTGDESYDAT
SVIVVTWSDM APYPLYRSET ERVTFQCTII TDGTTTYAVY HYGHGAMRFN AQLRRPVEAG
WGGINLDSSR VNYYNFDQNL GNTGVVGKWF FQIGRRENYK AKCLNWYYKN LQDYENIRFW
DIVLPKCPCN EFFVWATGWW QMSNQNGVSC YESYNSYTPY GRVCCYRLDD WQLNWSSLSW
MFSAFGRTFG TFENRMPFAG SLQRNSPYYR GNYNYGYFNN PMFDTNRYST QTIFAYMHEV
DDLEPKQWCC YQSNLCHLYY QVRPASNCIG PNVAFGIGFG DPQINTLDNK WFIFNGLGKY
RLLEITGKHP KNTSLSVDFK LQGRTCKAVT SSGESTNATV WCALALKTTS GNTTKIEISE
TGNNMIIYAN GQDYSLRFRN SLNFSEVNKD IYLRKDSITE SLMISTADGV GLTVSLKNKI
LVYTLDVDEM YKSMTRGLLS NFNDDPSDDF IFPNSTKLSN NASDRQIYDY GQTWAVTQDS
SLFDYIYGQN TSDPSFMPLF LDQLNTTEAV IVCNDATHIA CIFDYAVTRN PGIALLTKIT
VDVFANQVKL VGNSAPTITG NRDYNVTINQ TLNIVLNCTD PDKDNLSVVV LSKPNQGFTY
SLIDGQLSLS YTPSKITDES IELVVHDSAG LDSGVLKLNL TMCSGCSSHG HCDFADRLTV
NTPYKAIARC VCDVGYTGDD CELDRDGCLM EPCPDQTKCQ DLDVATERAT GLSFKCTDCP
KGYKLVNNDT KCEDIDECQN TSNACPAHAN CQNTIGSFEC NCGQGFRKYN GQCIDINECA
EYQDDCAQIC INELSTFTCD CYEGYRKYGI IGHDCIQRED VCKGLNLTCE YGCTNQSGVA
ECFCQHGKKL ADDKRSCIDI DECQLNLCPQ GCQNINGGFI CTCFDGFHLS ETDNLTCEAC
SDDKYGSDCK KLCYCRGRAM DCDAVRGCVE CDSGWTGETC SQDVDECAAS ATTCPQDQIC
TNTNGSYICS CPTGYELTNG VCENINECVS IQTSLCSHVC IDTPGSFRCQ CEVGFKSSTN
GTCIDIDECS YSTSGCQHKC TNVESSHNCE CFPGYRLLED RRTCQKVVSP CQNLNISCSF
GCSLINSKAQ CFCPLGYILG SDNYTCYDID ECTVQGDHED KCTDNCLNTP GSYNCSCPIG
KALLPDQRTC EVCDAYHWGE DCSQDCACYP IGSDRCDPVV GCICKTGWAG VHCKTDVDEC
QSSKSECTAP SICVNLPGSY KCECPNGFQN VNNICVDIDE CKDTKTCDHN CTNAIGSFHC
TCFEGFKVDG AKCLDIDECA VPSLSKCDQL CRNAPGGFAC ECLDGYSLNM TTRTTCVLSE
TAKACNSSSL CNQICTVVDD KDVCSCHLGY KLNSSNNVTC MDINECSGSN PCSSGQCNNI
DGSFYCTCPA GYKLTSDQLT CQECEEGYYG TNCNSSCQCT STNTVSCNTT DGSCVCKPGW
QGQDCSVDIN ECSTNGVCYN NSQCINSQGS YKCVCATGYL STGSSCNACD VNRYGQDCAQ
TCTCNFANTL DCHHTTGQCN CKPGWEGVNC DQDINECSNS SYCSGSFVQC INLNGSAECR
CLSGYERPTN SSTCQDINEC ENSLLNNCLT PSVCNNTWGS YECVCMKGFH NVSGQCSACD
SLHYGTNCAM DCQCHVNNTL DCDDINGTCT CRHGWTGQKC DLDVDECTQN ASFCSSINET
CTNLNGSAEC LCNIGFYKAA FNESCRACGS LYYGLNCASQ CSCNATNTED CNDINGTCSC
KSGWTGPICN QACNSTQYGP NCASQCLCNA TNTEDCNDVN GTCSCKSGWT GDNCNQACNS
TQYGPNCASQ CLCNATNTED CNDVNGTCSC KPGWTGPICN QACNSTQYGP NCASQCLCNA
TNTEDCNDVN GTCSCKPGWI GANCNQACNS TQYGPNCASQ CLCNATNTED CNDVNGTCFC
KSGWTGANCN QGKCISILNQ L
//