ID A0A2C9M8M7_BIOGL Unreviewed; 1856 AA.
AC A0A2C9M8M7;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:BGLB039806-PA};
GN Name=106063549 {ECO:0000313|EnsemblMetazoa:BGLB039806-PA};
OS Biomphalaria glabrata (Bloodfluke planorb) (Freshwater snail).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Gastropoda;
OC Heterobranchia; Euthyneura; Panpulmonata; Hygrophila; Lymnaeoidea;
OC Planorbidae; Biomphalaria.
OX NCBI_TaxID=6526 {ECO:0000313|EnsemblMetazoa:BGLB039806-PA, ECO:0000313|Proteomes:UP000076420};
RN [1] {ECO:0000313|EnsemblMetazoa:BGLB039806-PA}
RP IDENTIFICATION.
RC STRAIN=BB02 {ECO:0000313|EnsemblMetazoa:BGLB039806-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_013077397.1; XM_013221943.1.
DR STRING; 6526.A0A2C9M8M7; -.
DR EnsemblMetazoa; BGLB039806-RA; BGLB039806-PA; BGLB039806.
DR KEGG; bgt:106063549; -.
DR VEuPathDB; VectorBase:BGLB039806; -.
DR OrthoDB; 2969706at2759; -.
DR Proteomes; UP000076420; Unassembled WGS sequence.
DR CDD; cd00112; LDLa; 4.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 4.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF401; INTESTINAL MUCIN-LIKE PROTEIN ISOFORM X1; 1.
DR Pfam; PF00057; Ldl_recept_a; 4.
DR Pfam; PF01826; TIL; 1.
DR Pfam; PF00094; VWD; 2.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00832; C8; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00192; LDLa; 4.
DR SMART; SM00216; VWD; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 4.
DR SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01209; LDLRA_1; 2.
DR PROSITE; PS50068; LDLRA_2; 4.
DR PROSITE; PS51233; VWFD; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1856
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013310901"
FT DOMAIN 101..142
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 212..403
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 590..763
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DISULFID 105..115
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 132..141
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1252..1264
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1259..1277
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1271..1286
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1294..1312
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1306..1321
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1343..1358
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1361..1373
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1368..1386
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1380..1395
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 1856 AA; 204846 MW; C74D24F497142D76 CRC64;
MPLTALLIGL TSWLLAFAPL ATADSYFCDQ EWEQTIYEQH PCCEFFDWMI GKYNYSLTLD
WPKNSTQKVS LTDCKPVCNY YRKVHSTNKT CCPGFQGIYC DQPVCDPACI NGGQCKALGG
SLVQNPEPEC VCPPGYGGQA CQEDINALTT ELMYCYQDDV CNGQLVSSQV MNISNCCTEG
FAGSWGSQMS TYGCTPCKIE HPVKINNTLG YSTCMATGDS TFRTFDGVLW HYRTPCAIDM
VLTPQLKLSA VTVCDPFNKC ACSKAVTINI TATPPRIYTL RDQEMTIEDG TTVKLNASEL
VDNTITPADS HGLINWKIDK IRNVMHLTIT QYGLEMKLES DGTCMVTLRK DSILNGNLDG
ICGDNDGYIN DELTLQNPIV AERTITKYKN DIIPCGQGLE KCKTPENEVA AQMSCKAINT
LFYPCHSKVP PQDFLDYCYD AYCTSLQAAG EVEAQRAACN VISTYHTTCF LETGQAINWR
SKTLCPKTCK SPFEFNGLIT NNCPLTCGLP LYAYTRDICR TTPYSGCICP TGMARLNDTC
VAPNQCQCTG DNGQYYNHGE TVISGDKCEQ CKCGDFGLWS CQPSLQSCTS VCTVLAGQHV
TTFDSRHFVV DGDCPKLKIV QSNDPTIDVS VTLELLAPSF LAEDGESLIS PSKVTVTYQG
NTASVTMKES GVTVDNGFTP NVYARQIGQD VYALDFQNGL IHVKLFRNGL FFLRMKSVFK
SKVIGICGNM DDNKNNDLIS PSNSLMDPKE FLRFYSECSD PTFKELTQTP MNSICQQISD
VPLSPNSRVD SDSFITLCNN VDEKLRCMVL ELFGLLAHID YTTSFQNTSC GAMLCQRREY
DICDINCRDG NSVKCEKERM FGCTCGEGQY YKHDETCAPR SKCGCYDMEF SSDVIEPGAV
YERRCHECVC TETSEIKCSE FCEEIICANS QVARSTLLEG QTNTTCLRKM CPKPYFTNEE
CINVLTSKQL CYCSEGYKQT QTGQCVLKCP CYEGGQWYAD GYEYIMNCQR RVCHDGIFEY
ETSDSLECTG TCVLTGSSMK VKSFDATTLS DYSISGSCNY WAVKSDLTCS VIVKAIQCGS
KDTPCLYECT IKSPFIRDNI ILKSADPGAV MVGDREFTNF VGPNITITNI GIYMAISCGD
SMTILWNGAI QCDGGSVYEA CGNTCTETCG KSVNSSSCTS LTCVEGCFCP SGYLRSYDIK
SQDITCVLKS ECPCIDDRGR ELFPGQSVTI NCQECECMDG DIVCTGKVCE KCDITEFTCN
NGQCIDGEFK CNGYLDCLDG EDEFNCTECN GFYCDNKECA PLNSSCDGKI DCSDGSDEVL
CECNSYEVKC TQSNMCIFKD FVCDNKVDCL YGEDEHNCTI CPTNFTHCNQ TYCIDKNFMC
DNHDDCGDGS DEIGCTTTVP PTTTLEECKK TKATLDNPAK GIYPNSETGI ANDVFTSDGW
SPGTGKKEIL NLVLDSQSDA VLERIEFVVR NASGTTLELN VNTKESTNNF LTVIVSKDPF
TYAQPFNDNF SVINIASQGI ISNLKLDVCY TPIEVTTPVS KTTTTVAPTT STTPKAKECE
EGVRLSDAHL EYVKNITRDN ITTEDGKDII AVQLRPSSIV KTLNVTELLF RLNLIDSMNI
TIFKRNGSSV SEVVDTKNKT LAEMIVYNPV DGNDAHSILI EFYTQKLNLA GATFSGASGC
VTEETTTLTT PSSWFTTPKE CEEGTQLVVD NHLRPVTNIT EETPKDGRDF LEFQLHTSLQ
VKTLNVSELM FHLSFIDSMN VTIIKRNGSV VSEVVDIKNK TYKDTIIYKP FDGNDINIIN
IVFYFDEPNR NLSRVSYVGA KGCGTVTNCS LGYCNGTCLD QYDNLCLSPC PSLDCC
//