ID A0A2C9JLT8_BIOGL Unreviewed; 3104 AA.
AC A0A2C9JLT8;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=VWFD domain-containing protein {ECO:0008006|Google:ProtNLM};
GN Name=106059342 {ECO:0000313|EnsemblMetazoa:BGLB004509-PB};
OS Biomphalaria glabrata (Bloodfluke planorb) (Freshwater snail).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Gastropoda;
OC Heterobranchia; Euthyneura; Panpulmonata; Hygrophila; Lymnaeoidea;
OC Planorbidae; Biomphalaria.
OX NCBI_TaxID=6526 {ECO:0000313|EnsemblMetazoa:BGLB004509-PB, ECO:0000313|Proteomes:UP000076420};
RN [1] {ECO:0000313|EnsemblMetazoa:BGLB004509-PB}
RP IDENTIFICATION.
RC STRAIN=BB02 {ECO:0000313|EnsemblMetazoa:BGLB004509-PB};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_013072378.1; XM_013216924.1.
DR STRING; 6526.A0A2C9JLT8; -.
DR EnsemblMetazoa; BGLB004509-RB; BGLB004509-PB; BGLB004509.
DR KEGG; bgt:106059342; -.
DR VEuPathDB; VectorBase:BGLB004509; -.
DR OrthoDB; 5398470at2759; -.
DR Proteomes; UP000076420; Unassembled WGS sequence.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00112; LDLa; 4.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 4.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF398; MUCIN-2-LIKE-RELATED; 1.
DR Pfam; PF08742; C8; 2.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF00057; Ldl_recept_a; 4.
DR Pfam; PF01826; TIL; 1.
DR Pfam; PF00094; VWD; 3.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00832; C8; 2.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00192; LDLa; 4.
DR SMART; SM00216; VWD; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 4.
DR SUPFAM; SSF57567; Serine protease inhibitors; 1.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01209; LDLRA_1; 2.
DR PROSITE; PS50068; LDLRA_2; 4.
DR PROSITE; PS51233; VWFD; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..3104
FT /note="VWFD domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012090096"
FT DOMAIN 103..142
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 212..397
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 591..759
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1028..1203
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT REGION 2074..2810
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3037..3104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2074..2283
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2291..2339
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2347..2395
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2403..2451
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2459..2479
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2487..2558
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2565..2647
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2655..2675
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2683..2810
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3037..3078
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3083..3097
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 107..117
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 132..141
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1402..1414
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1409..1427
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1421..1436
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1445..1463
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1457..1472
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1496..1511
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1514..1526
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1521..1539
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1533..1548
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 3104 AA; 328709 MW; B122DBACD09ED841 CRC64;
MALFRRSTLV LLGLTLWLRI WSTSATSHFC ERQWEKTVVE QHPCCEFFGW ITGMNGYQLT
LVWPKTGEKT IDPNDCVPMC NYTRVEHVTN TSCCNGWAGN DCDVPVCDPP CSNGGQCVET
ETWAAALPKC SCPDPYTGRA CEENINSLKS NLKYCYVNSN CQGKLVRAEA MEISNCCTEG
FTGSWGSQLA NYGCASCNPT ASVKVNNTLD VATCMTSGDD VYRTFDGVLF NYLTLCAVGL
VVTPQLEIYT VTECDPLDKC TCSKVVTIIV RGDPTITYTL EGVFLTKSNG TSHETFDASK
LTDNVPSAVD KLGSVIWRYH VDQKTVYITL PAFSLETRLE VDGTFMITIK KNSVLRNSLA
GVCGDMDGYT DDEIGLQTQV GAERVFEKYK NTNLPCGKGV SKCSTSEDIT KATEACQAIN
TVFYRCHNEV DPDDYIDRCR AYFCTSLSAG GLEAAKKAAC NVMSTYQKVC TLVTGEAITW
RSRTLCPKTC HQPFQFNGLI TNKCPLTCGA PLYSYTRSSC LTQPYGGCEC PTGLARINNT
CVAPEACQCQ GEDGHFYNNG DQIISGDYCL ECTCGKFGLW ECNESPERCS ATCSILGGSY
ISTFDGRMFG IDNICPELTL VQAANVTIKL ESSTSYLDEN GDTISPSTIT IDYQGVTSSI
KVQKTGATIE NGFSPLIYIR QVGRDFYVVD IKGGQIRLEI FKDGTILLKM KTTIYKGKVL
GICGNMDRNK DNDFLSPSNS LMDSAQFLGF YSKCSSDTKF KELTQSVMNP VCGKINTAPL
PPNSRVDIDS FVKLCNNVPT EALRCNVLKS FSLTANVDFV AIFNDISACS GSTCLRKQIN
ICDAHCKDHL FTNCEKEFVY SCGCAEGEYF KDDGTCVSKN QCGCFDFTRP NEVIGPHAEF
MHGCRECVCK GYELQCSNSC EEVICANSQV SRNALLSKST NSTCLRKMCP KPFYATEECI
NVEASSQLCF CADGLKQTQS GSCVPKCPCY EGGKWYNDGE RYEQNCQTKV CRDGVFEFDS
EHTNDCTGIC VLTGSSMKLK PFDTTTLTDY SITGMCEYYV MKIGSDHAVK IKPVLCGSKK
TACMNMITIE TSYLKSPIIL KSVNPGVVLV GNREYSNNVG PNIKVINTNY YLAVIFGDLF
TIFWNEGLTL RIEISSKLLN QTSGLCGNFN LDASDDRKGS DNLQKETLSK LAKSWIVNQD
ECTSTEEQTV GASCEGSNRE TWAETTCKII LEGDAFSDCR KLNADVRSFY DNCVVQACNC
DTGGDCECLC DAIAAFAAHC NELGAPGRWR HQRLCPMQCD YGSVYDPCGD ACPETCGKPL
NTSSSICSSL TCVEGCFCPP GYVRKDLDSL KETICIPKPE CPCVDENGRE IFQGQTVTID
CQECECKDGE LKCTGQVCNV TCKEDEYKCG DGHCISNSFK CNGVPECIDG SDEFECIDVC
EGYLCKNGQC ISNTSVCDNK IDCLDNSDEA LCDQKCDENE YKCPESQFCI NQDYLCDGEK
DCWYGEDEDN CTVCPHNETH CNQTYCIPKD YHCDGHDDCG DGTDEIGCTY PTTTKIPGCP
YNTVTFDSPG VDFSVQSSNG IGNSAFTTHG WSPEVGKLGY FTITIHSSTP ATLMEIAFNL
YNGQGTALKI EIETEQGTKI LEMTQITKDK VTFDKVYNIE FLAITIVTFG TISDLLFEVC
YTPVTTENTS PETTPEFTTL TTIAECQESL VSLTPQYFGP TSSNVFRPTQ GETHIVITLK
PQNMVEVLDL ISVSLQLIDI EQIQIIVYKS DGSKYNDQVY AKDSDKLTYF IKDGKDVQYV
EMYLEFNEYK FNTAQAVYIG AEGCIKVKNC SLGYCNSQCL DEFDNICTSD CPPSNCFTTP
STTTVTIITP TPSTPCPFDM VPVPVLNKEP PFSTNSAYNG NPQVLVVIDN SDYTLQSIDA
GPDGKVLELL MTNGIQGVSG TQFAFIFTNS PIPEGNYYWK EQATVYQTMQ HGNTIIIFVF
TIPTGIHLSV PNGAQAFIVK DAHTVSITLT SDIVFVLTKP DYQVYISICK VHDTTTSLPE
TPSSAEFPSI PETGITNIES ASVSQSVGSS VPSFSSVSPF SESASTSTPI SGPSITGSES
ESSITPVSVS GSSGSTPSVS PQSESQSSTT PMSGPSISGS SGSPPSESES STTPISGPSI
SGSSGSPPSE SESSTTPISG PSISGSSGSP PSESESSTTP ISGPSISGSS GSPPSESQSS
TTPISGPSIS GSSGSPPSES ESSTTPISGP SMTETYGSPP SESESSTTPI SGPSISGSSG
SPPSVPPTSK PESSTTPVSS PSITGSSGSS STVPPTSKPE SSTTPVSGPS ISESSSSPPS
VPPTSKPESS TTPVSGPSIS GSSGSPPSVP SQSESQSSTT PISGPSVSGS SSSPPSVPPT
SKPESSTTPV SGPSISESSG SPPSVPSQSE SESSTTPISG PSVSESSGSP PSVPPTSKPE
SSTTPVSGPS ISESSGSPPS VPPTSKPESS TTPVSGPSIS ESSGSPQSVS ESSTTPVSGP
SISESSGSPP SVPSQSESES STSPVSSPSI TGSSGSPPSV PPTSVPSQSE SESSTTPVSG
PSISESSGSP PSVPSQSESE SSTTPISGPS VSESSGSPPS VPSQSESQSS TTPISGPSVS
GSSSSPPSVP PTSKPESSTT PVSGPSITGS SGSPPSVPPT SKPESSTTPI SSPSITGSSG
SPPSVPPQSV SESSTTPISG PSITGSSSSP PSVSPQSESQ SSTTPISGPI VSGSSGSSPS
EPETSTTPIS VPSISGSSGS PPSVPPSSET ESSATPLSTT PGPLVSVPTG SITTSKPCPL
HLNSVSPENI FSPISLNGST SLDNYLYSIT SSKEYIGNIS ITPGKGYLLV PIAFYSVNFI
GDDSFKAVYF TSSITENFET EVQNFAEKLK AEVFYLIAPN TAIILFWAVS GTYYPELSTS
SRPSILELMD ITEDKLYREY VSPVYVWAPP GSIVNIEKCE EYTTSKATTI PVSTESISSS
LVQSYSTVQS TTQFTTITTP LTSSTITESL SASLSTTESS ITGSTPPVSS FTTVPSITES
TATTTTPESV TTPSTTGGGD RDGGCGEDAE TGAEEPKDSG ICNR
//