ID A0A267GKY7_9PLAT Unreviewed; 2145 AA.
AC A0A267GKY7;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=VWFD domain-containing protein {ECO:0000259|PROSITE:PS51233};
GN ORFNames=BOX15_Mlig013356g1 {ECO:0000313|EMBL:PAA86064.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA86064.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA86064.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA86064.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA86064.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA86064.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01000296; PAA86064.1; -; Genomic_DNA.
DR STRING; 282301.A0A267GKY7; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR CDD; cd00112; LDLa; 1.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 2.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF401; INTESTINAL MUCIN-LIKE PROTEIN ISOFORM X1; 1.
DR Pfam; PF00057; Ldl_recept_a; 1.
DR Pfam; PF01826; TIL; 1.
DR Pfam; PF00094; VWD; 2.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00832; C8; 1.
DR SMART; SM00192; LDLa; 2.
DR SMART; SM00216; VWD; 2.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR PROSITE; PS50068; LDLRA_2; 2.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS51233; VWFD; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; Reference proteome {ECO:0000313|Proteomes:UP000215902};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..2145
FT /note="VWFD domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012831381"
FT DOMAIN 391..571
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 862..1029
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT REGION 206..245
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1307..1327
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1978..1997
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2036..2073
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2103..2145
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1310..1327
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1978..1994
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2106..2132
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1257..1272
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 2145 AA; 236360 MW; C11BB03DBE3BEEC4 CRC64;
MKLLTAVVAA QLLLACLTCG AAGAPEDTEP QWRYCLQTGL AHLLDFRGRW SSEFRPGGEA
ALASFTSEVL NWRVEVGTVG SGDCGEDGAA EDACRKEIVI HFRGVPRCPT DTRVSIRGRS
VRVVQLDTDG RNSSRTVAEL RLGGDSEGDD DDDRLPASVP LLLRSFDSGR QHSLHLSGTG
LHLRWSDDGF LALLLESSSA AAAASSTGRT GGWCAGERPP PAPREGRTAA KRRRVRPANR
GPCGRLDRPD GRLARSCLRD PFVEVAGSAA AYATARRFLL SSRGRDCNSG GRSGAACRDG
RAVLSLCHRR CPDRCAALRR LPPLMCALAP LCACAPPRLL DSRGRCVEEA ECPCRHGNRI
YHRGERFTKD CQTCSCLGNN NWRCASSDCG RVCLATPGQV VTFDGQKISF VDNRNDYYLI
EPDDSKIPLA VKFSRSSPAR LEVSWQGQQV LVYHRPATDS ESGDFVIQLG DRSQKLAIGE
HLRVGLGFFL HRVTSHYIRL RVADLLRVDF NGGTVELAAS RRLRGARRLR GLCGRYDGSS
ENDLWPRRGR AAADSLSVAR SYLVKAISPA GGPKRTTISR EAVRTCYDLF QHSGQFQRCA
RTENLQPFLR NCEAAGDRDS VCYIAFNAMR LCARSGYKIH WRLLPQLRNC ALIACPRGGS
SFRLCANTCQ ATCASLARPD RCPSGCYFGC QCPRGRYLMA DGRCVTKSQC TCFDSATGRQ
RQPGEKFSRR NENCKCQDGR AVCQAAQSAS ANGRVACPKN QVWRRAQPGE CWPRRCEDPV
KDGSRCQSPD SAAEERCRCP AGLFATDSGL CVNRSDCPCR YGGRELLTGS VLTAQCPGRV
CSSGRWLNRA ASESSSDSGC RGRCRAFGDG YYVSFDGRAF RYYAGSRETR LLQLPQLRVA
TRSVACGSSR PSVCARLVKI RLAGGDEIRV LQGRIIAGLR HVDGEAVVLT ATAALGSLHF
LRLGVSIVWD RGTRLTINAA RPRSGQPIGG LCGNFNGDAT DDYRTPGGLL ALSPDEFGRS
WTLESDYAQP IPESEYDQGC LKSRENFDWS FARCGLLRDQ RGPFGRCLSV LNEDRLEFLH
DACLREACRC RRTVKDAANC DGLCNLMAAA ADECAGLNIT VDWRSAAFCP YNCPEGTAYT
SCGQLQSPFV FANGPDVQCL EDCRCRMGGQ WTQAAGSCTE ASHVCRYRAA VYQTGDSAKL
GCRQCTCREG KWIRCDNKDC YRNTTIELNN LSTVAPCTGH SFACRNSGAC LHRSKVCDGV
PDCSDGSDEM SCEGCPAFKL KCRRSNLCIA KSEVCDGKHQ CRLGNSTDDS DEENCATGAT
TSAPQKKPIE GTDSCYRPLL QSGLIKESQV RITENFAPTH GYRRAELLDS RKRPYLLSKG
GTVFELRLSP AGSAVRVSSV LLQIQSNHRR GLKAIRLYAN SNKTWQLLEA KESNIACTRC
LIKLNKLPFA AEKLKFKIQS RSSMKIQLEI NGCPGQNLES GRCRRLPATA CMVKVYKSIT
YNREYSGEEK SRLVLPRTEY SLLNQGSSEI VIGAPETVRI CAVIIKSKLP SDGKPTANVV
SFVYDKKLSE LTLVSKLPIS TNTTTITISK PTNEYKAVLY SKDPIRISIE IQVKKQPVTK
NEPKQKKIAC KGSLPDSEVN GKVRVLRMTA LDSRQEYTQS DRRQLLRKGA ALGSGRTEVL
VQFSLKQTDV IEALQVLTTT PGADAKVTAR ELQLCTRDRC SKVKMSTPSV QQTNMVIRAR
ARFLKIILDS PTSRYAKNAK LKFHFCSRNK PHVCKERYLE RCNVPLMKSI TFGKRYDAYA
VKALLQGRPV RLLRGYTKIT LINLNLAESV ILKKTGGRRP LKLIISLTTT NERKRKTTMT
KLETVYSAKL PTGYTKMIIT IKSKEPSKLA LFKEECQKPQ KIPNRRPKTA AHTTTIAPAT
TTLEPTKRPT TASTAAAKTT KAPVLTTTTK RRRVCKKKMA FDSNVLRPED VVEVKRVPQN
EPYDEPEKKS LLPGRRGLPL RKGDTLVRID LSRLPEGSRV AEIKVSADKP KRITKVSVTE
VTNNERTPVG KAEQPGDEPL SRDDLESIPL TPTRRTPDLI EVRIRRGSKR PVRVTLEVKV
CLKKKPATTT AQSTTTSATT TRRPSTAGLT TEQRALHGEH RPAKR
//