GenomeNet

Database: UniProt
Entry: A0A267GKY7_9PLAT
ID   A0A267GKY7_9PLAT        Unreviewed;      2145 AA.
AC   A0A267GKY7;
DT   22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT   22-NOV-2017, sequence version 1.
DT   27-MAR-2024, entry version 19.
DE   RecName: Full=VWFD domain-containing protein {ECO:0000259|PROSITE:PS51233};
GN   ORFNames=BOX15_Mlig013356g1 {ECO:0000313|EMBL:PAA86064.1};
OS   Macrostomum lignano.
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC   Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX   NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA86064.1, ECO:0000313|Proteomes:UP000215902};
RN   [1] {ECO:0000313|EMBL:PAA86064.1, ECO:0000313|Proteomes:UP000215902}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=DV1 {ECO:0000313|EMBL:PAA86064.1};
RC   TISSUE=Whole organism {ECO:0000313|EMBL:PAA86064.1};
RA   Berezikov E.;
RT   "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT   model organism for stem cell research.";
RL   Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PAA86064.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NIVC01000296; PAA86064.1; -; Genomic_DNA.
DR   STRING; 282301.A0A267GKY7; -.
DR   Proteomes; UP000215902; Unassembled WGS sequence.
DR   CDD; cd00112; LDLa; 1.
DR   CDD; cd19941; TIL; 1.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 2.
DR   InterPro; IPR036055; LDL_receptor-like_sf.
DR   InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR   PANTHER; PTHR11339:SF401; INTESTINAL MUCIN-LIKE PROTEIN ISOFORM X1; 1.
DR   Pfam; PF00057; Ldl_recept_a; 1.
DR   Pfam; PF01826; TIL; 1.
DR   Pfam; PF00094; VWD; 2.
DR   PRINTS; PR00261; LDLRECEPTOR.
DR   SMART; SM00832; C8; 1.
DR   SMART; SM00192; LDLa; 2.
DR   SMART; SM00216; VWD; 2.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   SUPFAM; SSF57424; LDL receptor-like module; 1.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR   PROSITE; PS50068; LDLRA_2; 2.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR   PROSITE; PS51233; VWFD; 2.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00124}; Reference proteome {ECO:0000313|Proteomes:UP000215902};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..2145
FT                   /note="VWFD domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012831381"
FT   DOMAIN          391..571
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          862..1029
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   REGION          206..245
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1307..1327
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1978..1997
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2036..2073
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2103..2145
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1310..1327
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1978..1994
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2106..2132
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        1257..1272
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ   SEQUENCE   2145 AA;  236360 MW;  C11BB03DBE3BEEC4 CRC64;
     MKLLTAVVAA QLLLACLTCG AAGAPEDTEP QWRYCLQTGL AHLLDFRGRW SSEFRPGGEA
     ALASFTSEVL NWRVEVGTVG SGDCGEDGAA EDACRKEIVI HFRGVPRCPT DTRVSIRGRS
     VRVVQLDTDG RNSSRTVAEL RLGGDSEGDD DDDRLPASVP LLLRSFDSGR QHSLHLSGTG
     LHLRWSDDGF LALLLESSSA AAAASSTGRT GGWCAGERPP PAPREGRTAA KRRRVRPANR
     GPCGRLDRPD GRLARSCLRD PFVEVAGSAA AYATARRFLL SSRGRDCNSG GRSGAACRDG
     RAVLSLCHRR CPDRCAALRR LPPLMCALAP LCACAPPRLL DSRGRCVEEA ECPCRHGNRI
     YHRGERFTKD CQTCSCLGNN NWRCASSDCG RVCLATPGQV VTFDGQKISF VDNRNDYYLI
     EPDDSKIPLA VKFSRSSPAR LEVSWQGQQV LVYHRPATDS ESGDFVIQLG DRSQKLAIGE
     HLRVGLGFFL HRVTSHYIRL RVADLLRVDF NGGTVELAAS RRLRGARRLR GLCGRYDGSS
     ENDLWPRRGR AAADSLSVAR SYLVKAISPA GGPKRTTISR EAVRTCYDLF QHSGQFQRCA
     RTENLQPFLR NCEAAGDRDS VCYIAFNAMR LCARSGYKIH WRLLPQLRNC ALIACPRGGS
     SFRLCANTCQ ATCASLARPD RCPSGCYFGC QCPRGRYLMA DGRCVTKSQC TCFDSATGRQ
     RQPGEKFSRR NENCKCQDGR AVCQAAQSAS ANGRVACPKN QVWRRAQPGE CWPRRCEDPV
     KDGSRCQSPD SAAEERCRCP AGLFATDSGL CVNRSDCPCR YGGRELLTGS VLTAQCPGRV
     CSSGRWLNRA ASESSSDSGC RGRCRAFGDG YYVSFDGRAF RYYAGSRETR LLQLPQLRVA
     TRSVACGSSR PSVCARLVKI RLAGGDEIRV LQGRIIAGLR HVDGEAVVLT ATAALGSLHF
     LRLGVSIVWD RGTRLTINAA RPRSGQPIGG LCGNFNGDAT DDYRTPGGLL ALSPDEFGRS
     WTLESDYAQP IPESEYDQGC LKSRENFDWS FARCGLLRDQ RGPFGRCLSV LNEDRLEFLH
     DACLREACRC RRTVKDAANC DGLCNLMAAA ADECAGLNIT VDWRSAAFCP YNCPEGTAYT
     SCGQLQSPFV FANGPDVQCL EDCRCRMGGQ WTQAAGSCTE ASHVCRYRAA VYQTGDSAKL
     GCRQCTCREG KWIRCDNKDC YRNTTIELNN LSTVAPCTGH SFACRNSGAC LHRSKVCDGV
     PDCSDGSDEM SCEGCPAFKL KCRRSNLCIA KSEVCDGKHQ CRLGNSTDDS DEENCATGAT
     TSAPQKKPIE GTDSCYRPLL QSGLIKESQV RITENFAPTH GYRRAELLDS RKRPYLLSKG
     GTVFELRLSP AGSAVRVSSV LLQIQSNHRR GLKAIRLYAN SNKTWQLLEA KESNIACTRC
     LIKLNKLPFA AEKLKFKIQS RSSMKIQLEI NGCPGQNLES GRCRRLPATA CMVKVYKSIT
     YNREYSGEEK SRLVLPRTEY SLLNQGSSEI VIGAPETVRI CAVIIKSKLP SDGKPTANVV
     SFVYDKKLSE LTLVSKLPIS TNTTTITISK PTNEYKAVLY SKDPIRISIE IQVKKQPVTK
     NEPKQKKIAC KGSLPDSEVN GKVRVLRMTA LDSRQEYTQS DRRQLLRKGA ALGSGRTEVL
     VQFSLKQTDV IEALQVLTTT PGADAKVTAR ELQLCTRDRC SKVKMSTPSV QQTNMVIRAR
     ARFLKIILDS PTSRYAKNAK LKFHFCSRNK PHVCKERYLE RCNVPLMKSI TFGKRYDAYA
     VKALLQGRPV RLLRGYTKIT LINLNLAESV ILKKTGGRRP LKLIISLTTT NERKRKTTMT
     KLETVYSAKL PTGYTKMIIT IKSKEPSKLA LFKEECQKPQ KIPNRRPKTA AHTTTIAPAT
     TTLEPTKRPT TASTAAAKTT KAPVLTTTTK RRRVCKKKMA FDSNVLRPED VVEVKRVPQN
     EPYDEPEKKS LLPGRRGLPL RKGDTLVRID LSRLPEGSRV AEIKVSADKP KRITKVSVTE
     VTNNERTPVG KAEQPGDEPL SRDDLESIPL TPTRRTPDLI EVRIRRGSKR PVRVTLEVKV
     CLKKKPATTT AQSTTTSATT TRRPSTAGLT TEQRALHGEH RPAKR
//