ID R7TEQ1_CAPTE Unreviewed; 940 AA.
AC R7TEQ1;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
GN ORFNames=CAPTEDRAFT_198366 {ECO:0000313|EMBL:ELT89952.1};
OS Capitella teleta (Polychaete worm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC Sedentaria; Scolecida; Capitellidae; Capitella.
OX NCBI_TaxID=283909 {ECO:0000313|EMBL:ELT89952.1};
RN [1] {ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ELT89952.1, ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELT89952.1,
RC ECO:0000313|Proteomes:UP000014760};
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:CapteP198366}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQN01014622; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB311106; ELT89952.1; -; Genomic_DNA.
DR AlphaFoldDB; R7TEQ1; -.
DR STRING; 283909.R7TEQ1; -.
DR EnsemblMetazoa; CapteT198366; CapteP198366; CapteG198366.
DR HOGENOM; CLU_332400_0_0_1; -.
DR OMA; YLECCES; -.
DR Proteomes; UP000014760; Unassembled WGS sequence.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 3.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR006558; LamG-like.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF13385; Laminin_G_3; 1.
DR Pfam; PF19030; TSP1_ADAMTS; 1.
DR Pfam; PF00090; TSP_1; 2.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR01217; PRICHEXTENSN.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00560; LamGL; 1.
DR SMART; SM00209; TSP1; 5.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 3.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50092; TSP1; 2.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000014760};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..940
FT /note="VWFA domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008786931"
FT DOMAIN 144..322
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 496..594
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 502..594
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 940 AA; 102424 MW; 10C1FE145D50E6F1 CRC64;
MRFWLALAVL IGAAFCEEEI LTGEDKTLAD DILQNDGDWS DWSDCSASCG GGITVRSIKC
YDTLYGRIVD PCREVHVSCN SIPCPEDPDY IAWEDWTDCS AVCGGGTRTR NGLGNDGVEH
TETINCGMQS CDTLDPYLEC GKANVMFVLD SSQSVGAENW YKVKQWAIDV ISSLNIGVTE
THVGVVVYST KVNVGLALDE SYDAETIKQK IWELPYLAGV TNTVDALASV VPAMEAARRS
DAQDIVILMS DGRTNVEQDS LLDTAQQIKD QGVTVFTIGN TYLTGVNKKN FDELKAVCTQ
PSEDYYRGVE KYDQLQDIVD SIVEGACDVT QIIECGDWSE WSACPVSCDG PGQETRSRLC
QLVDSTGAII EDNIERVETR PCGMEPCPEV VCEDWVEFTP CPIECGGPAE EILTRVCELI
DQDGNHVGTD IQEEKRVCAE EPCPEPTEEC PDWEEYPECP VSCDGPVEQI RSRVCDIINY
KGETIEQVIR EEMQNCGEEP CPEPVTEPPT EPPTEPPTEP PTEPPTEPPT EPPTEPPTEP
PTEPPTEPPT EPPTEPPTEP PTEPPTEPPT EPPTEPPTEP PTDPPTPAPT APPAVPAVDC
STCDYGPYGI VVYLPDPKNC QCFYQCQRVG GVEGAYTYLT HHQCCAPGLT WRQTWMTCVT
EAFKETDACI EIAEEIPAVE PVVVEESTCP LQVVPGKSEY FLNGGIEVYC GDGMVFDLAE
CTCMPASTDI VCDSDVLLYF PFDEDLHDHS CQRAVSTQTS EASVVLVEDA QRGTVAFFDG
ASSLHVGFIY NYFADRSVTA WTVTVWFKRT GGTELVSGLL NNGDCVGSPS FGMHLGDGQV
GSVSVDTDVS SAMVAIDGVQ VMHDDWQHMA LVYDGSALNM YLDGANVNAV PASGAIENRQ
CAMNIGAEHA GTEYFEGFMD DIYIYERALS AEEVQTLSGL
//