ID G3Q5M4_GASAC Unreviewed; 903 AA.
AC G3Q5M4;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE RecName: Full=VWFD domain-containing protein {ECO:0000259|PROSITE:PS51233};
GN Name=spg4 {ECO:0000313|Ensembl:ENSGACP00000025180.1};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000025180.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000025180.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000025180.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3Q5M4; -.
DR Ensembl; ENSGACT00000025229.1; ENSGACP00000025180.1; ENSGACG00000019037.1.
DR GeneTree; ENSGT00940000165245; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR Bgee; ENSGACG00000019037; Expressed in mesonephros and 2 other cell types or tissues.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF289; MUCIN-19-LIKE; 1.
DR Pfam; PF08742; C8; 2.
DR Pfam; PF00094; VWD; 3.
DR SMART; SM00832; C8; 1.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 2.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR PROSITE; PS51233; VWFD; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..903
FT /note="VWFD domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003450513"
FT DOMAIN 32..202
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 362..532
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 805..903
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
SQ SEQUENCE 903 AA; 101942 MW; B97FB8AD9893D3AA CRC64;
MTTQRWILAF CLSLASVFGT VELFKTKEIQ TYTCRTFGSG IVQPFKGESY YVRSDCPFKL
TSFNVNRGEY SVTIRRGHNG LLVQVEIIVN KVTTLLQNGH ILVQNNSVSL PYDHTYQHIF
KYGIYTRLRS SLLPFTVTWH NVHGGINSLW VTLESELCTD MCGLCGKQNV AGHRDELIRE
SKLHDHRCKI RDPVLQKNHI CRRFFLKTKN CLQDNNSHYH RLCKENICGF ENSQSIFCPF
FQEVASQCNQ SRINRFWRRL TRCAKPRCPG DLIYEKKGPA FIPSCSNPNP APFYQELTET
CACPNGKVLN NCEKGYRCIP KSSCSCEFAG KTYGNGEIRS SRCQSCTCDG GKWRCSENFC
HRRCVIEGQF VTTFDGKQYV LPNKCLYVAS KGPNWIIIIE FSQKKLHIRK VTVQLMEELF
VFKKNKVLFD GQEIPEFHFS GHAQVYWVSS MFVQVHTTIG INFQIQLSPE IHLFIDAPDT
SNDKIKGLCG NSNSDTTDDF TTNSGIIENS AKPFAMSWSL LNCFGNIPTT CTNLENENYA
HEKCAVLNQP TGIFAKCHPH IPTDYYYTAC IQRICNSAGS RRQGLCIGLA SYAKACAGVG
VVIGDWRRIT GCDLKCQKNQ EFSYSMHICN RTCNSLTGHD IRCDMNDDAV EGCGCPEGTH
LNQGQTCCPK EECGCIYYGG IAAPGPVVIA GQKCDCKNGV LNCLPNCDCR NGKVCVSCSE
GQHKRVQKTC DYISKPKGTR ENCKSGCYCP DHQYEDHHGN CVSLDDCTCV FSGKAFKAGQ
QVTSNCKTCT CYRGQWHCIE KPCPGQCQVY GNGHYQTFDS KWFRFSGQCL YTLVQDSCDM
RRGTFSIRVE SVPCCEEVLT CSRNIILDLK GQVTLTLRDM QVIRRLHEGW NGQDDSLYSI
HTL
//