ID A0A3Q2IAY2_HORSE Unreviewed; 2510 AA.
AC A0A3Q2IAY2;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 2.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=VWFD domain-containing protein {ECO:0000259|PROSITE:PS51233};
GN Name=FCGBP {ECO:0000313|Ensembl:ENSECAP00000044714.2};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000044714.2, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000044714.2, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000044714.2,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000044714.2}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000044714.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000044714; -.
DR PaxDb; 9796-ENSECAP00000044714; -.
DR Ensembl; ENSECAT00000046019.2; ENSECAP00000044714.2; ENSECAG00000011718.4.
DR GeneTree; ENSGT00940000163156; -.
DR InParanoid; A0A3Q2IAY2; -.
DR OMA; YQVAQVQ; -.
DR Proteomes; UP000002281; Chromosome 10.
DR Bgee; ENSECAG00000011718; Expressed in epithelium of bronchus and 16 other cell types or tissues.
DR ExpressionAtlas; A0A3Q2IAY2; baseline.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR CDD; cd19941; TIL; 6.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR InterPro; IPR003645; Fol_N.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR025615; TILa_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR46160; ALPHA-TECTORIN-RELATED; 1.
DR PANTHER; PTHR46160:SF5; PROTEIN CBG11501; 1.
DR Pfam; PF08742; C8; 6.
DR Pfam; PF01826; TIL; 6.
DR Pfam; PF12714; TILa; 5.
DR Pfam; PF00094; VWD; 7.
DR SMART; SM00832; C8; 6.
DR SMART; SM00274; FOLN; 3.
DR SMART; SM00214; VWC; 3.
DR SMART; SM00215; VWC_out; 6.
DR SMART; SM00216; VWD; 7.
DR SUPFAM; SSF57567; Serine protease inhibitors; 6.
DR PROSITE; PS51233; VWFD; 7.
PE 1: Evidence at protein level;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0A3Q2IAY2};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..161
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 373..552
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 761..940
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1178..1361
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1577..1760
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1959..2130
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2338..2509
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT REGION 909..933
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2510 AA; 265464 MW; A206045A44AE2A7D CRC64;
MMGTCTYTMA ELCSSDQTLP AFSVEAKNEH RGSRRVSYVG FVTVRAYSHS VSIARGEVGF
VRIDNQRSRL PASLANGRFR VYQSGRQAVA ELDFGLVVTY NWDSQLVLSL PERFQDQVCG
LCGNYNGDPA DDFLTPDWEQ APDAVEFAGS WRLDDGDYLC DDGCQDDCPS CTAGQAQHYE
GDGLCGMLTL SNGPFAACHD ALAPRAFLEE CVYDLCVFNG DRASLCRSLS AYAQACLELG
VSVGNWRSPA NCPLSCPANS RYESCGPACP ATCNPEAAPA DCSGRPCVEG CVCLPGFVAS
GDACVASSSC GCVHEGRPLA PGQAVWADEK CQRRCTCDGA TQQVRCSDTQ GCPAGERCRV
QNGLLGCYAD RFGSCQGSGD PHYVSFDGRR FDFMGTCTYL LAGSCGQAQA LAAFRVLVEN
EHRGSQTVSY ARAVIVEARG VKVAVRREHP GQVLVDDILR HLPFQAADGQ VQVFRQGQKA
VVSTDFGLTV TYNWDAHVTV KVPQSYAGAL CGLCGNFNGD SSDDLALRGG GQAANALAFG
KSWQEETRPG CGATEPGDCP KLDSLLAEQL QSKKDCGIVA DPKGPFRECH KVLDPQGVVR
DCVYDRCLLP GQTEPLCDAL ATYAAACQAA GVTVHAWRSE EFCPLDCPRH SHYEACASGC
PLSCGDLPVP GGCGSDCHEG CVCDEGFALS GESCVALPSC GCVYAGAYHP PGQSFHPGPD
CNSLCHCQEG GLVSCEPSTC GPHEACQPSG GILGCVAVGS ATCQASGDPH YTTFDGKRFD
FMGTCVYALA RTCGTRPGLH QFAVLQENVA WGNGKVSVTK AITVQVANFT LRLEQNQWKV
KVNGVDMKLP MVLDQGRVRA SKHGSDVIIE TDFGLRVAYD LVYNVRVTVP GNYHRQLCGL
CGDYNGDPKD DFQKPDGSQA SDPKDFGNSW EEKVPDSPCV PPPPCEEDCD PGTCKPELQD
KYKQEKFCGL LASPTGPLAA CHKLLDPQGP LEDCVFDLCL GGGDESILCN NIHAYVSACQ
AAGGHVKPWR TESFCPLPCP PNSHYEVCAD TCSVGCAALS APSQCPDSCA EGCQCDPGFL
NNGQTCVPIQ ECGCYHNGAY YEPGQTVLID SCRQQCTCHA GGSMVCQAHS CKPGQVCEPL
KGVLSCVTKD PCQDVNCRPQ ETCQEKDGQA ICVPKYEVTC WLWGDPHYHS FDGRNFDFQG
TCNYVLVTTD CPGVSAQGLP PFTVTTKNEN RGNPAVSYVR LVTVAALGTN ISIHKGEIGK
VRVNGVLTAL PVSVAGGRLS VTQGASKAVL TTDFGLTVTY DWDWRVEVTL PSSYDSAVCG
LCGNMDRNPS NDQAFPNGTL APSIPVWGGS WRVPDWDPLC WHECQGSCPT CPEDQLDLYE
GPGFCGPLAP GSGGPFAACH AHVTPETFFK GCVLDVCLGG GVYDILCQAL AAYAAACQAA
GIVIKDWREQ TDCMMPCPEN SHYELCGPPC PASCPSPTPP TTTALCEGPC TEGCQCDSGF
VFSADRCVPL DGGCGCWANG AYHEAGSEFW ADATCSQWCH CGPGGGSLVC KPASCGLGEQ
CALLPSGQYG CQPVSTAECQ AWGDPHYTTL DGHRFDFQGS CEYLLSAPCH APPAGAENFT
VTVTNEHRGS QAVSYTRSVT LHIYGHRLTL SAQWPRKLQV DGEFVALPFQ PDSRLSAYVS
GADVVVTAAA GLSLAFDGDS NVRLRVSSAY AGALCGLCGN YNRDPADDLT AVGGDPSGWQ
VGGAAGCGEC VPGPCPEPCK PEEQKPFGGP DACGVISATD GPLAPCHSLV PPAEYFQACL
LDACQAQGHP GALCPAVAAY VALCQAAGAQ LGEWRRPDFC HVQCPAHSHY ELCGNSCPVS
CPSLSVPEDC EPTCREGCVC NAGFVLSGDS CVPVGQCGCL YNGRYYPLGE AFYPGPECER
RCECGQGGLV SCQEGAACRP YEECRIEDGV QACHPTGCGR CLANGGIHYV TLDGRVYDLH
GSCSYVLAQV CHPQPGDEDF AIMLEKNEAG DPQRLLVIVA DQVVGLTQGP QVTVDGEAVT
LPVAVGAVRV TAEGRNMVLQ TTKGLRLLFD GDAHILISVP SPFRGRLCGL CGNFNGNWSD
DFVLPSGTVA PSVEAFGAAW RAPSSLQGCG EGCGPQGCPV CLAQETAAYE STEACGRLRD
PEGPFAGCHA ELSPSEYFRQ CVYDLCAHKG DRSFLCSSMA AYTAACQAAG GAVTSWRTDS
FCPLECPAHS HYSVCTRSCR GSCAALSGLT GCTSRCFEGC ECDDRFLLSD GVCIPIQDCG
CTHAGRYLPV NSSLLSSDCS ERCSCSASAG LTCQAAGCLL GRVCDVQSGV RDCWVRQGLC
SLSVGANLTT FDGARSAIGS SGVYEVSSRC PGLRATVPWY RVLAGVWPCH GKDKAVGQAH
IFFQDGLVTV TPNNGVWVNG LRVDLPAQVL TSVSVSQNPD GSVLVQQKAG VRVWLTTDGQ
LAVMVSDEHA GKLCGACGNF DGDQTNDGRG SQGKTGVESW RAPDFYPCHD
//