GenomeNet

Database: UniProt
Entry: A0A3Q2IAY2_HORSE
LinkDB: A0A3Q2IAY2_HORSE
Original site: A0A3Q2IAY2_HORSE 
ID   A0A3Q2IAY2_HORSE        Unreviewed;      2510 AA.
AC   A0A3Q2IAY2;
DT   10-APR-2019, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2023, sequence version 2.
DT   27-MAR-2024, entry version 21.
DE   RecName: Full=VWFD domain-containing protein {ECO:0000259|PROSITE:PS51233};
GN   Name=FCGBP {ECO:0000313|Ensembl:ENSECAP00000044714.2};
OS   Equus caballus (Horse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX   NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000044714.2, ECO:0000313|Proteomes:UP000002281};
RN   [1] {ECO:0000313|Ensembl:ENSECAP00000044714.2, ECO:0000313|Proteomes:UP000002281}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000044714.2,
RC   ECO:0000313|Proteomes:UP000002281};
RX   PubMed=19892987; DOI=10.1126/science.1178158;
RG   Broad Institute Genome Sequencing Platform;
RG   Broad Institute Whole Genome Assembly Team;
RA   Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA   Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA   Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA   Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA   Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA   Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA   Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA   Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA   Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA   Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT   "Genome sequence, comparative analysis, and population genetics of the
RT   domestic horse.";
RL   Science 326:865-867(2009).
RN   [2] {ECO:0000313|Ensembl:ENSECAP00000044714.2}
RP   IDENTIFICATION.
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000044714.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 9796.ENSECAP00000044714; -.
DR   PaxDb; 9796-ENSECAP00000044714; -.
DR   Ensembl; ENSECAT00000046019.2; ENSECAP00000044714.2; ENSECAG00000011718.4.
DR   GeneTree; ENSGT00940000163156; -.
DR   InParanoid; A0A3Q2IAY2; -.
DR   OMA; YQVAQVQ; -.
DR   Proteomes; UP000002281; Chromosome 10.
DR   Bgee; ENSECAG00000011718; Expressed in epithelium of bronchus and 16 other cell types or tissues.
DR   ExpressionAtlas; A0A3Q2IAY2; baseline.
DR   GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   CDD; cd19941; TIL; 6.
DR   Gene3D; 2.10.25.10; Laminin; 6.
DR   InterPro; IPR003645; Fol_N.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR025615; TILa_dom.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR46160; ALPHA-TECTORIN-RELATED; 1.
DR   PANTHER; PTHR46160:SF5; PROTEIN CBG11501; 1.
DR   Pfam; PF08742; C8; 6.
DR   Pfam; PF01826; TIL; 6.
DR   Pfam; PF12714; TILa; 5.
DR   Pfam; PF00094; VWD; 7.
DR   SMART; SM00832; C8; 6.
DR   SMART; SM00274; FOLN; 3.
DR   SMART; SM00214; VWC; 3.
DR   SMART; SM00215; VWC_out; 6.
DR   SMART; SM00216; VWD; 7.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 6.
DR   PROSITE; PS51233; VWFD; 7.
PE   1: Evidence at protein level;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:A0A3Q2IAY2};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          1..161
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          373..552
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          761..940
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          1178..1361
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          1577..1760
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          1959..2130
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          2338..2509
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   REGION          909..933
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2510 AA;  265464 MW;  A206045A44AE2A7D CRC64;
     MMGTCTYTMA ELCSSDQTLP AFSVEAKNEH RGSRRVSYVG FVTVRAYSHS VSIARGEVGF
     VRIDNQRSRL PASLANGRFR VYQSGRQAVA ELDFGLVVTY NWDSQLVLSL PERFQDQVCG
     LCGNYNGDPA DDFLTPDWEQ APDAVEFAGS WRLDDGDYLC DDGCQDDCPS CTAGQAQHYE
     GDGLCGMLTL SNGPFAACHD ALAPRAFLEE CVYDLCVFNG DRASLCRSLS AYAQACLELG
     VSVGNWRSPA NCPLSCPANS RYESCGPACP ATCNPEAAPA DCSGRPCVEG CVCLPGFVAS
     GDACVASSSC GCVHEGRPLA PGQAVWADEK CQRRCTCDGA TQQVRCSDTQ GCPAGERCRV
     QNGLLGCYAD RFGSCQGSGD PHYVSFDGRR FDFMGTCTYL LAGSCGQAQA LAAFRVLVEN
     EHRGSQTVSY ARAVIVEARG VKVAVRREHP GQVLVDDILR HLPFQAADGQ VQVFRQGQKA
     VVSTDFGLTV TYNWDAHVTV KVPQSYAGAL CGLCGNFNGD SSDDLALRGG GQAANALAFG
     KSWQEETRPG CGATEPGDCP KLDSLLAEQL QSKKDCGIVA DPKGPFRECH KVLDPQGVVR
     DCVYDRCLLP GQTEPLCDAL ATYAAACQAA GVTVHAWRSE EFCPLDCPRH SHYEACASGC
     PLSCGDLPVP GGCGSDCHEG CVCDEGFALS GESCVALPSC GCVYAGAYHP PGQSFHPGPD
     CNSLCHCQEG GLVSCEPSTC GPHEACQPSG GILGCVAVGS ATCQASGDPH YTTFDGKRFD
     FMGTCVYALA RTCGTRPGLH QFAVLQENVA WGNGKVSVTK AITVQVANFT LRLEQNQWKV
     KVNGVDMKLP MVLDQGRVRA SKHGSDVIIE TDFGLRVAYD LVYNVRVTVP GNYHRQLCGL
     CGDYNGDPKD DFQKPDGSQA SDPKDFGNSW EEKVPDSPCV PPPPCEEDCD PGTCKPELQD
     KYKQEKFCGL LASPTGPLAA CHKLLDPQGP LEDCVFDLCL GGGDESILCN NIHAYVSACQ
     AAGGHVKPWR TESFCPLPCP PNSHYEVCAD TCSVGCAALS APSQCPDSCA EGCQCDPGFL
     NNGQTCVPIQ ECGCYHNGAY YEPGQTVLID SCRQQCTCHA GGSMVCQAHS CKPGQVCEPL
     KGVLSCVTKD PCQDVNCRPQ ETCQEKDGQA ICVPKYEVTC WLWGDPHYHS FDGRNFDFQG
     TCNYVLVTTD CPGVSAQGLP PFTVTTKNEN RGNPAVSYVR LVTVAALGTN ISIHKGEIGK
     VRVNGVLTAL PVSVAGGRLS VTQGASKAVL TTDFGLTVTY DWDWRVEVTL PSSYDSAVCG
     LCGNMDRNPS NDQAFPNGTL APSIPVWGGS WRVPDWDPLC WHECQGSCPT CPEDQLDLYE
     GPGFCGPLAP GSGGPFAACH AHVTPETFFK GCVLDVCLGG GVYDILCQAL AAYAAACQAA
     GIVIKDWREQ TDCMMPCPEN SHYELCGPPC PASCPSPTPP TTTALCEGPC TEGCQCDSGF
     VFSADRCVPL DGGCGCWANG AYHEAGSEFW ADATCSQWCH CGPGGGSLVC KPASCGLGEQ
     CALLPSGQYG CQPVSTAECQ AWGDPHYTTL DGHRFDFQGS CEYLLSAPCH APPAGAENFT
     VTVTNEHRGS QAVSYTRSVT LHIYGHRLTL SAQWPRKLQV DGEFVALPFQ PDSRLSAYVS
     GADVVVTAAA GLSLAFDGDS NVRLRVSSAY AGALCGLCGN YNRDPADDLT AVGGDPSGWQ
     VGGAAGCGEC VPGPCPEPCK PEEQKPFGGP DACGVISATD GPLAPCHSLV PPAEYFQACL
     LDACQAQGHP GALCPAVAAY VALCQAAGAQ LGEWRRPDFC HVQCPAHSHY ELCGNSCPVS
     CPSLSVPEDC EPTCREGCVC NAGFVLSGDS CVPVGQCGCL YNGRYYPLGE AFYPGPECER
     RCECGQGGLV SCQEGAACRP YEECRIEDGV QACHPTGCGR CLANGGIHYV TLDGRVYDLH
     GSCSYVLAQV CHPQPGDEDF AIMLEKNEAG DPQRLLVIVA DQVVGLTQGP QVTVDGEAVT
     LPVAVGAVRV TAEGRNMVLQ TTKGLRLLFD GDAHILISVP SPFRGRLCGL CGNFNGNWSD
     DFVLPSGTVA PSVEAFGAAW RAPSSLQGCG EGCGPQGCPV CLAQETAAYE STEACGRLRD
     PEGPFAGCHA ELSPSEYFRQ CVYDLCAHKG DRSFLCSSMA AYTAACQAAG GAVTSWRTDS
     FCPLECPAHS HYSVCTRSCR GSCAALSGLT GCTSRCFEGC ECDDRFLLSD GVCIPIQDCG
     CTHAGRYLPV NSSLLSSDCS ERCSCSASAG LTCQAAGCLL GRVCDVQSGV RDCWVRQGLC
     SLSVGANLTT FDGARSAIGS SGVYEVSSRC PGLRATVPWY RVLAGVWPCH GKDKAVGQAH
     IFFQDGLVTV TPNNGVWVNG LRVDLPAQVL TSVSVSQNPD GSVLVQQKAG VRVWLTTDGQ
     LAVMVSDEHA GKLCGACGNF DGDQTNDGRG SQGKTGVESW RAPDFYPCHD
//
DBGET integrated database retrieval system