ID Q3U4I5_MOUSE Unreviewed; 1153 AA.
AC Q3U4I5;
DT 11-OCT-2005, integrated into UniProtKB/TrEMBL.
DT 11-OCT-2005, sequence version 1.
DT 27-MAR-2024, entry version 117.
DE RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090 {ECO:0000313|EMBL:BAE32446.1};
RN [1] {ECO:0000313|EMBL:BAE32446.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD {ECO:0000313|EMBL:BAE32446.1};
RX PubMed=10349636; DOI=10.1016/S0076-6879(99)03004-9;
RA Carninci P., Hayashizaki Y.;
RT "High-efficiency full-length cDNA cloning.";
RL Methods Enzymol. 303:19-44(1999).
RN [2] {ECO:0000313|EMBL:BAE32446.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD {ECO:0000313|EMBL:BAE32446.1};
RX PubMed=11042159; DOI=10.1101/gr.145100;
RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT full-length cDNA libraries for rapid discovery of new genes.";
RL Genome Res. 10:1617-1630(2000).
RN [3] {ECO:0000313|EMBL:BAE32446.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD {ECO:0000313|EMBL:BAE32446.1};
RX PubMed=11076861; DOI=10.1101/gr.152600;
RA Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT pipeline with 384 multicapillary sequencer.";
RL Genome Res. 10:1757-1771(2000).
RN [4] {ECO:0000313|EMBL:BAE32446.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD {ECO:0000313|EMBL:BAE32446.1};
RX PubMed=11217851; DOI=10.1038/35055500;
RG The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium;
RT "Functional annotation of a full-length mouse cDNA collection.";
RL Nature 409:685-690(2001).
RN [5] {ECO:0000313|EMBL:BAE32446.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD {ECO:0000313|EMBL:BAE32446.1};
RX PubMed=12466851; DOI=10.1038/nature01266;
RG The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I and II Team;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs.";
RL Nature 420:563-573(2002).
RN [6] {ECO:0000313|EMBL:BAE32446.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD {ECO:0000313|EMBL:BAE32446.1};
RA Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RL Submitted (MAR-2004) to the EMBL/GenBank/DDBJ databases.
RN [7] {ECO:0000313|EMBL:BAE32446.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD {ECO:0000313|EMBL:BAE32446.1};
RG The FANTOM Consortium;
RG Riken Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group);
RT "The Transcriptional Landscape of the Mammalian Genome.";
RL Science 309:1559-1563(2005).
RN [8] {ECO:0000313|EMBL:BAE32446.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD {ECO:0000313|EMBL:BAE32446.1};
RX PubMed=16141073; DOI=10.1126/science.1112009;
RG RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) and the FANTOM Consortium;
RT "Antisense Transcription in the Mammalian Transcriptome.";
RL Science 309:1564-1566(2005).
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004479,
CC ECO:0000256|RuleBase:RU003762}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004479, ECO:0000256|RuleBase:RU003762}.
CC -!- SIMILARITY: Belongs to the integrin alpha chain family.
CC {ECO:0000256|ARBA:ARBA00008054, ECO:0000256|RuleBase:RU003762}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK154224; BAE32446.1; -; mRNA.
DR AlphaFoldDB; Q3U4I5; -.
DR EPD; Q3U4I5; -.
DR PeptideAtlas; Q3U4I5; -.
DR GO; GO:0008305; C:integrin complex; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0007229; P:integrin-mediated signaling pathway; IEA:UniProtKB-KW.
DR CDD; cd01469; vWA_integrins_alpha_subunit; 1.
DR Gene3D; 1.20.5.930; Bicelle-embedded integrin alpha(iib) transmembrane segment; 1.
DR Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 1.
DR Gene3D; 2.60.40.1460; Integrin domains. Chain A, domain 2; 1.
DR Gene3D; 2.60.40.1510; ntegrin, alpha v. Chain A, domain 3; 1.
DR Gene3D; 2.60.40.1530; ntegrin, alpha v. Chain A, domain 4; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR013519; Int_alpha_beta-p.
DR InterPro; IPR000413; Integrin_alpha.
DR InterPro; IPR018184; Integrin_alpha_C_CS.
DR InterPro; IPR013649; Integrin_alpha_Ig-like_1.
DR InterPro; IPR048285; Integrin_alpha_Ig-like_2.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR032695; Integrin_dom_sf.
DR InterPro; IPR048633; ITGAX-like_Ig_3.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR23220; INTEGRIN ALPHA; 1.
DR PANTHER; PTHR23220:SF130; INTEGRIN ALPHA-M; 1.
DR Pfam; PF01839; FG-GAP; 1.
DR Pfam; PF08441; Integrin_A_Ig_1; 1.
DR Pfam; PF20805; Integrin_A_Ig_2; 1.
DR Pfam; PF00357; Integrin_alpha; 1.
DR Pfam; PF21520; ITGAX-like_Ig_3; 1.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR01185; INTEGRINA.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00191; Int_alpha; 5.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 1.
DR SUPFAM; SSF69179; Integrin domains; 3.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS51470; FG_GAP; 5.
DR PROSITE; PS00242; INTEGRIN_ALPHA; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 2: Evidence at transcript level;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889,
KW ECO:0000256|RuleBase:RU003762};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Integrin {ECO:0000256|ARBA:ARBA00023037, ECO:0000256|RuleBase:RU003762};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|RuleBase:RU003762};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Receptor {ECO:0000256|ARBA:ARBA00023170, ECO:0000256|RuleBase:RU003762};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|RuleBase:RU003762};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692,
KW ECO:0000256|RuleBase:RU003762};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|RuleBase:RU003762}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|RuleBase:RU003762"
FT CHAIN 17..1153
FT /note="VWFA domain-containing protein"
FT /evidence="ECO:0000256|RuleBase:RU003762"
FT /id="PRO_5001425843"
FT TRANSMEM 1107..1129
FT /note="Helical"
FT /evidence="ECO:0000256|RuleBase:RU003762"
FT REPEAT 18..75
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
FT DOMAIN 150..328
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REPEAT 339..390
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
FT REPEAT 443..503
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
FT REPEAT 506..564
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
FT REPEAT 569..629
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
SQ SEQUENCE 1153 AA; 127419 MW; A9ABDEEC6291527B CRC64;
MTLKALLVTA LALCHGFNLD TEHPMTFQEN AKGFGQSVVQ LGGTSVVVAA PQEAKAVNQT
GALYQCDYST SRCHPIPLQV PPEAVNMSLG LSLAVSTVPQ QLLACGPTVH QNCKENTYVN
GLCYLFGSNL LRPPQQFPEA LRECPQQESD IVFLIDGSGS INNIDFQKMK EFVSTVMEQF
KKSKTLFSLM QYSDEFRIHF TFNDFKRNPS PRSHVSPIKQ LNGRTKTASG IRKVVRELFH
KTNGARENAA KILVVITDGE KFGDPLDYKD VIPEADRAGV IRYVIGVGNA FNKPQSRREL
DTIASKPAGE HVFQVDNFEA LNTIQNQLQE KIFAIEGTQT GSTSSFEHEM SQEGFSASIT
SNGPLLGSVG SFDWAGGAFL YTSKDKVTFI NTTRVDSDMN DAYLGYASAV ILRNRVQSLV
LGAPRYQHIG LVVMFRENFG TWEPHTSIKG SQIGSYFGAS LCSVDMDADG NTNLILIGAP
HYYEKTRGGQ VSVCPLPRGR ARWQCEALLH GDQGHPWGRF GAALTVLGDV NGDKLTDVAI
GAPGEQENQG AVYIFYGASI ASLSASHSQR IIGAHFSPGL QYFGQSLSGG KDLTMDGLMD
LAVGAQGHLL LLRAQPVLRL EATMEFSLKK VARSVFACQE QVLKNKDAGE VRVCLRVRKN
TKDRLREGDI QSTVTYDLAL DPGRSRIRAF FDETKNNTRR RTQVFGLMQK CETLKLILPD
CVDDSVSPII LRLNYTLVGE PLRSFGNLRP VLAMDAQRFF TAMFPFEKNC GNDSICQDDL
SITMSAMGLD TLVVGGPQDF NMSVTLRNDG EDSYGTQVTV YYPSGLSYRK DSASQNPLTK
KPWFVKPAES SSSSEGHGAL KSTTWNINHP IFPANSEVTF NVTFDVDSHA SFGNKLLLKA
IVASENNMSR THKTKFQLEL PVKYAIYMIV TSDESSIRYL NFTASEMTSK VIQHQYQFNN
LGQRSLPVSV VFWIPVQINN VTVWDHPQVI FSQNLSSACH TEQKSPPHSN FRDQLERTPV
LNCSVAVCKR IQCDLPSFNT QEIFNVTLKG NLSFDWYIKT SHGHLLLVSS TEILFNDSAF
ALLPGQESYV RSKTETKVEP YEVHNPVPLI VGSSIGGLVL LALITAGLYK LGFFKRQYKD
MMNEAAPQDA PPQ
//