ID W5NQS2_SHEEP Unreviewed; 1035 AA.
AC W5NQS2;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOARP00000000512.1};
GN Name=ITGA9 {ECO:0000313|Ensembl:ENSOARP00000000512.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000000512.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000000512.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000000512.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000000512.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004479,
CC ECO:0000256|RuleBase:RU003762}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004479, ECO:0000256|RuleBase:RU003762}.
CC -!- SIMILARITY: Belongs to the integrin alpha chain family.
CC {ECO:0000256|ARBA:ARBA00008054, ECO:0000256|RuleBase:RU003762}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01044257; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01044258; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01044259; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01044260; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01044261; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01044262; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01044263; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01044264; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01044265; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5NQS2; -.
DR SMR; W5NQS2; -.
DR STRING; 9940.ENSOARP00000000512; -.
DR PaxDb; 9940-ENSOARP00000000512; -.
DR Ensembl; ENSOART00000000540.1; ENSOARP00000000512.1; ENSOARG00000000498.1.
DR eggNOG; KOG3637; Eukaryota.
DR HOGENOM; CLU_004111_5_0_1; -.
DR OMA; MMRKGMA; -.
DR Proteomes; UP000002356; Chromosome 19.
DR Bgee; ENSOARG00000000498; Expressed in gastric lymph node and 53 other cell types or tissues.
DR GO; GO:0008305; C:integrin complex; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0007229; P:integrin-mediated signaling pathway; IEA:UniProtKB-KW.
DR Gene3D; 1.20.5.930; Bicelle-embedded integrin alpha(iib) transmembrane segment; 1.
DR Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 1.
DR Gene3D; 2.60.40.1460; Integrin domains. Chain A, domain 2; 1.
DR Gene3D; 2.60.40.1510; Integrin, alpha v. Chain A, domain 3; 1.
DR Gene3D; 2.60.40.1530; Integrin, alpha v. Chain A, domain 4; 1.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR013519; Int_alpha_beta-p.
DR InterPro; IPR000413; Integrin_alpha.
DR InterPro; IPR018184; Integrin_alpha_C_CS.
DR InterPro; IPR013649; Integrin_alpha_Ig-like_1.
DR InterPro; IPR048285; Integrin_alpha_Ig-like_2.
DR InterPro; IPR048286; Integrin_alpha_Ig-like_3.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR032695; Integrin_dom_sf.
DR PANTHER; PTHR23220; INTEGRIN ALPHA; 1.
DR PANTHER; PTHR23220:SF69; INTEGRIN ALPHA-9; 1.
DR Pfam; PF01839; FG-GAP; 2.
DR Pfam; PF08441; Integrin_A_Ig_1; 1.
DR Pfam; PF20805; Integrin_A_Ig_2; 1.
DR Pfam; PF20806; Integrin_A_Ig_3; 1.
DR PRINTS; PR01185; INTEGRINA.
DR SMART; SM00191; Int_alpha; 5.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 1.
DR SUPFAM; SSF69179; Integrin domains; 3.
DR PROSITE; PS51470; FG_GAP; 4.
DR PROSITE; PS00242; INTEGRIN_ALPHA; 1.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889,
KW ECO:0000256|RuleBase:RU003762};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Integrin {ECO:0000256|ARBA:ARBA00023037, ECO:0000256|RuleBase:RU003762};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|RuleBase:RU003762};
KW Receptor {ECO:0000256|ARBA:ARBA00023170, ECO:0000256|RuleBase:RU003762};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|RuleBase:RU003762};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692,
KW ECO:0000256|RuleBase:RU003762};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|RuleBase:RU003762}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|RuleBase:RU003762"
FT CHAIN 30..1035
FT /evidence="ECO:0000256|RuleBase:RU003762"
FT /id="PRO_5001426612"
FT TRANSMEM 978..1002
FT /note="Helical"
FT /evidence="ECO:0000256|RuleBase:RU003762"
FT REPEAT 35..96
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
FT REPEAT 290..349
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
FT REPEAT 351..408
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
FT REPEAT 413..474
FT /note="FG-GAP"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00803"
FT DOMAIN 459..618
FT /note="Integrin alpha first immunoglobulin-like"
FT /evidence="ECO:0000259|Pfam:PF08441"
FT DOMAIN 620..763
FT /note="Integrin alpha second immunoglobulin-like"
FT /evidence="ECO:0000259|Pfam:PF20805"
FT DOMAIN 772..965
FT /note="Integrin alpha third immunoglobulin-like"
FT /evidence="ECO:0000259|Pfam:PF20806"
SQ SEQUENCE 1035 AA; 114388 MW; DD1677D474904BF2 CRC64;
MGGSAAQRGA GGLRALLLAL VAAGTPAGAY NLDPQRPVRF QGPAGSFFGY AVLEHFHDNT
RWVLVGAPKA DSKYSTSVKS PGAVFKCRVH TNPDRRCTEL DMARGKNRGM PCGKTCREDR
DDEWMGVSLA RQPKADGRVL ACAHRWKNIY YESDHILPHG FCYIIPSNLQ AKGRTLIPCY
EEYKKKYGEE HGSCQAGIAG FFTEELVVMG APGSFYWSGT VKVLNLTDNT YFKLNDEVIM
NRRYTYLGYA VTAGHFSHVS STDVVGGAPQ DEGIGKVYIF RADRRSGTLI KIFQASGKKM
GSYFGSSLCA VDLNTDGLSD LLVGAPMFSE IRDEGQVTVY INKGNGVLEE QLTLSGDGAY
NAHFGESIAS LGDLDDDGFP DVAIGAPKED DFSGAVYIYH GDARGMVPQY SMKLSGRKIS
PVLRMFGQSI SGGIDMDGNS YPDVTIGAFM SDSVVLLRAR PVITVDVSIF LPASINITAP
QCHDGQQPVN CLNVTACFSF HGKHVPGEIG LNYVLTADVA KKEKSQLPRV YFVLLGESVG
QVSEKLRLVH LEETCHHYVA HVKRRVQDVI SPIVFEAAYS LGEHVTGEEE KELPALTPVL
RWKKGQKIAQ KNQTVFERNC RSEDCAADLQ LQGHLLLSSV DEKTPYLALG AVKNISLNIS
ISNLGDDAYD ANVSFNVSRE LFFINMWQKE EMGISCELLE LDFLKCSVGF PFMRSKSKYE
FSVIFDTSHL SGEEEVLSFI VTAQSGNVER SESLHNNILT LMVPLMHEVD TSITGIMSPT
SFVYGESVDA SNFIQLDDLE CHFQPLNVTL QVYNTGPSTL PGSSVSISFP SRLSPGGAEM
FHVQEMVVGQ EKGNCSFQKN PTPCIIPQEQ ENIFHTIFAF FTKSGRKVLD CEKTGISCLT
MYCNLSALAK EESRTIDIYM LLNTEILKKD SSSVIQFMTH AKVKVDPSLR VVEVASGNPE
EMTVVFEALH NLEPRGYVVG WIIAISLLVG ILIFLLLAVL LWKMGFFRRR YKEIIEAEKN
RKENEDSWDW VQKNQ
//