ID A0A2K5W6M5_MACFA Unreviewed; 471 AA.
AC A0A2K5W6M5;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 24-JAN-2024, entry version 32.
DE SubName: Full=Iroquois homeobox 5 {ECO:0000313|Ensembl:ENSMFAP00000032746.2};
GN Name=IRX5 {ECO:0000313|Ensembl:ENSMFAP00000032746.2};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541 {ECO:0000313|Ensembl:ENSMFAP00000032746.2, ECO:0000313|Proteomes:UP000233100};
RN [1] {ECO:0000313|Ensembl:ENSMFAP00000032746.2, ECO:0000313|Proteomes:UP000233100}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSMFAP00000032746.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC {ECO:0000256|ARBA:ARBA00008446}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A2K5W6M5; -.
DR STRING; 9541.ENSMFAP00000032746; -.
DR Ensembl; ENSMFAT00000006970.2; ENSMFAP00000032746.2; ENSMFAG00000002928.2.
DR VEuPathDB; HostDB:ENSMFAG00000002928; -.
DR GeneTree; ENSGT00940000159483; -.
DR OrthoDB; 2915644at2759; -.
DR Proteomes; UP000233100; Chromosome 20.
DR Bgee; ENSMFAG00000002928; Expressed in lung and 2 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:1990837; F:sequence-specific double-stranded DNA binding; IEA:Ensembl.
DR GO; GO:0048701; P:embryonic cranial skeleton morphogenesis; IEA:Ensembl.
DR GO; GO:0008406; P:gonad development; IEA:Ensembl.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR003893; Iroquois_homeo.
DR PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR PANTHER; PTHR11211:SF17; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX-5; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00548; IRO; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000233100}.
FT DOMAIN 81..138
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 83..139
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..35
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 141..355
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 163..181
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 277..293
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 471 AA; 49728 MW; 5D1664073AD8C6E8 CRC64;
MSSAALLRAP RSRPTLARLP SRRPRRATTR TSSTAPTAAA AAFSSYVGSP YDHTPGMAGS
LGYHPYAAPL GSYPYGDPAY RKNATRDATA TLKAWLNEHR KNPYPTKGEK IMLAIITKMT
LTQVSTWFAN ARRRLKKENK MTWTPRNRSE DEEEEENIDL EKNDEDEPQK PEDKGDPEGP
EAGGAEQKAA SGCERLQGPP TPASKETEGS LSDSDFKEPP SEGRLDALQG APAPAGPPRL
GQRRRGWRRT RPSLPRGRAG AGPASSRGRA APGPGGPSVI HSPPPPPPPA VLAKPKLWSL
AEIATSSDKV KDGGGGSEGS PCPPCPGPIA GQALGGSRAS PTPAPSRSPS AQCPFPGGTV
LSRPLYYTAP FYPGYTNYGS FGHLHGHPGP GPGPTTGPGS HFNGLNQTVL NRADALAKDP
KMLRSQSQLD LCKDSPYELK KGTGEVARVC TCEKTQRGGA LGFSGNVTVK G
//