ID H2Q3G4_PANTR Unreviewed; 387 AA.
AC H2Q3G4;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 24-JAN-2024, entry version 79.
DE SubName: Full=ALX homeobox 4 {ECO:0000313|Ensembl:ENSPTRP00000006127.6};
GN Name=ALX4 {ECO:0000313|Ensembl:ENSPTRP00000006127.6,
GN ECO:0000313|VGNC:VGNC:11255};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000006127.6, ECO:0000313|Proteomes:UP000002277};
RN [1] {ECO:0000313|Ensembl:ENSPTRP00000006127.6, ECO:0000313|Proteomes:UP000002277}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16136131; DOI=10.1038/nature04072;
RG Chimpanzee sequencing and analysis consortium;
RT "Initial sequence of the chimpanzee genome and comparison with the human
RT genome.";
RL Nature 437:69-87(2005).
RN [2] {ECO:0000313|Ensembl:ENSPTRP00000006127.6}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACZ04016403; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AACZ04016404; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; H2Q3G4; -.
DR Ensembl; ENSPTRT00000006641.6; ENSPTRP00000006127.6; ENSPTRG00000003529.7.
DR VGNC; VGNC:11255; ALX4.
DR eggNOG; KOG0490; Eukaryota.
DR GeneTree; ENSGT00940000159662; -.
DR HOGENOM; CLU_047013_0_0_1; -.
DR OMA; PCYGKDN; -.
DR TreeFam; TF350743; -.
DR Proteomes; UP000002277; Chromosome 11.
DR Bgee; ENSPTRG00000003529; Expressed in fibroblast and 4 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003654; OAR_dom.
DR PANTHER; PTHR24329; HOMEOBOX PROTEIN ARISTALESS; 1.
DR PANTHER; PTHR24329:SF322; HOMEOBOX PROTEIN ARISTALESS-LIKE 4; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 188..248
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 367..380
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 190..249
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 77..121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 160..195
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 387 AA; 41613 MW; 13500ED12F86E2C1 CRC64;
MNAETCVSYC ESPAAAMDAY YSPVSQSREG SSPFRAFPGG DKFGTTFLSA AAKAQGFGDA
KSRARYGAGQ QDLATPLESG AAPQQQQPQP QPPAQPHLYL QRGACKTPPD GSLKLQEGSS
GHSAALQVPC YAKESSLGEP ELPPDSDTVG MDSSYLSVKE AGVKGPQDRA SSDLPSPLEK
ADSESNKGKK RRNRTTFTSY QLEELEKVFQ KTHYPDVYAR EQLAMRTDLT EARVQVWFQN
RRAKWRKRER FGQMQQVRTH FSTAYELPLL TRAENYAQIQ NPSWLGNNGA ASPVPACVVP
CDPVPACMSP HAHPPGSGAS SVTDFLSVSG AGSHVGQTHM GSLFGAASLS PGLNGYELNG
EPDRKTSSIA ALRMKAKEHS AAISWAT
//