ID G3U9I4_LOXAF Unreviewed; 332 AA.
AC G3U9I4;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 84.
DE SubName: Full=Short stature homeobox {ECO:0000313|Ensembl:ENSLAFP00000024492.1};
GN Name=SHOX {ECO:0000313|Ensembl:ENSLAFP00000024492.1};
OS Loxodonta africana (African elephant).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta.
OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000024492.1, ECO:0000313|Proteomes:UP000007646};
RN [1] {ECO:0000313|Ensembl:ENSLAFP00000024492.1, ECO:0000313|Proteomes:UP000007646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000024492.1,
RC ECO:0000313|Proteomes:UP000007646};
RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Loxodonta africana (African elephant).";
RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLAFP00000024492.1}
RP IDENTIFICATION.
RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000024492.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3U9I4; -.
DR STRING; 9785.ENSLAFP00000024492; -.
DR Ensembl; ENSLAFT00000028406.1; ENSLAFP00000024492.1; ENSLAFG00000028340.1.
DR eggNOG; KOG0490; Eukaryota.
DR GeneTree; ENSGT00940000154287; -.
DR HOGENOM; CLU_047013_5_0_1; -.
DR InParanoid; G3U9I4; -.
DR OMA; RELGNSX; -.
DR TreeFam; TF350757; -.
DR Proteomes; UP000007646; Unassembled WGS sequence.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0001228; F:DNA-binding transcription activator activity, RNA polymerase II-specific; IEA:Ensembl.
DR GO; GO:1990837; F:sequence-specific double-stranded DNA binding; IEA:Ensembl.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000047; HTH_motif.
DR InterPro; IPR003654; OAR_dom.
DR PANTHER; PTHR46255; SHORT STATURE HOMEOBOX; 1.
DR PANTHER; PTHR46255:SF2; SHORT STATURE HOMEOBOX PROTEIN; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000007646}.
FT DOMAIN 148..208
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 314..327
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 150..209
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..152
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 32..81
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 125..152
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 332 AA; 36432 MW; EB825F480A1D9952 CRC64;
MEELTAFVSK SFDQKSKESS VGGGGGGKKD SITYREVLES GLARSREPRL GRKDPGNRSR
VFLKKETKPK SGRDGGRKAG PPGKAEARGP GAGAARSPRA RSPGRWLWAP SPELSRAAGS
EALRGACEEC KEKREDVKSE DEDGQTKLKQ RRSRTNFTLE QLNELERLFD ETHYPDAFMR
EELSQRLGLS EARVQVWFQN RRAKCRKQEN QMHKGVILGT ASHLDACRVA PYVNMGALRM
PFQQFPHLLT QVQAQLQLEG VAHAHPHLHP HLAAHAPYLM FPPPPFGLPI ASLAESASAA
AVVAAAAKSN SKNSSIADLR LKARKHAEAL GL
//