Database: UniProt
Entry: G3X0Z0_SARHA
ID   G3X0Z0_SARHA            Unreviewed;       325 AA.
AC   G3X0Z0;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 2.
DT   27-MAR-2024, entry version 85.
DE   SubName: Full=Orthopedia homeobox {ECO:0000313|Ensembl:ENSSHAP00000021345.2};
GN   Name=OTP {ECO:0000313|Ensembl:ENSSHAP00000021345.2};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021345.2, ECO:0000313|Proteomes:UP000007648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000021345.2, ECO:0000313|Proteomes:UP000007648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA   Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA   Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA   Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA   Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA   Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered marsupial
RT   Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000021345.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC       ECO:0000256|RuleBase:RU000682}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G3X0Z0; -.
DR   STRING; 9305.ENSSHAP00000021345; -.
DR   Ensembl; ENSSHAT00000021518.2; ENSSHAP00000021345.2; ENSSHAG00000018090.2.
DR   eggNOG; KOG0490; Eukaryota.
DR   GeneTree; ENSGT00940000159952; -.
DR   HOGENOM; CLU_056068_0_0_1; -.
DR   InParanoid; G3X0Z0; -.
DR   OrthoDB; 5398847at2759; -.
DR   TreeFam; TF351614; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0001650; C:fibrillar center; IEA:Ensembl.
DR   GO; GO:0016604; C:nuclear body; IEA:Ensembl.
DR   GO; GO:0001227; F:DNA-binding transcription repressor activity, RNA polymerase II-specific; IEA:Ensembl.
DR   GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IEA:Ensembl.
DR   GO; GO:0071542; P:dopaminergic neuron differentiation; IEA:Ensembl.
DR   GO; GO:0021879; P:forebrain neuron differentiation; IEA:Ensembl.
DR   GO; GO:0021979; P:hypothalamus cell differentiation; IEA:Ensembl.
DR   GO; GO:0007405; P:neuroblast proliferation; IEA:Ensembl.
DR   GO; GO:0061101; P:neuroendocrine cell differentiation; IEA:Ensembl.
DR   GO; GO:0021985; P:neurohypophysis development; IEA:Ensembl.
DR   GO; GO:0002052; P:positive regulation of neuroblast proliferation; IEA:Ensembl.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR000047; HTH_motif.
DR   InterPro; IPR003654; OAR_dom.
DR   PANTHER; PTHR46770; HOMEOBOX PROTEIN ORTHOPEDIA; 1.
DR   PANTHER; PTHR46770:SF1; HOMEOBOX PROTEIN ORTHOPEDIA; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   Pfam; PF03826; OAR; 1.
DR   PRINTS; PR00031; HTHREPRESSR.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
DR   PROSITE; PS50803; OAR; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT   DOMAIN          102..162
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          306..319
FT                   /note="OAR"
FT                   /evidence="ECO:0000259|PROSITE:PS50803"
FT   DNA_BIND        104..163
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          32..112
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          280..299
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        62..76
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        86..102
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        283..299
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   325 AA;  34225 MW;  B4C9ED88AADC320B CRC64;
     MLSHADLLDA RLGMKDAAEL LGHREAVKCR LGVGGSDPGG HPGDLAPNSD TVEGATLLPG
     EDITTVGSNT ASLAVSAKDP DKQPGPQGGQ NPSQAGQQQG QQKQKRHRTR FTPAQLNELE
     RSFAKTHYPD IFMREELALR IGLTESRVQV WFQNRRAKWK KRKKTTNVFR APGTLLPTPG
     LPQFPSAAAA AAAAMGDSLC SFHANDTRWA AAAMPGVSQL PLPPALGRQQ AMAQSLSQCS
     LAAGPPPNSM GLSNSLAGSN GAGLQSHLYQ PAFPGMVPAS LPGPSNVTGS PQLCSSPDSS
     DVWRGTSIAS LRRKALEHTV SMSFT
//