ID G3WRF4_SARHA Unreviewed; 265 AA.
AC G3WRF4;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 73.
DE SubName: Full=Orthodenticle homeobox 2 {ECO:0000313|Ensembl:ENSSHAP00000018009.2};
GN Name=OTX2 {ECO:0000313|Ensembl:ENSSHAP00000018009.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000018009.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000018009.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000018009.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family. Bicoid subfamily.
CC {ECO:0000256|ARBA:ARBA00006503}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WRF4; -.
DR STRING; 9305.ENSSHAP00000018009; -.
DR Ensembl; ENSSHAT00000018158.2; ENSSHAP00000018009.2; ENSSHAG00000015291.2.
DR eggNOG; KOG2251; Eukaryota.
DR GeneTree; ENSGT00940000155014; -.
DR TreeFam; TF351179; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003022; Otx2_TF.
DR InterPro; IPR003025; Otx_TF.
DR InterPro; IPR013851; Otx_TF_C.
DR PANTHER; PTHR45793; HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45793:SF2; HOMEOBOX PROTEIN OTX2; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03529; TF_Otx; 1.
DR PRINTS; PR01257; OTX2HOMEOBOX.
DR PRINTS; PR01255; OTXHOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 12..72
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 14..73
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 69..119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 69..85
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..119
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 265 AA; 29030 MW; 3BF46ECCB33ACAB2 CRC64;
GVLIFPSLAT PRKQRRERTT FTRAQLDVLE ALFAKTRYPD IFMREEVALK INLPESRVQV
WFKNRRAKCR QQQQQQQNGG QNKVRPAKKK SSPVREVSSE SGTSGQFTPP SSTSVPAISS
SSAPVSIWSP ASISPLSDPL STSSSCMQRS YPMTYTQASG YSQGYAGSTS YFGGMDCGSY
LTPMHHQLPG PGATLSPMGT NAVTSHLNQS TASLTTQGYG ASSLGFNSTT DCLDYKDQTA
SWKLNFNADC LDYKDQTSSW KFQVL
//