ID A0A452GMF2_9SAUR Unreviewed; 303 AA.
AC A0A452GMF2;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE RecName: Full=Homeobox protein {ECO:0000256|PIRNR:PIRNR000563};
OS Gopherus agassizii (Agassiz's desert tortoise).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC Testudinoidea; Testudinidae; Gopherus.
OX NCBI_TaxID=38772 {ECO:0000313|Ensembl:ENSGAGP00000002992.1, ECO:0000313|Proteomes:UP000291020};
RN [1] {ECO:0000313|Proteomes:UP000291020}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=28562605;
RA Tollis M., DeNardo D.F., Cornelius J.A., Dolby G.A., Edwards T.,
RA Henen B.T., Karl A.E., Murphy R.W., Kusumi K.;
RT "The Agassiz's desert tortoise genome provides a resource for the
RT conservation of a threatened species.";
RL PLoS ONE 12:e0177708-e0177708(2017).
RN [2] {ECO:0000313|Ensembl:ENSGAGP00000002992.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PIRNR:PIRNR000563, ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family. Bicoid subfamily.
CC {ECO:0000256|ARBA:ARBA00006503, ECO:0000256|PIRNR:PIRNR000563}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A452GMF2; -.
DR STRING; 38772.ENSGAGP00000002992; -.
DR Ensembl; ENSGAGT00000003426.1; ENSGAGP00000002992.1; ENSGAGG00000002386.1.
DR Proteomes; UP000291020; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR016233; Homeobox_Pitx/unc30.
DR InterPro; IPR003654; OAR_dom.
DR PANTHER; PTHR45882:SF1; PITUITARY HOMEOBOX 1; 1.
DR PANTHER; PTHR45882; PITUITARY HOMEOBOX HOMOLOG PTX1; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR PIRSF; PIRSF000563; Homeobox_protein_Pitx/Unc30; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473,
KW ECO:0000256|PIRNR:PIRNR000563};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR000563};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR000563};
KW Reference proteome {ECO:0000313|Proteomes:UP000291020}.
FT DOMAIN 74..134
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 269..282
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 76..135
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..86
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 21..51
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 52..74
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 303 AA; 33968 MW; 09FFE38CD4831B7F CRC64;
MDSFKGGMNL ERLPESLRPQ PSHDMASSFH LQRSSESRDP IENSASESSD TEVPDRSGEQ
KSEDGNPEDP AKKKKQRRQR THFTSQQLQE LEATFQRNRY PDMSMREEIA VWTNLTEPRV
RVWFKNRRAK WRKRERNQQM DLCKNGYVPQ FSGLMQPYED MYPGYPYNNW ATKSLTPAPL
STKSFTFFNS MSPLSSQSMF SAPSTISSMS MPSSMGHSAV PGMANANLNN INNLSNISGS
SLNSAMSSPA CPYGPPGSPY SVYRDTCNSS LASLRLKSKQ HSTFGYSSLQ SPGSGLNACQ
YNS
//