ID A0A1A6GFH9_NEOLE Unreviewed; 312 AA.
AC A0A1A6GFH9;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 41.
DE RecName: Full=Homeobox domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A6R68_07404 {ECO:0000313|EMBL:OBS64057.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS64057.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS64057.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS64057.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS64057.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS64057.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01097440; OBS64057.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6GFH9; -.
DR STRING; 56216.A0A1A6GFH9; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000047; HTH_motif.
DR InterPro; IPR003654; OAR_dom.
DR PANTHER; PTHR46770; HOMEOBOX PROTEIN ORTHOPEDIA; 1.
DR PANTHER; PTHR46770:SF1; HOMEOBOX PROTEIN ORTHOPEDIA; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000092124}.
FT DOMAIN 89..149
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 293..306
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 91..150
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..98
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 75..89
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 312 AA; 32765 MW; 5C709138835EDA5C CRC64;
MKDAAELLGH REAVKCRLGV GGSDPGGHPG DLAPNSDPVE GATLLPGEDI TTVGSTPASL
AVSAKDPDKQ PGPQGGPNPS QAGQQQGQQK QKRHRTRFTP AQLNELERSF AKTHYPDIFM
REELALRIGL TESRVQVWFQ NRRAKWKKRK KTTNVFRAPG TLLPTPGLPQ FPSAAAAAAA
AMGDSLCSFH ANDTRWAAAA MPGVSQLPLP PALGRQQAMA QSLSQCSLAA GPPPNSMGLS
NSLAGSNGAG LQSHLYQPAF PGMVPASLPG PSNVSGSPQL CSSPDSSDVW RGTSIASLRR
KALEHTVSMS FT
//