Database: UniProt
Entry: A0A1A6GFH9_NEOLE
ID   A0A1A6GFH9_NEOLE        Unreviewed;       312 AA.
AC   A0A1A6GFH9;
DT   05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT   05-OCT-2016, sequence version 1.
DT   27-MAR-2024, entry version 41.
DE   RecName: Full=Homeobox domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=A6R68_07404 {ECO:0000313|EMBL:OBS64057.1};
OS   Neotoma lepida (Desert woodrat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC   Cricetidae; Neotominae; Neotoma.
OX   NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS64057.1, ECO:0000313|Proteomes:UP000092124};
RN   [1] {ECO:0000313|EMBL:OBS64057.1, ECO:0000313|Proteomes:UP000092124}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=417 {ECO:0000313|EMBL:OBS64057.1};
RC   TISSUE=Liver {ECO:0000313|EMBL:OBS64057.1};
RA   Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT   "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT   lepida.";
RL   Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC       ECO:0000256|RuleBase:RU000682}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OBS64057.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LZPO01097440; OBS64057.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1A6GFH9; -.
DR   STRING; 56216.A0A1A6GFH9; -.
DR   Proteomes; UP000092124; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR000047; HTH_motif.
DR   InterPro; IPR003654; OAR_dom.
DR   PANTHER; PTHR46770; HOMEOBOX PROTEIN ORTHOPEDIA; 1.
DR   PANTHER; PTHR46770:SF1; HOMEOBOX PROTEIN ORTHOPEDIA; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   Pfam; PF03826; OAR; 1.
DR   PRINTS; PR00031; HTHREPRESSR.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
DR   PROSITE; PS50803; OAR; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000092124}.
FT   DOMAIN          89..149
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          293..306
FT                   /note="OAR"
FT                   /evidence="ECO:0000259|PROSITE:PS50803"
FT   DNA_BIND        91..150
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          1..98
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        75..89
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   312 AA;  32765 MW;  5C709138835EDA5C CRC64;
     MKDAAELLGH REAVKCRLGV GGSDPGGHPG DLAPNSDPVE GATLLPGEDI TTVGSTPASL
     AVSAKDPDKQ PGPQGGPNPS QAGQQQGQQK QKRHRTRFTP AQLNELERSF AKTHYPDIFM
     REELALRIGL TESRVQVWFQ NRRAKWKKRK KTTNVFRAPG TLLPTPGLPQ FPSAAAAAAA
     AMGDSLCSFH ANDTRWAAAA MPGVSQLPLP PALGRQQAMA QSLSQCSLAA GPPPNSMGLS
     NSLAGSNGAG LQSHLYQPAF PGMVPASLPG PSNVSGSPQL CSSPDSSDVW RGTSIASLRR
     KALEHTVSMS FT
//