GenomeNet

Database: UniProt
Entry: A0A1A6H8E5_NEOLE
ID   A0A1A6H8E5_NEOLE        Unreviewed;       396 AA.
AC   A0A1A6H8E5;
DT   05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT   05-OCT-2016, sequence version 1.
DT   24-JAN-2024, entry version 28.
DE   RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
DE   Flags: Fragment;
GN   ORFNames=A6R68_15326 {ECO:0000313|EMBL:OBS74135.1};
OS   Neotoma lepida (Desert woodrat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC   Cricetidae; Neotominae; Neotoma.
OX   NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS74135.1, ECO:0000313|Proteomes:UP000092124};
RN   [1] {ECO:0000313|EMBL:OBS74135.1, ECO:0000313|Proteomes:UP000092124}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=417 {ECO:0000313|EMBL:OBS74135.1};
RC   TISSUE=Liver {ECO:0000313|EMBL:OBS74135.1};
RA   Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT   "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT   lepida.";
RL   Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108}.
CC   -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC       {ECO:0000256|ARBA:ARBA00008446}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OBS74135.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LZPO01044486; OBS74135.1; -; Genomic_DNA.
DR   STRING; 56216.A0A1A6H8E5; -.
DR   Proteomes; UP000092124; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR008422; Homeobox_KN_domain.
DR   InterPro; IPR003893; Iroquois_homeo.
DR   PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR   PANTHER; PTHR11211:SF11; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX-6; 1.
DR   Pfam; PF05920; Homeobox_KN; 1.
DR   SMART; SM00389; HOX; 1.
DR   SMART; SM00548; IRO; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   3: Inferred from homology;
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000092124}.
FT   DOMAIN          98..161
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        100..162
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          161..229
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        161..182
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        210..226
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         396
FT                   /evidence="ECO:0000313|EMBL:OBS74135.1"
SQ   SEQUENCE   396 AA;  42673 MW;  2F9A4DE93D015DC9 CRC64;
     MLCSGSSATC CETAPRSVSD VVSASTAAST LCCTPYDSRL LGSARPELGA ALSIYGAPYA
     AAQSYPGYLP YGPEPSPLCG ALPTLGQYQY DRYGGVELSS AGRRKNATRE STSALKAWLH
     EHRKNPYPTK GEKIMLAIIT KMTLTQVSTW XAXARRRLKK ENKMTWAPKN KGGEERKAEG
     AGEDSLGCLA GDTKDATASQ EAQGLRLSDL EDLEEEEEEE EADEEEAAVS ASRRLADFPK
     SAQXLPAPCA AAQEGRLESR ECGLAVSRFS FTEAPRSGEA DFIXTEPSGP TMIVHYPSGQ
     KPRIWSLAHT AAASAXESAP STPPRAQSPE CHMIPRQPIS IRRLLVPRDS AVEEDSLATK
     AFGKSTFTLQ GLPLNCAPCP RRREPEVRFQ YPSGAE
//