GenomeNet

Database: UniProt
Entry: A0A5E4AZY4_MARMO
LinkDB: A0A5E4AZY4_MARMO
Original site: A0A5E4AZY4_MARMO 
ID   A0A5E4AZY4_MARMO        Unreviewed;       199 AA.
AC   A0A5E4AZY4;
DT   13-NOV-2019, integrated into UniProtKB/TrEMBL.
DT   13-NOV-2019, sequence version 1.
DT   27-MAR-2024, entry version 17.
DE   RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN   ORFNames=GHT09_016923 {ECO:0000313|EMBL:KAF7461088.1}, MONAX_5E011490
GN   {ECO:0000313|EMBL:VTJ62977.1};
OS   Marmota monax (Woodchuck).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC   Xerinae; Marmotini; Marmota.
OX   NCBI_TaxID=9995 {ECO:0000313|EMBL:VTJ62977.1, ECO:0000313|Proteomes:UP000335636};
RN   [1] {ECO:0000313|EMBL:VTJ62977.1, ECO:0000313|Proteomes:UP000335636}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Alioto T., Alioto T.;
RL   Submitted (APR-2019) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:KAF7461088.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=WC2-LM {ECO:0000313|EMBL:KAF7461088.1};
RC   TISSUE=Liver {ECO:0000313|EMBL:KAF7461088.1};
RA   Shumante A., Zimin A.V., Puiu D., Salzberg S.L.;
RL   Submitted (AUG-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; WJEC01008724; KAF7461088.1; -; Genomic_DNA.
DR   EMBL; CABDUW010000215; VTJ62977.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A5E4AZY4; -.
DR   Proteomes; UP000335636; Unassembled WGS sequence.
DR   Proteomes; UP000662637; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR003022; Otx2_TF.
DR   InterPro; IPR013851; Otx_TF_C.
DR   PANTHER; PTHR45793; HOMEOBOX PROTEIN; 1.
DR   PANTHER; PTHR45793:SF2; HOMEOBOX PROTEIN OTX2; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   Pfam; PF03529; TF_Otx; 1.
DR   PRINTS; PR01257; OTX2HOMEOBOX.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682};
KW   Reference proteome {ECO:0000313|Proteomes:UP000335636}.
FT   DOMAIN          1..53
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        3..54
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          48..99
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          170..199
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        174..199
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   199 AA;  21471 MW;  36D7CBFE7B54F6F4 CRC64;
     MGPEYKTNIL EALFAKTWYP DIFMQEEVAL KINLPESRMQ VWFKNGRAKC RQQQQQQQNG
     GQNKVTPAKK KTSPAGEVSS ESGTSGQFTP PSSTSVPTIA SSSAPVSIWR PDSISPLSDP
     LCTSSSCMQR SYPMTYTQAS GYSQGYAGST SYFGGMDCGS YLTPMHHQLR GPGATLSPTG
     TNAVTSHLNQ SPASLSIQE
//
DBGET integrated database retrieval system