ID G7PGC9_MACFA Unreviewed; 969 AA.
AC G7PGC9;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 43.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN ORFNames=EGM_02081 {ECO:0000313|EMBL:EHH65335.1};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1] {ECO:0000313|EMBL:EHH65335.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH65335.1};
RX PubMed=22002653; DOI=10.1038/nbt.1992;
RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., Li Q.,
RA Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., Huang Z.,
RA Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., Huang Y.,
RA Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., Li B., Liu X.,
RA Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., Li Y., Wang W.,
RA Katze M.G., Su B., Nielsen R., Yang H., Wang J., Wang X., Wang J.;
RT "Genome sequencing and comparison of two nonhuman primate animal models,
RT the cynomolgus and Chinese rhesus macaques.";
RL Nat. Biotechnol. 29:1019-1023(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the ZHX family. {ECO:0000256|ARBA:ARBA00007440}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001285; EHH65335.1; -; Genomic_DNA.
DR AlphaFoldDB; G7PGC9; -.
DR eggNOG; ENOG502RC6G; Eukaryota.
DR Proteomes; UP000009130; Chromosome 10.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00086; homeodomain; 4.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 5.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR024578; Homez_homeobox_dom.
DR InterPro; IPR041057; ZHX_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR15467:SF6; ZINC FINGERS AND HOMEOBOXES PROTEIN 3; 1.
DR PANTHER; PTHR15467; ZINC-FINGERS AND HOMEOBOXES RELATED; 1.
DR Pfam; PF00046; Homeodomain; 3.
DR Pfam; PF11569; Homez; 1.
DR Pfam; PF18387; zf_C2H2_ZHX; 1.
DR SMART; SM00389; HOX; 4.
DR SMART; SM00355; ZnF_C2H2; 2.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 4.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}.
FT DOMAIN 319..362
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 502..552
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 622..670
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 772..822
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 321..363
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 504..553
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 624..671
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 774..823
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 22..66
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 153..173
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 598..618
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 666..695
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 928..948
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..66
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 666..680
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 681..695
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 969 AA; 106500 MW; 067133E2B12790EE CRC64;
MASKRKSTTP CMIPVKTVVV QDASMEAQPT ETLPEGPQQD LPPEAPVASS EAAQNPSSTD
GSTLANGHRS TLDGYLYSCK YCDFRSHDMT QFVGHMNSEH TDFNKDPTFV CSGCSFLAKS
PEGLSLHNAT CHSGEASFVW NVAKPDNHVV VEQSVPESTS TPDLAGEPSA EGADGQAEII
ITKTPIMKIM KGKAEAKKIH TLKENVPSQP VGEALPKLST GEMEVREGDH SFINGAVPVS
QASASSAKNP HAANGPLIGT VPVLPAGIAQ FLSLQQQPPV HTQHHAHQPL PTAKALPKVM
IPLSSIPTYN AAMDSNSFLK NSFHKFPYPT KAELCYLTVV TKYPEEQLKI WFTAQRLKQG
ISWSPEEIED ARKKMFNTVI QSVPQPTITV LNTPLVASAG NVQHLIQAAL PGHVVGQPEG
TGGGLLVTQP LMANGLQATS SSLPLTVTSV PKQPGVAPIN TVCSNTTSAV KVVNAAQSLL
TACPSITSQA FLDASIYKNK KSHEQLSALK GSFCRNQFPG QSEVEHLTKV TGLSTREVRK
WFSDRRYHCR NLKGSRVTMP GDHSSMIIDS VPEVSFSPSS KVPEVTCVPT TATLATHPSA
KRQSWHQTPD FTPTKYKERA PEQLRALESS FAQNPLPLDE ELDRLRSETK MTRREIDSWF
SERRKKVNAE ETKKAEENAS QEEEEAVEDE GGEEDLASEL RVSGENGSLE MPSSHILAER
KVSPIKINLK NLRVTEANGR NEIPGLGACD PEDDGSNKLA EQLPGKVSCK KTAQQRHLLR
QLFVQTQWPS NQDYDSIMAQ TGLPRPEVVR WFGDSRYALK NGQLKWYEDY KRGNFPPGLL
VIAPGNRELL QDYYITHKMM YEEDLQNLCD KTQMSSQQKQ TEFDLINVKD WPVWETACHV
EEPSPTLCWH MLFPCLVAEH LGELPESSQT AQSLPLPSAC PPPSKQQARW GSHQFFLPQC
RTFPLPSNG
//