GenomeNet

Database: UniProt
Entry: G7PGC9_MACFA
ID   G7PGC9_MACFA            Unreviewed;       969 AA.
AC   G7PGC9;
DT   25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT   25-JAN-2012, sequence version 1.
DT   24-JAN-2024, entry version 43.
DE   RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
GN   ORFNames=EGM_02081 {ECO:0000313|EMBL:EHH65335.1};
OS   Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC   Cercopithecidae; Cercopithecinae; Macaca.
OX   NCBI_TaxID=9541;
RN   [1] {ECO:0000313|EMBL:EHH65335.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CE-4 {ECO:0000313|EMBL:EHH65335.1};
RX   PubMed=22002653; DOI=10.1038/nbt.1992;
RA   Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., Li Q.,
RA   Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., Huang Z.,
RA   Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., Huang Y.,
RA   Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., Li B., Liu X.,
RA   Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., Li Y., Wang W.,
RA   Katze M.G., Su B., Nielsen R., Yang H., Wang J., Wang X., Wang J.;
RT   "Genome sequencing and comparison of two nonhuman primate animal models,
RT   the cynomolgus and Chinese rhesus macaques.";
RL   Nat. Biotechnol. 29:1019-1023(2011).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   -!- SIMILARITY: Belongs to the ZHX family. {ECO:0000256|ARBA:ARBA00007440}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001285; EHH65335.1; -; Genomic_DNA.
DR   AlphaFoldDB; G7PGC9; -.
DR   eggNOG; ENOG502RC6G; Eukaryota.
DR   Proteomes; UP000009130; Chromosome 10.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd00086; homeodomain; 4.
DR   Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 5.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR024578; Homez_homeobox_dom.
DR   InterPro; IPR041057; ZHX_Znf_C2H2.
DR   InterPro; IPR036236; Znf_C2H2_sf.
DR   InterPro; IPR013087; Znf_C2H2_type.
DR   PANTHER; PTHR15467:SF6; ZINC FINGERS AND HOMEOBOXES PROTEIN 3; 1.
DR   PANTHER; PTHR15467; ZINC-FINGERS AND HOMEOBOXES RELATED; 1.
DR   Pfam; PF00046; Homeodomain; 3.
DR   Pfam; PF11569; Homez; 1.
DR   Pfam; PF18387; zf_C2H2_ZHX; 1.
DR   SMART; SM00389; HOX; 4.
DR   SMART; SM00355; ZnF_C2H2; 2.
DR   SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 4.
DR   PROSITE; PS50071; HOMEOBOX_2; 4.
PE   3: Inferred from homology;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}.
FT   DOMAIN          319..362
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          502..552
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          622..670
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          772..822
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        321..363
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   DNA_BIND        504..553
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   DNA_BIND        624..671
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   DNA_BIND        774..823
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          22..66
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          153..173
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          598..618
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          666..695
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          928..948
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        50..66
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        666..680
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        681..695
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   969 AA;  106500 MW;  067133E2B12790EE CRC64;
     MASKRKSTTP CMIPVKTVVV QDASMEAQPT ETLPEGPQQD LPPEAPVASS EAAQNPSSTD
     GSTLANGHRS TLDGYLYSCK YCDFRSHDMT QFVGHMNSEH TDFNKDPTFV CSGCSFLAKS
     PEGLSLHNAT CHSGEASFVW NVAKPDNHVV VEQSVPESTS TPDLAGEPSA EGADGQAEII
     ITKTPIMKIM KGKAEAKKIH TLKENVPSQP VGEALPKLST GEMEVREGDH SFINGAVPVS
     QASASSAKNP HAANGPLIGT VPVLPAGIAQ FLSLQQQPPV HTQHHAHQPL PTAKALPKVM
     IPLSSIPTYN AAMDSNSFLK NSFHKFPYPT KAELCYLTVV TKYPEEQLKI WFTAQRLKQG
     ISWSPEEIED ARKKMFNTVI QSVPQPTITV LNTPLVASAG NVQHLIQAAL PGHVVGQPEG
     TGGGLLVTQP LMANGLQATS SSLPLTVTSV PKQPGVAPIN TVCSNTTSAV KVVNAAQSLL
     TACPSITSQA FLDASIYKNK KSHEQLSALK GSFCRNQFPG QSEVEHLTKV TGLSTREVRK
     WFSDRRYHCR NLKGSRVTMP GDHSSMIIDS VPEVSFSPSS KVPEVTCVPT TATLATHPSA
     KRQSWHQTPD FTPTKYKERA PEQLRALESS FAQNPLPLDE ELDRLRSETK MTRREIDSWF
     SERRKKVNAE ETKKAEENAS QEEEEAVEDE GGEEDLASEL RVSGENGSLE MPSSHILAER
     KVSPIKINLK NLRVTEANGR NEIPGLGACD PEDDGSNKLA EQLPGKVSCK KTAQQRHLLR
     QLFVQTQWPS NQDYDSIMAQ TGLPRPEVVR WFGDSRYALK NGQLKWYEDY KRGNFPPGLL
     VIAPGNRELL QDYYITHKMM YEEDLQNLCD KTQMSSQQKQ TEFDLINVKD WPVWETACHV
     EEPSPTLCWH MLFPCLVAEH LGELPESSQT AQSLPLPSAC PPPSKQQARW GSHQFFLPQC
     RTFPLPSNG
//