ID G8F3R1_MACFA Unreviewed; 389 AA.
AC G8F3R1;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHH61925.1};
DE Flags: Fragment;
GN ORFNames=EGM_20067 {ECO:0000313|EMBL:EHH61925.1};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541 {ECO:0000313|Proteomes:UP000009130};
RN [1] {ECO:0000313|EMBL:EHH61925.1, ECO:0000313|Proteomes:UP000009130}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH61925.1,
RC ECO:0000313|Proteomes:UP000009130};
RX PubMed=22002653; DOI=10.1038/nbt.1992;
RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., Li Q.,
RA Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., Huang Z.,
RA Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., Huang Y.,
RA Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., Li B., Liu X.,
RA Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., Li Y., Wang W.,
RA Katze M.G., Su B., Nielsen R., Yang H., Wang J., Wang X., Wang J.;
RT "Genome sequencing and comparison of two nonhuman primate animal models,
RT the cynomolgus and Chinese rhesus macaques.";
RL Nat. Biotechnol. 29:1019-1023(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/PBX homeobox family.
CC {ECO:0000256|ARBA:ARBA00007601}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH330515; EHH61925.1; -; Genomic_DNA.
DR AlphaFoldDB; G8F3R1; -.
DR eggNOG; KOG0774; Eukaryota.
DR Proteomes; UP000009130; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR005542; PBX_PBC_dom.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR PANTHER; PTHR11850:SF97; PRE-B-CELL LEUKEMIA TRANSCRIPTION FACTOR 3; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF03792; PBC; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51978; PBC; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}.
FT DOMAIN 1..168
FT /note="PBC"
FT /evidence="ECO:0000259|PROSITE:PS51978"
FT DOMAIN 167..251
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 169..252
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 281..302
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 358..389
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EHH61925.1"
FT NON_TER 389
FT /evidence="ECO:0000313|EMBL:EHH61925.1"
SQ SEQUENCE 389 AA; 42601 MW; 07B2A2BF2E66BCAA CRC64;
RKHALNCHRM KPALFSVLCE IKEKTGLSIR GAQEEDPPDP QLMRLDNMLL AEGVSGPEKG
GGSAAAAAAA AASGGSSDNS IEHSDYRAKL TQIRQIYHTE LEKYEQACNE FTTHVMNLLR
EQSRTRPISP KEIERMVGII HRKFSSIQMQ LKQSTCEAVM ILRSRFLDAR RKRRNFSKQA
TEILNEYFYS HLSNPYPSEE AKEELAKKCS ITVSQSLIKD PKERGNKGSD IRQTSVVSNW
FGNKRIRYKK NIGKFQEEAN LYAAKTAVTA AHAVAAAVQN NQTNSPTTPN SGSSGSFNLP
NSGDMFMNMQ SLNGDSYQGS QVGANVQSQV DTLRHVINQT GGYSDGLGGN SLYSPHNLNA
NGGWQDATTP SSVTSPTEGP GSVHSDTSN
//