GenomeNet

Database: UniProt
Entry: A0A061E2P5_THECC
LinkDB: A0A061E2P5_THECC
Original site: A0A061E2P5_THECC 
ID   A0A061E2P5_THECC        Unreviewed;      1278 AA.
AC   A0A061E2P5;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 52.
DE   SubName: Full=DNA polymerase phi subunit {ECO:0000313|EMBL:EOX98606.1};
GN   ORFNames=TCM_007314 {ECO:0000313|EMBL:EOX98606.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX98606.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOX98606.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   -!- SIMILARITY: Belongs to the MYBBP1A family.
CC       {ECO:0000256|ARBA:ARBA00006809}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001880; EOX98606.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061E2P5; -.
DR   STRING; 3641.A0A061E2P5; -.
DR   EnsemblPlants; EOX98606; EOX98606; TCM_007314.
DR   Gramene; EOX98606; EOX98606; TCM_007314.
DR   eggNOG; KOG1926; Eukaryota.
DR   HOGENOM; CLU_003261_0_0_1; -.
DR   InParanoid; A0A061E2P5; -.
DR   OMA; VWKHDDP; -.
DR   Proteomes; UP000026915; Chromosome 2.
DR   GO; GO:0005730; C:nucleolus; IBA:GO_Central.
DR   GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR   GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR007015; DNA_pol_V/MYBBP1A.
DR   PANTHER; PTHR13213:SF2; MYB-BINDING PROTEIN 1A; 1.
DR   PANTHER; PTHR13213; MYB-BINDING PROTEIN 1A FAMILY MEMBER; 1.
DR   Pfam; PF04931; DNA_pol_phi; 1.
DR   SUPFAM; SSF48371; ARM repeat; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT   REGION          1..99
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          901..945
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        74..95
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        901..919
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1278 AA;  143233 MW;  8BC303F1944D46D3 CRC64;
     MGSKKRSINS VEEVVEGQTD LAADNTVSMP SDKKSKMFIK TDAQMGDGVA APSSVPSSIK
     PMERKKKRKQ LDKERRRSVL ENEESQPKQM NLESKRNDAW EPVASSSTIG LPEFHISVFK
     DLASANSSVR ESAVETLVTE LQEVQKAYDR LENKDLVEGV LKLEAQKNDG LDNCASSLRY
     AVRRLIRGVS SSRECARQGF ALGLTALVAT IPSIKVDSLL KLIVDLLEVT SSMKGQEVRD
     CLLGRLFAYG ALARSDRLIK EWFSDKDTLH IKEFMSAIIS LAAKKRYLQE PAVSIILEFV
     GKLPDEALID HILEAPGIPE WFQEAISVGN PDALLLALKI REKSSIDSTS FGELLPNPFS
     SSKLFSADYL SSIDNCLKES TFCQPRVHCL WPVLVNVLLP DTVLQAEDVA SISNSFKKYK
     KGRKSSSSEE EIVKNVQCFC EVVIEGSLLL SSHDRKHLAL DVLLLLLPRL PSSFVPIVLS
     YKLVQCLMDI LSTKDSWLYK VVQHFLKELL DWVSNDDVRR IAVIVAFQKH SNGKFDCVTK
     TKTVKGLVAD FKTETGCMLF VQNLINLFLD EGHASEEPSD QSQTTDENSE IGSIEDKDSI
     GIMGNADFLK SWVIESLPSV LKHLKLDPEA KFRVQKEILK FLAVQGLFSA SLGNEVTSFE
     LQEKFRWPKA ATSIALCRMC IEQLQSLLAN AQKVEEPRSL ANGLEPNDLG CYFMHFFSTL
     RNIPSVSLFR TVSDEDEQAV KKLQEMDSKL YKDERNCGLS SNANKLHALR YLLILLVLQV
     LLRPGEFCDA ASELIICCKK AFSAPDDLDS SGEDELDNDA APELMDVLVD TLLSLLPQSS
     APMRSAIEQV FKYFCGDVTD DGLLRMLRII KKDLKPARHQ EASSENDDDD LLGIEEDEDI
     DEAETAETAE SDEQSEDSEA VVGSEGADKE LPEDSDDSDG GMDDDAMFRM DTYLAQIFKE
     KKNQAGGETA QSQLVVFKLR VLSLLEIYLH ENRGKPQVLT VYSKLAQAFV NPHTMDGSEQ
     LGQRIWSILQ KKVFKEKKLP KDESMQLSTL ESLLEKNLKL ASKPFKRKKS ASTLSKKKLS
     GSLNRHKMIV SLAQNSTYWI LKIIEARNFS DAELQGVFDL LQAVLVGYFD SKKSQIKSGF
     LKEIFRRNPR IGHQLFSLLL DKCGNAKSDF RRVEALDLVI EVLKSQVPMN PSESNWDASK
     KILKSHLQSL SHLIERLVTR MPEKKLRKTE VHKFCDKIFQ MISTLDLTEA FLRCLGPDAR
     PSCESQLGPL FLKLKKLE
//
DBGET integrated database retrieval system