GenomeNet

Database: UniProt
Entry: A0A061FB87_THECC
LinkDB: A0A061FB87_THECC
Original site: A0A061FB87_THECC 
ID   A0A061FB87_THECC        Unreviewed;       409 AA.
AC   A0A061FB87;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 27.
DE   SubName: Full=DNA glycosylase superfamily protein isoform 1 {ECO:0000313|EMBL:EOY14286.1};
GN   ORFNames=TCM_033602 {ECO:0000313|EMBL:EOY14286.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY14286.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY14286.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001885; EOY14286.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061FB87; -.
DR   EnsemblPlants; EOY14286; EOY14286; TCM_033602.
DR   Gramene; EOY14286; EOY14286; TCM_033602.
DR   HOGENOM; CLU_046054_0_0_1; -.
DR   Proteomes; UP000026915; Chromosome 7.
DR   GO; GO:0008725; F:DNA-3-methyladenine glycosylase activity; IEA:InterPro.
DR   GO; GO:0006284; P:base-excision repair; IEA:InterPro.
DR   InterPro; IPR005019; Adenine_glyco.
DR   InterPro; IPR011257; DNA_glycosylase.
DR   PANTHER; PTHR31116:SF50; DNA GLYCOSYLASE SUPERFAMILY PROTEIN; 1.
DR   PANTHER; PTHR31116; OS04G0501200 PROTEIN; 1.
DR   Pfam; PF03352; Adenine_glyco; 1.
DR   SUPFAM; SSF48150; DNA-glycosylase; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT   REGION          30..134
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        99..114
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        115..129
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   409 AA;  44766 MW;  8EBF2B52F67B2B1F CRC64;
     MCSSNAKVTA GVEITPAVAR INGRPVLQPT CNRVPSLDRR NSLKKIPPLS PPTPPSLAST
     LPATSATVGN GGRAKASLTP PISPKSKSPR PAAIKRGSDP NALNTSSEKV MTPRNITKTL
     ERKKSKSFKE GMGNGLSSWI EPSLSYSSSL IVEAPGSIAA VRREQMALQQ AQRKMKIAHY
     GRSKSAKFES KVVPLNTSSA MTKPDEEEKR CSFITPNSDP VYVAYHDEEW GVPVHDDSML
     FELLVLSGAQ VGSDWISILK KRQDFRDAFS GFDAETVAKF TDKEMTTISS EYGIDISRVL
     GVVDNSNRIL EVKGQFGSFD KYIWGFVNHK AISTQYKFGH KIPVKTSKSE SISKDMLRRG
     FRCVGPTVVH SFMQAAGLTN DHLITCHRHL PCTLLAASSI DGLTFKRRQ
//
DBGET integrated database retrieval system