GenomeNet

Database: UniProt
Entry: A0A061EFX8_THECC
LinkDB: A0A061EFX8_THECC
Original site: A0A061EFX8_THECC 
ID   A0A061EFX8_THECC        Unreviewed;       386 AA.
AC   A0A061EFX8;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 34.
DE   RecName: Full=Gypsy retrotransposon integrase-like protein 1 {ECO:0000256|ARBA:ARBA00039658};
GN   ORFNames=TCM_018990 {ECO:0000313|EMBL:EOY03806.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY03806.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY03806.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001882; EOY03806.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061EFX8; -.
DR   EnsemblPlants; EOY03806; EOY03806; TCM_018990.
DR   Gramene; EOY03806; EOY03806; TCM_018990.
DR   eggNOG; KOG0017; Eukaryota.
DR   HOGENOM; CLU_000384_6_0_1; -.
DR   InParanoid; A0A061EFX8; -.
DR   OMA; NEHENMK; -.
DR   Proteomes; UP000026915; Chromosome 4.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   PANTHER; PTHR47266; ENDONUCLEASE-RELATED; 1.
DR   PANTHER; PTHR47266:SF28; GYPSY RETROTRANSPOSON INTEGRASE-LIKE PROTEIN 1; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF00665; rve; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   Nucleotidyltransferase {ECO:0000313|EMBL:EOY03806.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW   RNA-directed DNA polymerase {ECO:0000313|EMBL:EOY03806.1};
KW   Transferase {ECO:0000313|EMBL:EOY03806.1}.
FT   DOMAIN          96..255
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
SQ   SEQUENCE   386 AA;  44597 MW;  09B82F9FC67A9328 CRC64;
     MNFFLDGDIL YKRSRDQVLL RCMESAEARR IVEEVHEGIC GAHVSGHMLT RQQVMRAGYY
     WLTLEKDCID FARKCHKCQI YADRIHTPAN SLHVLAPPWP FSMWGMDVIG LITPKASNGH
     RFILVAIDYF TKWVEAASYA NVTQKVVCKF IQKEIICRYG LPERIITDNT SNLNGSMMKE
     VCAKFKIKHH NSTPYRPKMN GAVEAANKNI KRIIEKMTDI YKDWHEKLPF ALHAYRTTVR
     TSTGATPFSL VYGMEAVLPI EVEIPSLRVL KEVQLEETEW VNARYEQLNL IEEKRLTALC
     HGQLYQKRMM RAYDKKAHSR QFREGELVLK RILPNQHDPR GKWTPNWEGP FVIKKAFSGG
     ALILAEMDGR EFSNPVNADA VKKYFA
//
DBGET integrated database retrieval system