GenomeNet

Database: UniProt
Entry: A0A061FKS1_THECC
LinkDB: A0A061FKS1_THECC
Original site: A0A061FKS1_THECC 
ID   A0A061FKS1_THECC        Unreviewed;      2215 AA.
AC   A0A061FKS1;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 41.
DE   RecName: Full=Reverse transcriptase domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=TCM_036737 {ECO:0000313|EMBL:EOY17513.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY17513.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY17513.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001886; EOY17513.1; -; Genomic_DNA.
DR   EnsemblPlants; EOY17513; EOY17513; TCM_036737.
DR   Gramene; EOY17513; EOY17513; TCM_036737.
DR   eggNOG; KOG1075; Eukaryota.
DR   HOGENOM; CLU_001916_0_0_1; -.
DR   InParanoid; A0A061FKS1; -.
DR   Proteomes; UP000026915; Chromosome 8.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR   CDD; cd06222; RNase_H_like; 1.
DR   CDD; cd01650; RT_nLTR_like; 1.
DR   Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR025558; DUF4283.
DR   InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR   InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR   InterPro; IPR044730; RNase_H-like_dom_plant.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR002156; RNaseH_domain.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR026960; RVT-Znf.
DR   PANTHER; PTHR46890; NON-LTR RETROLELEMENT REVERSE TRANSCRIPTASE-LIKE PROTEIN-RELATED; 1.
DR   PANTHER; PTHR46890:SF46; RNA-DIRECTED DNA POLYMERASE, EUKARYOTA, REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN PROTEIN; 1.
DR   Pfam; PF14111; DUF4283; 1.
DR   Pfam; PF03372; Exo_endo_phos; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   Pfam; PF13456; RVT_3; 1.
DR   Pfam; PF13966; zf-RVT; 1.
DR   SUPFAM; SSF56219; DNase I-like; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50879; RNASE_H_1; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT   DOMAIN          1322..1601
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          2055..2184
FT                   /note="RNase H type-1"
FT                   /evidence="ECO:0000259|PROSITE:PS50879"
FT   REGION          252..372
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          405..527
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          579..599
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          621..640
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        255..274
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        281..327
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        340..356
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        405..419
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        429..446
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        447..480
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        505..527
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        582..598
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2215 AA;  251867 MW;  B3272BB8B8EB2178 CRC64;
     MPKLQDVRAA FKGIALTGAY EVRWLDYKHV LIHLSNEQDF NRIWTKQNWF IATQKMRVFK
     WTPEFEPEKE SAVVPVWISF PNLKAHLFEK SALLLIAKTV GKPLFVDEAT ANGSRPSVAR
     VCVEYDCRKS PVDQVWIVVQ NRKTGEVMNG YSQRVEFAQM PAYCDHCCHV GHKETDCILL
     GNKPRPPGTS KPPTSRIEDG ERRIGLKEDA EFITDKRKTV ANSKKPENGK ILYHEEPSKY
     LQMWQLVYKG STSGVKDRQG KEVKADKASK EENILVSNRF HTISEEKEDD HNRTTQNGKE
     KHEKNNEKDE GGRTEGIRRE TTEERRTGAE LQTGNGKPEG GRKEGTRRET TEERRIGAEI
     QTGNGTPEGT EMTAIPLANS QILEDTAQGT LHENGVHGQL QNHVEERDKH AERENGNPRN
     SQNKKNTSKS QQKDNEVQHT RGRLQTEENL QKSNARTVGP RLQAADTQRT AKTTCGEPLD
     VTDQTGKEGT FAFSKSKSDI QQSKRDPTKT NTAGKGENSK KSTVGDGNLT LDIMQVSNCE
     QNLNNYSSQH PKQAAPTLQG ATQFEKETED QIYSAEALNK TETGQPKHKA KQNDGEKSKG
     GELVTIATIE LHSERNDEIV RSPGIDSHDQ AKTRPSENQE RAKEFVGAAV EGEGPAATGS
     VPHHTPYVHV ERNKDVEGQN QLMQATPDEP LLQKDGQIKP SQSLKNNFIK SSTQASTLRQ
     AKGCYMIEYG SGVHYSPVDT LEGSGEHVPI EEEGTSQTPL QTEQISTAFK IIRAGEMRVD
     NDLLSPNLES ASSKCLFNKE PSDIPSFSGN NHADLEVHPR ERRRRYSDNA IPLRNTLSTA
     TEEAIVLGGN EEDSDGDSIS KSRVIQRRIK KLQLMHRLKI LAILEPMVDT SKAEYFRRKM
     GFEKVIVNNS QKIWLFHSVE FICEVLLDHP QCLHVRVTIP WLDLPIFTTF VYAKCTRSER
     TPLWNCLRNL AADMEGPWIV GGDFNIILKR EERLYGADPH EGSIEDFASV LLDCGLLDGG
     FEGNPFTWTN NRMFQRLDRM VYNQQWINKF PITRIQHLNR DGSDHCPLLL SCSNSSEKAP
     SSFRFLHAWA LHHNFNASVE GNWNLPINGS GLMAFWSKQK RLKQHLKWWN KTVFGDIFSN
     IKEAEKRVEE CEILHQQEQT IGSRIQLNKS YAQLNKQLSM EEIFWKQKSG VKWVVEGERN
     TKFFHMRMQK KRIRSHIFKI QEQDGNWIED PEQLQQSAID FFSSLLKAES CDDTRFQSSL
     CPSIISDTDN GFLCAEPTLQ EVKEAVFGID PESAAGPDGF SSHFYQQCWD IIAHDLFEAV
     KEFFHGADIP QGMTSTTLVL IPKTTSASKW SEFRPISLCT VMNKIITKIL ANRLAKILPS
     IITENQSGFV GGRLISDNIL LAQELIGKLD QKNRGGNVAL KLDMMKAYDR LDWSFLFKVL
     QHLGFNAQWI GMIQKCISNC WFSLLLNGRT VGYFKSERGL RQGDSISPQL FILAAEYLAR
     GLNALYDQYP SLHYSSGCSL SVSHLAFADD VIIFANGSKS ALQKIMAFLQ EYEKLSGQRI
     NPQKSCVVTH TNMASSRRQI ILQATGFSHR PLPITYLGAP LYKGHKKVML FNDLVAKIEE
     RITGWENKTL SPGGRITLLR STLSSLPIYL LQVLKPPVIV LERINRLLNN FLWGGSTASK
     RIHWASWGKI ALPIAEGGLD IRNVEDVCEA FSMKLWWRFR TTNSLWTQFM RAKYCGGQLP
     TDVQPKLHDS QTWKRMVTIS SITEQNIRWR IGHGELFFWH DCWMGEEPLV NRNQAFASSM
     AQVSDFFLNN SWNVEKLKTV LQQEVVEEIV KIPIDTSSND KAYWTTTPNG DFSTKSAWQL
     IRNRKVENPV FNFIWHKSVP LTTSFFLWRL LHDWIPVELK MKTKGFQLAS RCRCCKSEES
     LMHVMWKNPV ANQVWSYFAK VFQIQIINPC TINQIICAWF YSGDYSKPGH IRTLVPLFTL
     WFLWVERNDA KHRNLGMYPN RVVWKILKLL HQLFQGKQLQ KWQWQGDKQI AQEWGIILKA
     DAPSPPKLLF WLKPSIGELK LNVDGSCKHN PQSAAGGGLL RDHTGSMIFG FSENFGPQDS
     LQAELMALHR GLLLCIEHNI SRLWIEMDAK VAVQMIKEGH QGSSRTRYLL ASIHRCLSGI
     SFRISHIFRE GNQAADHLSN QGHTHQNLQV ISQAEGQLRG ILRLEKINLA YVRFK
//
DBGET integrated database retrieval system