ID A0A061FKS1_THECC Unreviewed; 2215 AA.
AC A0A061FKS1;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 41.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=TCM_036737 {ECO:0000313|EMBL:EOY17513.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY17513.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY17513.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001886; EOY17513.1; -; Genomic_DNA.
DR EnsemblPlants; EOY17513; EOY17513; TCM_036737.
DR Gramene; EOY17513; EOY17513; TCM_036737.
DR eggNOG; KOG1075; Eukaryota.
DR HOGENOM; CLU_001916_0_0_1; -.
DR InParanoid; A0A061FKS1; -.
DR Proteomes; UP000026915; Chromosome 8.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR CDD; cd06222; RNase_H_like; 1.
DR CDD; cd01650; RT_nLTR_like; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR025558; DUF4283.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR InterPro; IPR044730; RNase_H-like_dom_plant.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR026960; RVT-Znf.
DR PANTHER; PTHR46890; NON-LTR RETROLELEMENT REVERSE TRANSCRIPTASE-LIKE PROTEIN-RELATED; 1.
DR PANTHER; PTHR46890:SF46; RNA-DIRECTED DNA POLYMERASE, EUKARYOTA, REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN PROTEIN; 1.
DR Pfam; PF14111; DUF4283; 1.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR Pfam; PF13966; zf-RVT; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 1322..1601
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 2055..2184
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT REGION 252..372
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 405..527
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 579..599
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 621..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 255..274
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 281..327
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 340..356
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 405..419
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..446
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 447..480
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 505..527
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 582..598
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2215 AA; 251867 MW; B3272BB8B8EB2178 CRC64;
MPKLQDVRAA FKGIALTGAY EVRWLDYKHV LIHLSNEQDF NRIWTKQNWF IATQKMRVFK
WTPEFEPEKE SAVVPVWISF PNLKAHLFEK SALLLIAKTV GKPLFVDEAT ANGSRPSVAR
VCVEYDCRKS PVDQVWIVVQ NRKTGEVMNG YSQRVEFAQM PAYCDHCCHV GHKETDCILL
GNKPRPPGTS KPPTSRIEDG ERRIGLKEDA EFITDKRKTV ANSKKPENGK ILYHEEPSKY
LQMWQLVYKG STSGVKDRQG KEVKADKASK EENILVSNRF HTISEEKEDD HNRTTQNGKE
KHEKNNEKDE GGRTEGIRRE TTEERRTGAE LQTGNGKPEG GRKEGTRRET TEERRIGAEI
QTGNGTPEGT EMTAIPLANS QILEDTAQGT LHENGVHGQL QNHVEERDKH AERENGNPRN
SQNKKNTSKS QQKDNEVQHT RGRLQTEENL QKSNARTVGP RLQAADTQRT AKTTCGEPLD
VTDQTGKEGT FAFSKSKSDI QQSKRDPTKT NTAGKGENSK KSTVGDGNLT LDIMQVSNCE
QNLNNYSSQH PKQAAPTLQG ATQFEKETED QIYSAEALNK TETGQPKHKA KQNDGEKSKG
GELVTIATIE LHSERNDEIV RSPGIDSHDQ AKTRPSENQE RAKEFVGAAV EGEGPAATGS
VPHHTPYVHV ERNKDVEGQN QLMQATPDEP LLQKDGQIKP SQSLKNNFIK SSTQASTLRQ
AKGCYMIEYG SGVHYSPVDT LEGSGEHVPI EEEGTSQTPL QTEQISTAFK IIRAGEMRVD
NDLLSPNLES ASSKCLFNKE PSDIPSFSGN NHADLEVHPR ERRRRYSDNA IPLRNTLSTA
TEEAIVLGGN EEDSDGDSIS KSRVIQRRIK KLQLMHRLKI LAILEPMVDT SKAEYFRRKM
GFEKVIVNNS QKIWLFHSVE FICEVLLDHP QCLHVRVTIP WLDLPIFTTF VYAKCTRSER
TPLWNCLRNL AADMEGPWIV GGDFNIILKR EERLYGADPH EGSIEDFASV LLDCGLLDGG
FEGNPFTWTN NRMFQRLDRM VYNQQWINKF PITRIQHLNR DGSDHCPLLL SCSNSSEKAP
SSFRFLHAWA LHHNFNASVE GNWNLPINGS GLMAFWSKQK RLKQHLKWWN KTVFGDIFSN
IKEAEKRVEE CEILHQQEQT IGSRIQLNKS YAQLNKQLSM EEIFWKQKSG VKWVVEGERN
TKFFHMRMQK KRIRSHIFKI QEQDGNWIED PEQLQQSAID FFSSLLKAES CDDTRFQSSL
CPSIISDTDN GFLCAEPTLQ EVKEAVFGID PESAAGPDGF SSHFYQQCWD IIAHDLFEAV
KEFFHGADIP QGMTSTTLVL IPKTTSASKW SEFRPISLCT VMNKIITKIL ANRLAKILPS
IITENQSGFV GGRLISDNIL LAQELIGKLD QKNRGGNVAL KLDMMKAYDR LDWSFLFKVL
QHLGFNAQWI GMIQKCISNC WFSLLLNGRT VGYFKSERGL RQGDSISPQL FILAAEYLAR
GLNALYDQYP SLHYSSGCSL SVSHLAFADD VIIFANGSKS ALQKIMAFLQ EYEKLSGQRI
NPQKSCVVTH TNMASSRRQI ILQATGFSHR PLPITYLGAP LYKGHKKVML FNDLVAKIEE
RITGWENKTL SPGGRITLLR STLSSLPIYL LQVLKPPVIV LERINRLLNN FLWGGSTASK
RIHWASWGKI ALPIAEGGLD IRNVEDVCEA FSMKLWWRFR TTNSLWTQFM RAKYCGGQLP
TDVQPKLHDS QTWKRMVTIS SITEQNIRWR IGHGELFFWH DCWMGEEPLV NRNQAFASSM
AQVSDFFLNN SWNVEKLKTV LQQEVVEEIV KIPIDTSSND KAYWTTTPNG DFSTKSAWQL
IRNRKVENPV FNFIWHKSVP LTTSFFLWRL LHDWIPVELK MKTKGFQLAS RCRCCKSEES
LMHVMWKNPV ANQVWSYFAK VFQIQIINPC TINQIICAWF YSGDYSKPGH IRTLVPLFTL
WFLWVERNDA KHRNLGMYPN RVVWKILKLL HQLFQGKQLQ KWQWQGDKQI AQEWGIILKA
DAPSPPKLLF WLKPSIGELK LNVDGSCKHN PQSAAGGGLL RDHTGSMIFG FSENFGPQDS
LQAELMALHR GLLLCIEHNI SRLWIEMDAK VAVQMIKEGH QGSSRTRYLL ASIHRCLSGI
SFRISHIFRE GNQAADHLSN QGHTHQNLQV ISQAEGQLRG ILRLEKINLA YVRFK
//