ID A0A1R3KHA2_COCAP Unreviewed; 1399 AA.
AC A0A1R3KHA2;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN ORFNames=CCACVL1_01552 {ECO:0000313|EMBL:OMP06477.1};
OS Corchorus capsularis (Jute).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=210143 {ECO:0000313|EMBL:OMP06477.1, ECO:0000313|Proteomes:UP000188268};
RN [1] {ECO:0000313|EMBL:OMP06477.1, ECO:0000313|Proteomes:UP000188268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. CVL-1 {ECO:0000313|Proteomes:UP000188268};
RC TISSUE=Whole seedling {ECO:0000313|EMBL:OMP06477.1};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M.;
RT "Corchorus capsularis genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMP06477.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWWV01004903; OMP06477.1; -; Genomic_DNA.
DR STRING; 210143.A0A1R3KHA2; -.
DR EnsemblPlants; OMP06477; OMP06477; CCACVL1_01552.
DR Gramene; OMP06477; OMP06477; CCACVL1_01552.
DR OMA; QDIMEME; -.
DR OrthoDB; 5401763at2759; -.
DR Proteomes; UP000188268; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR CDD; cd06222; RNase_H_like; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR025558; DUF4283.
DR InterPro; IPR044730; RNase_H-like_dom_plant.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR026960; RVT-Znf.
DR PANTHER; PTHR33116:SF67; REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR33116; REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATED; 1.
DR Pfam; PF14111; DUF4283; 1.
DR Pfam; PF13456; RVT_3; 1.
DR Pfam; PF13966; zf-RVT; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000188268};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1399
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013000711"
FT DOMAIN 99..209
FT /note="DUF4283"
FT /evidence="ECO:0000259|Pfam:PF14111"
FT DOMAIN 1066..1158
FT /note="Reverse transcriptase zinc-binding"
FT /evidence="ECO:0000259|Pfam:PF13966"
FT DOMAIN 1263..1382
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|Pfam:PF13456"
FT REGION 303..323
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 399..438
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 422..438
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1399 AA; 157441 MW; AEDB152DF778BEA3 CRC64;
MTFSLVSFSV SSLPLGLSSG AADLYVNSDS YLDFQFEFLK VVQDKIYPGY HKSDFPIRNL
EEDEKDKIVI DGDWVEAPVG QQGGGYLIGK LLLHKPATVE GLRAVFQQIW KLRRDLLVRE
VGERLFVFQY ANVLERDRVS VSQLWLFHKA LLVLREFDGV QQSESIEFDT CPFWVKAFGI
PFQMVNERVG TVVGESMRRV LDVDANSGRY LRIRIDMDLQ RPFKTVSTLT YQDGEAEIKF
DYEKRPDYCW VCGLVDHQES DYAVAVAMKI ENGFVIQKYK PDKSKSSFVM GRASVSLVQR
QRRGGAINRQ GSQSVPAKSN TSITSPFRRH VDSMLLHGRR VARALAYNDV FCEIISKMDA
RAVETGQDFQ GGYNNEDPRR VVDKSALTNV MVVTYRGAEK GGRQSQNGKE KECRAGNAGN
SFIPHGENSS GESSSSSSIS PNANWVVPII GGPSQNCVIP GVGPREVGSN LGLNYMVNSS
LDGVARGLEI EKGDGVGAIQ DAVENSTTEG YDPTSPFVFG TGSSGTRKVR KWKKAARVSE
QYSFDTLCHE QPFKVGSKRS AGICAKGGSA YGVDCNGRSV GLALLWMKDE CISLLSYSFW
HIDVSIGSSD KWQFTGFYGQ LDTNRRYESW SLLRSLFHED FDMAIKKAWP DGDVDIVKKI
KACGTTLEDW NQTMFGNLQF NIAKKQKEFG SLFARGASGN HAELDKCNRD LDKLLHQEEL
LWRQRPKTHW LKVRDRNTRF FHAVASSRKQ KKQILSIKED ARNTHTEQTG IMSTFTNYFK
GVFTTSNPTQ AAIHEVLQHM ECRVTEQMQI QLEQPFTARE IQHAAFQMGG SKAPGPDGMS
PLFFQKCWSV VGKDVVNYAL KFLNNNESLP DVNHTNVVLI PKIDDPKLAK DFRPISLCNV
IFRIVSKALA NRSCEEVISL LDMFEAASGQ KININKSAVL FSANTTSGVK DELMNFLGVQ
RVLDNDKYLG LPIMIGRSKC REFRFLKDRL QKRINAWNSK LFSKAGKAVM IQAVAQATPV
YLMSVFLFPK SFLQELNAMI ARFCLVVPRQ NEEDRLIWNG TMLGEFTVCS AYHVARRVIG
RQELPLQLRS PIWRYIWSAG IMPKIQYFMW RLVWNILPTK SNLNKRGMEI AGTCEVCGGE
ESADAHVFFN CHLSKLVWED ACPWVLSCIE QWDLNGNFWE FFLEKAKAIG QFDRVCTILW
LLWGNSNRAL YEAFCSMPNA IVRAATRILD QVCAATSRIG EILMGQSRQI AWMPPPLGVM
KINTDASFST ENGEAGLGVV IRDDVGNVIA SGSRRLYFIA DSLYAEVHAI LFGFEMALEL
GLDRCIVESD SLLAIREINK LDTVFWEGGC LIHEIRELAS LFEFCSFQFV NREANMLAHS
LVGLRLDNVW CGTLPLDVL
//