ID A0A1R3JZR2_COCAP Unreviewed; 2053 AA.
AC A0A1R3JZR2;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE SubName: Full=Reverse transcriptase {ECO:0000313|EMBL:OMP00330.1};
GN ORFNames=CCACVL1_03376 {ECO:0000313|EMBL:OMP00330.1};
OS Corchorus capsularis (Jute).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=210143 {ECO:0000313|EMBL:OMP00330.1, ECO:0000313|Proteomes:UP000188268};
RN [1] {ECO:0000313|EMBL:OMP00330.1, ECO:0000313|Proteomes:UP000188268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. CVL-1 {ECO:0000313|Proteomes:UP000188268};
RC TISSUE=Whole seedling {ECO:0000313|EMBL:OMP00330.1};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M.;
RT "Corchorus capsularis genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMP00330.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWWV01006665; OMP00330.1; -; Genomic_DNA.
DR EnsemblPlants; OMP00330; OMP00330; CCACVL1_03376.
DR Gramene; OMP00330; OMP00330; CCACVL1_03376.
DR OrthoDB; 857534at2759; -.
DR Proteomes; UP000188268; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR018289; MULE_transposase_dom.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR006564; Znf_PMZ.
DR InterPro; IPR007527; Znf_SWIM.
DR PANTHER; PTHR24559:SF425; RT_RNASEH DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF10551; MULE; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR Pfam; PF04434; SWIM; 1.
DR SMART; SM00575; ZnF_PMZ; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50966; ZF_SWIM; 1.
PE 4: Predicted;
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleotidyltransferase {ECO:0000313|EMBL:OMP00330.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000188268};
KW RNA-directed DNA polymerase {ECO:0000313|EMBL:OMP00330.1};
KW Transferase {ECO:0000313|EMBL:OMP00330.1};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00325};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00325}.
FT DOMAIN 879..1012
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 1787..1819
FT /note="SWIM-type"
FT /evidence="ECO:0000259|PROSITE:PS50966"
FT REGION 235..286
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1403..1429
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1847..1893
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1924..1967
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2019..2053
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 260..283
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1403..1427
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1855..1893
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1924..1943
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1953..1967
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2027..2053
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2053 AA; 231502 MW; 0EED6D21798E5D7F CRC64;
MSSSGYGADV PTYLQDMLSK IMAAFDDKTK AIDERANERM IAMEERFVQL AKGVHGDNGK
TLEVPAASET NNGAIDTTIP TLNNTTGAKD GIASASADTT YVTKDQLQSL DDDLKLKEFS
KSLTGKAYTW YVNLIPGSIE SWNQMCTQFG EKFFPTQEKL TLIDLGREHQ KSGEDLMEYI
QRFRERVLDV HDAYNERELV KVCMQGMFDE YRVHLENLPL YTFAALVEAA RRTNSSVQRQ
KESRYARRNT PPVHAVQFQK RNNDRPRNDR FQKRPKTDFR RGDDAPPFPV PVEKVRALLQ
EWIRDGQINL PFVSRMPTGQ EKIDPKYCDY HRVVGHPFAE CWSMRRLIQN RVRKGELVIN
SNDTVQVNLL PTHGACAVIH SLRDEQYENE DAYDTHVAAV MTSTIASSLL KTPNVRHFFD
MLGFCDDARK EAAEALVQVA NKYHDMCTPD PNHNKPLYVE STINGVYIRT TFIDDGSGLN
LMPLKTLRAL GIDQRSLRHP MIINAFDNKG TRTLGYVTVN MKVGNIQEQT CFHVLDADVA
YHVLVGRKWL HAHYLIASTL HQCIKGYWND KEVSIPATKA PFEQNEVRYA EASFFDELAD
DGEGALGRPI GVSLPPWSNY DGISHCNVKR TKKGNKMRCG NATGGSRGAD VTSYTRADGR
IDKPKPQEEE LEELNIADEG GTPKPLFLSK NLSAEQKSVL IELLKEFEDV FAWSYEQMPG
LDTNLVTHEL HIAPGSRLVK QSARLFQPEI ETKIKEEIEK LLRVDSSSLF TILHGHEMFS
FMDGFSGYNQ IKMAQEDAEK TAFRTPIGNF YYTVVPFGLK NAGATYQRAM TAIFHDMLHE
CVEDYVDDIV VKSKKAADHL TDLRKVFERC RKYNLRMNPL KCAFGVTSGK FLGGSGAGIV
LVPPEARCEH EEALSLAFKL DFPCTNNQAE YEALVLGLHT ARIIGVEELC IIGDSNLVVK
QTNGEFSLKE PTLAPYRDLV RSFLDKFQSV RCEHSPRSSN RYADALATLA SKINMPDGEQ
TIPLTVKRWS IPSPHALWME TPKGEEEQDW RDPIIQQLGD PTSNLLPSLK KYVLIHGTLY
YRGANEVLAR CVSSKEADCR LKAAHRQWCG QEGPPLYRRL QRAGYYWPTM MKDATHMESL
CAKCSEPPNV HECHFVGSVG DCRRPYIDFL QNGVLLTNYQ DARRIKRRAE RFFLKGNELF
RTSFAGKPLK CVSPADMTAL LEDVHGGVKM AFDTAIVRFH YEGRFSGVDE ELQYVGGYVD
DLVFDPDKIS KNEFEQLCQR AGYKNIVRMF YQRPGFLLCD GLKPIINDAS IIDMTGELFL
NDGVIDVYVE HGVEVVPEIA GLLENGENLA DPPVVNLGDP PVVNLGDPVE DVGVNLQDVH
VEDNGDINDD VGEVEVDEVN LEGLHDEDGE DEGPAAGENG ENDLESDGEG ISEFHIDSEY
DDSDDPLEIE SDEELIVNEA TTMRGRFPRF DSTADVPYIY KSMLFKNSDE FKLASKEMIV
ESYVVNYEEE FAKLWSYKDM ILLSNPGSTV KMDTFRADPD GPPVFERMYI CLGALKEGRD
GNNQMFPVAW ALVEDETTLT WSWFIECLQE DLGIGDRFGF TFMSDQHKAI QRSIEDNVPQ
AEHRFCARHV WVNWQGRGHR GDEMNDFFWR LVKAPTLREY YEILDKLKQK SSQAATDFEA
YTPPAKFCRT FFRLESMVEV ADNNLCEAFN KTLLKARKMP VISLFEMMRR EMMKRIVKKN
NELSRWRDGL GPRIWQSIEK SAKIAQYCRV IFNGADGYEV EHGESRYVVR LEEKTCTCRI
YKLSGVPCAH AICAIRERRG NVAEFVSSWY SKEVYMQSYS NPIQSMPGLK DWPTSDLPPN
STTPSHEEVN CSSPPQNPAK TKKKAHSSSQ ASVSVQIPDL NANANSQAAS SGCGDAEASA
SVILPSQQSG TDPTTVNSSS KAKSGQSKKK KQTLMLDLGR KRDSTVTDNN GRVLECVWVK
GQVRLSQAKY ANQRRITATF LQRAAQKKFR ARLEALKKGL QPAASGDNVP LQGTQESVTT
ASASSKGKNK QQN
//