GenomeNet

Database: UniProt
Entry: A0A1R3JZR2_COCAP
LinkDB: A0A1R3JZR2_COCAP
Original site: A0A1R3JZR2_COCAP 
ID   A0A1R3JZR2_COCAP        Unreviewed;      2053 AA.
AC   A0A1R3JZR2;
DT   12-APR-2017, integrated into UniProtKB/TrEMBL.
DT   12-APR-2017, sequence version 1.
DT   24-JAN-2024, entry version 24.
DE   SubName: Full=Reverse transcriptase {ECO:0000313|EMBL:OMP00330.1};
GN   ORFNames=CCACVL1_03376 {ECO:0000313|EMBL:OMP00330.1};
OS   Corchorus capsularis (Jute).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX   NCBI_TaxID=210143 {ECO:0000313|EMBL:OMP00330.1, ECO:0000313|Proteomes:UP000188268};
RN   [1] {ECO:0000313|EMBL:OMP00330.1, ECO:0000313|Proteomes:UP000188268}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. CVL-1 {ECO:0000313|Proteomes:UP000188268};
RC   TISSUE=Whole seedling {ECO:0000313|EMBL:OMP00330.1};
RA   Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA   Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA   Rashid M.M., Khan S.A., Rahman M.S., Alam M.;
RT   "Corchorus capsularis genome sequencing.";
RL   Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OMP00330.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AWWV01006665; OMP00330.1; -; Genomic_DNA.
DR   EnsemblPlants; OMP00330; OMP00330; CCACVL1_03376.
DR   Gramene; OMP00330; OMP00330; CCACVL1_03376.
DR   OrthoDB; 857534at2759; -.
DR   Proteomes; UP000188268; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09279; RNase_HI_like; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 1.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR018289; MULE_transposase_dom.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR005162; Retrotrans_gag_dom.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR002156; RNaseH_domain.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR006564; Znf_PMZ.
DR   InterPro; IPR007527; Znf_SWIM.
DR   PANTHER; PTHR24559:SF425; RT_RNASEH DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR   Pfam; PF10551; MULE; 1.
DR   Pfam; PF03732; Retrotrans_gag; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   Pfam; PF13456; RVT_3; 1.
DR   Pfam; PF04434; SWIM; 1.
DR   SMART; SM00575; ZnF_PMZ; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   PROSITE; PS50879; RNASE_H_1; 1.
DR   PROSITE; PS50966; ZF_SWIM; 1.
PE   4: Predicted;
KW   DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleotidyltransferase {ECO:0000313|EMBL:OMP00330.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000188268};
KW   RNA-directed DNA polymerase {ECO:0000313|EMBL:OMP00330.1};
KW   Transferase {ECO:0000313|EMBL:OMP00330.1};
KW   Zinc {ECO:0000256|PROSITE-ProRule:PRU00325};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00325}.
FT   DOMAIN          879..1012
FT                   /note="RNase H type-1"
FT                   /evidence="ECO:0000259|PROSITE:PS50879"
FT   DOMAIN          1787..1819
FT                   /note="SWIM-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50966"
FT   REGION          235..286
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1403..1429
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1847..1893
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1924..1967
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2019..2053
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        260..283
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1403..1427
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1855..1893
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1924..1943
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1953..1967
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2027..2053
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2053 AA;  231502 MW;  0EED6D21798E5D7F CRC64;
     MSSSGYGADV PTYLQDMLSK IMAAFDDKTK AIDERANERM IAMEERFVQL AKGVHGDNGK
     TLEVPAASET NNGAIDTTIP TLNNTTGAKD GIASASADTT YVTKDQLQSL DDDLKLKEFS
     KSLTGKAYTW YVNLIPGSIE SWNQMCTQFG EKFFPTQEKL TLIDLGREHQ KSGEDLMEYI
     QRFRERVLDV HDAYNERELV KVCMQGMFDE YRVHLENLPL YTFAALVEAA RRTNSSVQRQ
     KESRYARRNT PPVHAVQFQK RNNDRPRNDR FQKRPKTDFR RGDDAPPFPV PVEKVRALLQ
     EWIRDGQINL PFVSRMPTGQ EKIDPKYCDY HRVVGHPFAE CWSMRRLIQN RVRKGELVIN
     SNDTVQVNLL PTHGACAVIH SLRDEQYENE DAYDTHVAAV MTSTIASSLL KTPNVRHFFD
     MLGFCDDARK EAAEALVQVA NKYHDMCTPD PNHNKPLYVE STINGVYIRT TFIDDGSGLN
     LMPLKTLRAL GIDQRSLRHP MIINAFDNKG TRTLGYVTVN MKVGNIQEQT CFHVLDADVA
     YHVLVGRKWL HAHYLIASTL HQCIKGYWND KEVSIPATKA PFEQNEVRYA EASFFDELAD
     DGEGALGRPI GVSLPPWSNY DGISHCNVKR TKKGNKMRCG NATGGSRGAD VTSYTRADGR
     IDKPKPQEEE LEELNIADEG GTPKPLFLSK NLSAEQKSVL IELLKEFEDV FAWSYEQMPG
     LDTNLVTHEL HIAPGSRLVK QSARLFQPEI ETKIKEEIEK LLRVDSSSLF TILHGHEMFS
     FMDGFSGYNQ IKMAQEDAEK TAFRTPIGNF YYTVVPFGLK NAGATYQRAM TAIFHDMLHE
     CVEDYVDDIV VKSKKAADHL TDLRKVFERC RKYNLRMNPL KCAFGVTSGK FLGGSGAGIV
     LVPPEARCEH EEALSLAFKL DFPCTNNQAE YEALVLGLHT ARIIGVEELC IIGDSNLVVK
     QTNGEFSLKE PTLAPYRDLV RSFLDKFQSV RCEHSPRSSN RYADALATLA SKINMPDGEQ
     TIPLTVKRWS IPSPHALWME TPKGEEEQDW RDPIIQQLGD PTSNLLPSLK KYVLIHGTLY
     YRGANEVLAR CVSSKEADCR LKAAHRQWCG QEGPPLYRRL QRAGYYWPTM MKDATHMESL
     CAKCSEPPNV HECHFVGSVG DCRRPYIDFL QNGVLLTNYQ DARRIKRRAE RFFLKGNELF
     RTSFAGKPLK CVSPADMTAL LEDVHGGVKM AFDTAIVRFH YEGRFSGVDE ELQYVGGYVD
     DLVFDPDKIS KNEFEQLCQR AGYKNIVRMF YQRPGFLLCD GLKPIINDAS IIDMTGELFL
     NDGVIDVYVE HGVEVVPEIA GLLENGENLA DPPVVNLGDP PVVNLGDPVE DVGVNLQDVH
     VEDNGDINDD VGEVEVDEVN LEGLHDEDGE DEGPAAGENG ENDLESDGEG ISEFHIDSEY
     DDSDDPLEIE SDEELIVNEA TTMRGRFPRF DSTADVPYIY KSMLFKNSDE FKLASKEMIV
     ESYVVNYEEE FAKLWSYKDM ILLSNPGSTV KMDTFRADPD GPPVFERMYI CLGALKEGRD
     GNNQMFPVAW ALVEDETTLT WSWFIECLQE DLGIGDRFGF TFMSDQHKAI QRSIEDNVPQ
     AEHRFCARHV WVNWQGRGHR GDEMNDFFWR LVKAPTLREY YEILDKLKQK SSQAATDFEA
     YTPPAKFCRT FFRLESMVEV ADNNLCEAFN KTLLKARKMP VISLFEMMRR EMMKRIVKKN
     NELSRWRDGL GPRIWQSIEK SAKIAQYCRV IFNGADGYEV EHGESRYVVR LEEKTCTCRI
     YKLSGVPCAH AICAIRERRG NVAEFVSSWY SKEVYMQSYS NPIQSMPGLK DWPTSDLPPN
     STTPSHEEVN CSSPPQNPAK TKKKAHSSSQ ASVSVQIPDL NANANSQAAS SGCGDAEASA
     SVILPSQQSG TDPTTVNSSS KAKSGQSKKK KQTLMLDLGR KRDSTVTDNN GRVLECVWVK
     GQVRLSQAKY ANQRRITATF LQRAAQKKFR ARLEALKKGL QPAASGDNVP LQGTQESVTT
     ASASSKGKNK QQN
//
DBGET integrated database retrieval system