GenomeNet

Database: UniProt
Entry: A0A5B6U2I9_9ROSI
LinkDB: A0A5B6U2I9_9ROSI
Original site: A0A5B6U2I9_9ROSI 
ID   A0A5B6U2I9_9ROSI        Unreviewed;      1597 AA.
AC   A0A5B6U2I9;
DT   13-NOV-2019, integrated into UniProtKB/TrEMBL.
DT   13-NOV-2019, sequence version 1.
DT   27-MAR-2024, entry version 12.
DE   SubName: Full=Reverse transcriptase (RNA-dependent DNA polymerase) domain containing protein {ECO:0000313|EMBL:KAA3450814.1};
GN   ORFNames=EPI10_034343 {ECO:0000313|EMBL:KAA3450814.1};
OS   Gossypium australe.
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX   NCBI_TaxID=47621 {ECO:0000313|EMBL:KAA3450814.1, ECO:0000313|Proteomes:UP000325315};
RN   [1] {ECO:0000313|Proteomes:UP000325315}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. PA1801 {ECO:0000313|Proteomes:UP000325315};
RX   PubMed=31479566; DOI=10.1111/pbi.13249;
RA   Cai Y., Cai X., Wang Q., Wang P., Zhang Y., Cai C., Xu Y., Wang K.,
RA   Zhou Z., Wang C., Geng S., Li B., Dong Q., Hou Y., Wang H., Ai P., Liu Z.,
RA   Yi F., Sun M., An G., Cheng J., Zhang Y., Shi Q., Xie Y., Shi X., Chang Y.,
RA   Huang F., Chen Y., Hong S., Mi L., Sun Q., Zhang L., Zhou B., Peng R.,
RA   Zhang X., Liu F.;
RT   "Genome sequencing of the Australian wild diploid species Gossypium
RT   australe highlights disease resistance and delayed gland morphogenesis.";
RL   Plant Biotechnol. J. 0:0-0(2019).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAA3450814.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; SMMG02000121; KAA3450814.1; -; Genomic_DNA.
DR   Proteomes; UP000325315; Unassembled WGS sequence.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09279; RNase_HI_like; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 3.10.20.370; -; 1.
DR   Gene3D; 3.30.70.270; -; 1.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR002156; RNaseH_domain.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   Pfam; PF13456; RVT_3; 1.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleotidyltransferase {ECO:0000313|EMBL:KAA3450814.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000325315};
KW   RNA-directed DNA polymerase {ECO:0000313|EMBL:KAA3450814.1};
KW   Transferase {ECO:0000313|EMBL:KAA3450814.1}.
FT   DOMAIN          1049..1209
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          48..67
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        48..66
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1597 AA;  182824 MW;  3828A1E3DB91F25D CRC64;
     MAQLTPLDKL QLNKCTKENL FSQTVGMELA SLLSPELLHV FSSISGKDSK KKNAIRGENT
     KKTNTEKKRR ACSRARTHCW NEGLKLPTRN NCPGCNDRYV EYRQDIAKRR SVHERIGRIH
     PSDNRRLELH NDQPKKRHAD HRWIDREEEE DHGYIWQEGQ WCPPGLRKSQ KRRVQRLRNQ
     ELKQARPRMK SVWRRVEKPD EHNPPAPTCM VCLLPSEFMA PTDQVVQEEA LSEVDETEQL
     MAQLMLSKQA TFEKPSKNRH MKPLYLRGFV NGKPLTKMFV DGGAAVNVMP YTTFRKLGMG
     LGDLTPTSIV LNDFAGNPSD TKGCVHVDLM IGSKTLLTTF FVIEGRGSYS LLLGRDWIHA
     NCCIPSTMHQ QLIQWIDDEV EIVQADDSIS VARRGFVSAD KLEEVDIGDG DKPRPTFISA
     NLDPIFKEKL IKLLKEYKDC FAWDYSEMPG LDRSIVEHRL PIKPGYKPYK QPPRKIYKEE
     VLADVKKEIE RLLDANFIRP CKYADWISNI VPVYKKNGKM RVCIDFRDLN KANQWMDTQC
     LFMDGNAGYN QIFMALEDIV KTAFRCPGHI GLFEWVVMTF GLKNAGATYQ RAMNYIFHEL
     IGKIVEYILM IIGWRIPGIL IHEGGIGVGK KSMKAIDEII PPTNLKELQS LLGKINFVRR
     FISNLSQKVL PFSSLLKVKK DQKFIWGDEQ QKAFDEIKEY MKEPPVLVPP QLNKPFKLYM
     AADAQTIGSA LIQEFEGKER VVAYLSRKLL DPETRYSAME KLSLCHMLSM PILNGRIGKW
     ILALSEFELK FESAKAVKGQ IIADFITEHR DSSINLLNII PWVLFFDGSS CDKGGGAGIL
     LTSPKGEVFK FAIPNQSTVT NNQAEYEALL KGLQYLKEAR AIALSGEYEC KNDVLRNYYE
     ECKQILKGFR SIILQHIPRG DNEEANKLAQ SASGYRENQE VFTTDDCAIR SDLAENDWRK
     EIADYLENPS QKVSRRLRYK AIKFVLLDKD LYYKSLDGVL LRCLNQEEAK KLMSEVHDGL
     CGAHQSAYRM KWKFGNIQRV PASALNPIIK PWPFRGWGID LIGQIYPPSS KGHKFVLLAT
     DYFTKWVEAI PLRNVASENM IEFVKEHIIY RFGIPQTITT DQGTQFTSSE FRDFAESMGI
     KLLNSSPYYA QANGQAEASN KIMIKIIQKK IDQKPRKWHS VLNEALWAYR MAPHGSTKTS
     PYELVCGHHA VLPWEVQSDS RRIKLQKDLS SKDYNDLMMD ELEDLHMIRL KALENIEKNK
     MRIAKYYNKK VKVKQFTEGD LVWKALLPIG TKYSAFGKWS PNWEGPFRIF KCIIYKKEGT
     RLQWGLIDPP KPMPNREIMV KRISQIRPRF LHVLHGMFYS TLQASFTYIK STVVGAGMNL
     HGIKGECCTI AHTGPGDIRK DLVDVVHPIQ GRPACCIGRM GKLDNCGFQS VKPRKKIRNK
     KILTSSSPPS NVAFPGDLIN LLILILQDVL EVDNLMNQCR MRRIQNCACF PQKTSLDDDE
     RKKKCSLESG IGTRQSWNVS RNEVRRQFKK TIKRAFGKIL KMTSGPSKKL KGHSPRSVTC
     FMTVRAPEKR TELLSSALVE VINNIAQEYA SPRCVCV
//
DBGET integrated database retrieval system