ID A0A2G2YPK1_CAPAN Unreviewed; 1656 AA.
AC A0A2G2YPK1;
DT 31-JAN-2018, integrated into UniProtKB/TrEMBL.
DT 31-JAN-2018, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN ORFNames=T459_22467 {ECO:0000313|EMBL:PHT71682.1};
OS Capsicum annuum (Capsicum pepper).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum.
OX NCBI_TaxID=4072 {ECO:0000313|EMBL:PHT71682.1, ECO:0000313|Proteomes:UP000222542};
RN [1] {ECO:0000313|EMBL:PHT71682.1, ECO:0000313|Proteomes:UP000222542}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. CM334 {ECO:0000313|Proteomes:UP000222542};
RX PubMed=29089032; DOI=10.1186/s13059-017-1341-9;
RA Kim S., Park J., Yeom S.I., Kim Y.M., Seo E., Kim K.T., Kim M.S., Lee J.M.,
RA Cheong K., Shin H.S., Kim S.B., Han K., Lee J., Park M., Lee H.A.,
RA Lee H.Y., Lee Y., Oh S., Lee J.H., Choi E., Choi E., Lee S.E., Jeon J.,
RA Kim H., Choi G., Song H., Lee J., Lee S.C., Kwon J.K., Lee H.Y., Koo N.,
RA Hong Y., Kim R.W., Kang W.H., Huh J.H., Kang B.C., Yang T.J., Lee Y.H.,
RA Bennetzen J.L., Choi D.;
RT "New reference genome sequences of hot pepper reveal the massive evolution
RT of plant disease-resistance genes by retroduplication.";
RL Genome Biol. 18:R210.1-R210.11(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PHT71682.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYRZ02000009; PHT71682.1; -; Genomic_DNA.
DR EnsemblPlants; PHT71682; PHT71682; T459_22467.
DR Gramene; PHT71682; PHT71682; T459_22467.
DR OMA; GHTEREF; -.
DR Proteomes; UP000222542; Chromosome 9.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.40.50.1000; HAD superfamily/HAD-like; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR023214; HAD_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR24559:SF438; RT_RNASEH_2 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000222542}.
FT DOMAIN 559..738
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1073..1237
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1386..1418
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 221..265
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 300..319
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1656 AA; 187672 MW; 0E58D92656049983 CRC64;
METRKQSLEQ FQQHTNETLE KLTAMFHKLV TDVREIKEKD SDKPSGSGKG STIGNVGKPY
LKLHFPRFSG DDPTGWIFQA EQYFEFQNVV DTDRVNLASF HLDGIALQWH RWFAKSRGPM
TWREFTAALL SRFGPTDYDD PSESLHRLKQ ITTVAVYIEA FERLSHRIDN LPESFLLGCF
VGGLKEDIRL EVKLKKPRTM TDAMGLSRLV EEKLNLQRRV TPSPRVTSFN SLPKGPHSAG
ILGPAPSQRL ALPAPSPVRR LSGAEAKERR EKGLCYYCDE RYIPGHKCTK PQLFMISEVD
DVEESSSHED EANENPPDEV SAEISFHAIT GTILPQTLRL PGKIHNKDLV VLIDGGSTHN
FIEQSLVERF GLTVDNGVKL EVVVANRDKL ACVGRVRGLT IIIQGYTITT DFFVLPIAAC
PIVLGVQWLK TLGPIEIDFQ NLTLGFHQAG STHKLQGLRG SDLTALKANE LMGIQGIALL
LQVNRVESEL SSTSSPCPAV QHVLTEYEQV FQEPKELPPR RFHDHDIPLI PGAKPVSSRP
YRQPYLQKTE IEKQVRGLLR DGLIRPSHSP FSSPVLLVKK SDSTWRFCVD YRALNDITVK
DKYPIPVIDE LLDELYGATI FSKLDLRAGY HQIRVRETDI PKTAFRTHDG HYEFVVMPFG
LTNAPATFQC LMNDIFRPYL RKFILVFFDD ILIYSKTLND HLGHLRTALG LLNTNQLFAK
LSKCCFGVSQ VNYLGHVISS GGVAVEENKV KAVLSWPTPT NAKGVRGFLG LAGYYRKFIK
GFGSIAAPLH KLVGKGPFIW NEKAEVAFQE LKIALTTPPT LALPDWSHPF TVECDASGVG
IGAILTQRGR PLAYFSAPLK GIMLSWSTYE KEMLAIVKAV RKWRHYLLGR PFVVKTDHVS
LKYLMEQRIT TPVQSRWLPK LLGFDYKIEY KKGSLNQGAD ALSRTPEFHY LNVSHPCSTI
WTTIQEEVHT DPFYLNLPSS LPIKFKGNLV KHDGVWFRND AILLSSNSPL LSTVLVMCHS
SPEGGHFGFH KTLAKVKHNF WWLGMKDFVK RYIRECHECQ RAKTDTMQPA GLLQPLPVPD
RIWEDISMDF VEGLPASNGF TVIMVIVDRL SKYAHFVPMR HPFTAASVAR DFVANVVRLH
GIPSTIVSDR DKIFISSFWQ ALFKLQGSVL CMSSSYHPQT DGQTEVVNRI LEQYLRCFVC
DKPKKWVDWL PWAEYSYNTS VHTSTKLIPF QVVYGRLPPK ILPYVPGTTK VQAVEDYLQD
RDRMLKTLRA NLFKAQDRMK HFADQRRREL EFEVGDHVYV KLQPYRQSSV VSRTSAKLSP
RFFGPYKILA KVGKVAYRIE LPPGSLIHDV FHVSLLRKRE GPVPEPIPPP VNEPVQLQVA
PQPEGILEER VVQKGKYRPK TEILVKWVGQ SREDATWEDK WRFSRTYPQF HLEDKPNTAV
FPPVFKIGNG KDTFLGSGGD FRKFLDGLVD ADDVPTYVKE HPLIGQPAIT TSHPDWNYYA
KIIVRFVGIK KEISQDQDNC IDSGFSTVEK KNKPIFLKQL KKIWENNSYG GRFSKSNTLL
IDDEPHVALL NPPNTGVFPH AYKVNDGRDT FLGPKGEMQE FLEGLIDAID VPSYVKGHPF
GQPAITDSHR DWDYYDGVVR AVEDPGFGYT DYESDY
//