GenomeNet

Database: UniProt
Entry: A0A2G2YPK1_CAPAN
LinkDB: A0A2G2YPK1_CAPAN
Original site: A0A2G2YPK1_CAPAN 
ID   A0A2G2YPK1_CAPAN        Unreviewed;      1656 AA.
AC   A0A2G2YPK1;
DT   31-JAN-2018, integrated into UniProtKB/TrEMBL.
DT   31-JAN-2018, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN   ORFNames=T459_22467 {ECO:0000313|EMBL:PHT71682.1};
OS   Capsicum annuum (Capsicum pepper).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum.
OX   NCBI_TaxID=4072 {ECO:0000313|EMBL:PHT71682.1, ECO:0000313|Proteomes:UP000222542};
RN   [1] {ECO:0000313|EMBL:PHT71682.1, ECO:0000313|Proteomes:UP000222542}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. CM334 {ECO:0000313|Proteomes:UP000222542};
RX   PubMed=29089032; DOI=10.1186/s13059-017-1341-9;
RA   Kim S., Park J., Yeom S.I., Kim Y.M., Seo E., Kim K.T., Kim M.S., Lee J.M.,
RA   Cheong K., Shin H.S., Kim S.B., Han K., Lee J., Park M., Lee H.A.,
RA   Lee H.Y., Lee Y., Oh S., Lee J.H., Choi E., Choi E., Lee S.E., Jeon J.,
RA   Kim H., Choi G., Song H., Lee J., Lee S.C., Kwon J.K., Lee H.Y., Koo N.,
RA   Hong Y., Kim R.W., Kang W.H., Huh J.H., Kang B.C., Yang T.J., Lee Y.H.,
RA   Bennetzen J.L., Choi D.;
RT   "New reference genome sequences of hot pepper reveal the massive evolution
RT   of plant disease-resistance genes by retroduplication.";
RL   Genome Biol. 18:R210.1-R210.11(2017).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PHT71682.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AYRZ02000009; PHT71682.1; -; Genomic_DNA.
DR   EnsemblPlants; PHT71682; PHT71682; T459_22467.
DR   Gramene; PHT71682; PHT71682; T459_22467.
DR   OMA; GHTEREF; -.
DR   Proteomes; UP000222542; Chromosome 9.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.40.50.1000; HAD superfamily/HAD-like; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR016197; Chromo-like_dom_sf.
DR   InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR023214; HAD_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR005162; Retrotrans_gag_dom.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR24559:SF438; RT_RNASEH_2 DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF03732; Retrotrans_gag; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 1.
DR   Pfam; PF08284; RVP_2; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF54160; Chromo domain-like; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50013; CHROMO_2; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000222542}.
FT   DOMAIN          559..738
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          1073..1237
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   DOMAIN          1386..1418
FT                   /note="Chromo"
FT                   /evidence="ECO:0000259|PROSITE:PS50013"
FT   REGION          221..265
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          300..319
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1656 AA;  187672 MW;  0E58D92656049983 CRC64;
     METRKQSLEQ FQQHTNETLE KLTAMFHKLV TDVREIKEKD SDKPSGSGKG STIGNVGKPY
     LKLHFPRFSG DDPTGWIFQA EQYFEFQNVV DTDRVNLASF HLDGIALQWH RWFAKSRGPM
     TWREFTAALL SRFGPTDYDD PSESLHRLKQ ITTVAVYIEA FERLSHRIDN LPESFLLGCF
     VGGLKEDIRL EVKLKKPRTM TDAMGLSRLV EEKLNLQRRV TPSPRVTSFN SLPKGPHSAG
     ILGPAPSQRL ALPAPSPVRR LSGAEAKERR EKGLCYYCDE RYIPGHKCTK PQLFMISEVD
     DVEESSSHED EANENPPDEV SAEISFHAIT GTILPQTLRL PGKIHNKDLV VLIDGGSTHN
     FIEQSLVERF GLTVDNGVKL EVVVANRDKL ACVGRVRGLT IIIQGYTITT DFFVLPIAAC
     PIVLGVQWLK TLGPIEIDFQ NLTLGFHQAG STHKLQGLRG SDLTALKANE LMGIQGIALL
     LQVNRVESEL SSTSSPCPAV QHVLTEYEQV FQEPKELPPR RFHDHDIPLI PGAKPVSSRP
     YRQPYLQKTE IEKQVRGLLR DGLIRPSHSP FSSPVLLVKK SDSTWRFCVD YRALNDITVK
     DKYPIPVIDE LLDELYGATI FSKLDLRAGY HQIRVRETDI PKTAFRTHDG HYEFVVMPFG
     LTNAPATFQC LMNDIFRPYL RKFILVFFDD ILIYSKTLND HLGHLRTALG LLNTNQLFAK
     LSKCCFGVSQ VNYLGHVISS GGVAVEENKV KAVLSWPTPT NAKGVRGFLG LAGYYRKFIK
     GFGSIAAPLH KLVGKGPFIW NEKAEVAFQE LKIALTTPPT LALPDWSHPF TVECDASGVG
     IGAILTQRGR PLAYFSAPLK GIMLSWSTYE KEMLAIVKAV RKWRHYLLGR PFVVKTDHVS
     LKYLMEQRIT TPVQSRWLPK LLGFDYKIEY KKGSLNQGAD ALSRTPEFHY LNVSHPCSTI
     WTTIQEEVHT DPFYLNLPSS LPIKFKGNLV KHDGVWFRND AILLSSNSPL LSTVLVMCHS
     SPEGGHFGFH KTLAKVKHNF WWLGMKDFVK RYIRECHECQ RAKTDTMQPA GLLQPLPVPD
     RIWEDISMDF VEGLPASNGF TVIMVIVDRL SKYAHFVPMR HPFTAASVAR DFVANVVRLH
     GIPSTIVSDR DKIFISSFWQ ALFKLQGSVL CMSSSYHPQT DGQTEVVNRI LEQYLRCFVC
     DKPKKWVDWL PWAEYSYNTS VHTSTKLIPF QVVYGRLPPK ILPYVPGTTK VQAVEDYLQD
     RDRMLKTLRA NLFKAQDRMK HFADQRRREL EFEVGDHVYV KLQPYRQSSV VSRTSAKLSP
     RFFGPYKILA KVGKVAYRIE LPPGSLIHDV FHVSLLRKRE GPVPEPIPPP VNEPVQLQVA
     PQPEGILEER VVQKGKYRPK TEILVKWVGQ SREDATWEDK WRFSRTYPQF HLEDKPNTAV
     FPPVFKIGNG KDTFLGSGGD FRKFLDGLVD ADDVPTYVKE HPLIGQPAIT TSHPDWNYYA
     KIIVRFVGIK KEISQDQDNC IDSGFSTVEK KNKPIFLKQL KKIWENNSYG GRFSKSNTLL
     IDDEPHVALL NPPNTGVFPH AYKVNDGRDT FLGPKGEMQE FLEGLIDAID VPSYVKGHPF
     GQPAITDSHR DWDYYDGVVR AVEDPGFGYT DYESDY
//
DBGET integrated database retrieval system