ID A0A151RK96_CAJCA Unreviewed; 1100 AA.
AC A0A151RK96;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Pro-Pol polyprotein {ECO:0000313|EMBL:KYP42923.1};
GN ORFNames=KK1_035630 {ECO:0000313|EMBL:KYP42923.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP42923.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP42923.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ483692; KYP42923.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151RK96; -.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 130..309
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 527..656
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 810..971
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
SQ SEQUENCE 1100 AA; 124695 MW; 2B39314592F77DE2 CRC64;
MSKSGRDVNV IGDVGKQKLS GLELDPRLEE EDRVEPIETT VPFQLGKIEG QVTFLSSQLS
ETEAEDIKQV LKKYSDLFAW TAADMPGIDP NFHCHRLSVC RDAKPVAQKK RKMGGERAQA
IKDETAKLLQ ARFIREVKYS TWLANVVMVK KANGKWRMCT DYTDLNKACP KDAYPLPHID
ALVDGAAGHH RLSFLDAYSG YNQIPMYSPD EEKTAFITDS ANFCYKVMPF GLRNAGATYQ
RLMDKIFRHQ IGTCLEVYVD DMVIKSTSAV DHLKDLSTIF EEVRRHRMRL NPAKCTFGVA
GGKFLGFMLS KRGIEANPDK CQAIINMQSP RNIKEVQRLA GRIASLARFL PCMAEKSRPI
MSLLKKATKF SWNQECETAF QNFKTTLMAP PLLSKSDPSL DMIIYISVSD KAISTVLVQE
KTEQMPVYFI SRVLQDAETR YQHLEKTVLA LVHTARRLRH YFQSHRVLIR TDSPVTKVLR
RPELAGRMVA WSIELSQFDI RFEPRGPIKA QSLADFVNEF TPQEILDSHL WTLHVDGSSN
HQGSGAGIIL EGPGQVVIEQ SLRFGFKASN NQAEYEALLA GLRLAKDLGI PKVQCWSDSK
VVTEQVNGTF QIKEPTLLLY FHAFNKLKAD FENVQVKHTP RELNMRADQL ARLASSKKVS
HLRSMIQQEL PKPSITQAEC LQIQKETPNW MTGIIEYLTA GSLPIDPLEA KKMKVVAARY
TLIAGELYKR GFSSPLLKCL APDQAHYVIR EIHEGVCGTH SGSRTLAAKV VRAGYYWPTL
MADCSKYVQQ CKPCQQHGPL THQPPEELHS ITTPWPFSIW GMDILGPFPP AKGQVKFLLV
AVDHFTKWIE AEPVATITAN NVQKFFWKNV ITRFGIPYAL ITDNGLQFTD RRFNEFLAEL
GIKHKMTSVE HPQSNGQAEA ANKVILKELK RRLGQAKGAW PEHLPEILWA YRCTPQSSTR
ETPFRLTYGT DAMIPVEVGE PSLRRTMFNN QINEEALNVE LDLVEEARDQ ALIMMEACRA
RLSRKHRTKV KPREFQTGDL VWRVTGEARK DKAQGKLAPN WDGPYRIMHN LQNGAYKLEE
LSGKSIPRTW NATHLKHYFS
//