ID A0A151R8A5_CAJCA Unreviewed; 1390 AA.
AC A0A151R8A5;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Transposon Ty3-G Gag-Pol polyprotein {ECO:0000313|EMBL:KYP38605.1};
GN ORFNames=KK1_040129 {ECO:0000313|EMBL:KYP38605.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP38605.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP38605.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ483985; KYP38605.1; -; Genomic_DNA.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 503..682
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1021..1197
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 161..188
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1390 AA; 159693 MW; EB0ABDFD83DD521D CRC64;
MYQCENYFLI DATPEDVKVR LAIVHLEGKA LQWHTAISKN LVNQQPSWEE YMKMLQDRFG
DICDDPMAEL MKLRQEKGVS EYHEAFDAII SRLDLTEEYR LSCFLGGLKH EIQMMVRMFR
PDSVRRAFSL AKMYKASQPQ GPLAMASKPL SLNSRNNKNV MNSRPLLPTP TDQAKYNQPE
FTSNKTKPYR NLTPTYMADR RSKGLCYFCD EPYSQAHSLT HKKLQLHVIE VEETSNDPTF
EEELPDSDSA DMGEPQISVH ALTDIPNFKT MRITGYYNKK PLHILIDSGS THNFLDVHIA
KKLGCRIDNL EPMHVTVADD SKLNIEAMVK DFKWTIQQTM FTSDMMLLSL GCCDLVLGIE
WLITLGDITW KFDKLSMQFY AQGRKHVLRS AQLQGMKTVR RKQFGRILKE GVHISMIQLC
NQEGALLHTL TTHGQLPVPT PKIQYILGEF EDVFQEPTQL PPVRSDHDHK IPLVQGSNPV
NKRPYRYAKQ QKDVIDKLVK EYLNTGIIQA SNSPYASPVV LVGKKDGSWR LCVDYRELNK
ATVKDKFPIP LVEDLLDELH GSTIYSKIDL RSGYNQVRMH PMDVHKTAFK THGGHYEYLV
MPFGLTNAPA TFQGLMNSVF QEYLRQFLLV FFDDILIYSK SIEDHMQHLQ LVLQTMRQNN
LFARKSKCYF AVTKVEYLGH FINAEAVSTD PSKIEAVKNW PLPETLKQLR GFLGLAGYYK
RFVRGYGGIA KPLTELLKKD NFTWTVEAKQ AFQKSKSLLI QAPVLALPDF NMQFVLEVDA
CGYGIGAVLM QAHHPIAFIS RALSSQQHAL STYEKELLAV VFAVQKWRHY LLNKQFIIKT
DHRSLKYILD QRLTTSFQQK WLIKLMEFDF IIEYKEGKTN IAGDALSRKE DPTCCSVNIH
TVSTDLLDKI QASWRTDLSL KKIINDVKTN PDSHRHYTWR NEELRRKGRL VIGNNGDLRT
QILNWLHSSS IAGHSGINAT IQRAKSVIYW KGLTRDITEF IRKCATCQRC KYETIASLGI
LQPLPIPDHI WQHINMDFIE GLPSSAGKQV IFVVVDRLSK AAHFIGLSHP YQASDVAQAF
LDNIFKLHGF PETITSDRDP IFISNFWQEF MQLQGVETRL SSAYHPQTDG QSEVVNRCLE
TYLRCMCSDT PTEWQAGRKK ILDTKRVPQT TRISDPDNFR GLHSIKATPY EVVYGKPPPA
FLPYLPGESK NAVINRSLSK REEMLKVLKF HLRRAQDRMK QVADRHRTDR QLQMGDMVFV
KLHPYRQVSV AARSNAKLAP KYFGPYKIID KIGQVAYKVE LPTSARIHNV FHVSQLKKYV
GDAPTSTDLP VEPEAISLTR EPEDILDRIT VKRHGRAVTK VLVKWKNQVP EDATREYYYD
LKQKYPAFNP
//