Database: UniProt
Entry: A0A151R8A5_CAJCA
ID   A0A151R8A5_CAJCA        Unreviewed;      1390 AA.
AC   A0A151R8A5;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 28.
DE   SubName: Full=Transposon Ty3-G Gag-Pol polyprotein {ECO:0000313|EMBL:KYP38605.1};
GN   ORFNames=KK1_040129 {ECO:0000313|EMBL:KYP38605.1};
OS   Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX   NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP38605.1, ECO:0000313|Proteomes:UP000075243};
RN   [1] {ECO:0000313|EMBL:KYP38605.1, ECO:0000313|Proteomes:UP000075243}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX   PubMed=22057054; DOI=10.1038/nbt.2022;
RA   Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA   Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA   Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA   Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA   Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT   "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT   of resource-poor farmers.";
RL   Nat. Biotechnol. 30:83-89(2012).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KQ483985; KYP38605.1; -; Genomic_DNA.
DR   Proteomes; UP000075243; Unassembled WGS sequence.
DR   GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.10.20.370; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR001969; Aspartic_peptidase_AS.
DR   InterPro; IPR016197; Chromo-like_dom_sf.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR005162; Retrotrans_gag_dom.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF03732; Retrotrans_gag; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF08284; RVP_2; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF54160; Chromo domain-like; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS00141; ASP_PROTEASE; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   4: Predicted;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW   Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW   RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT   DOMAIN          503..682
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          1021..1197
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          161..188
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1390 AA;  159693 MW;  EB0ABDFD83DD521D CRC64;
     MYQCENYFLI DATPEDVKVR LAIVHLEGKA LQWHTAISKN LVNQQPSWEE YMKMLQDRFG
     DICDDPMAEL MKLRQEKGVS EYHEAFDAII SRLDLTEEYR LSCFLGGLKH EIQMMVRMFR
     PDSVRRAFSL AKMYKASQPQ GPLAMASKPL SLNSRNNKNV MNSRPLLPTP TDQAKYNQPE
     FTSNKTKPYR NLTPTYMADR RSKGLCYFCD EPYSQAHSLT HKKLQLHVIE VEETSNDPTF
     EEELPDSDSA DMGEPQISVH ALTDIPNFKT MRITGYYNKK PLHILIDSGS THNFLDVHIA
     KKLGCRIDNL EPMHVTVADD SKLNIEAMVK DFKWTIQQTM FTSDMMLLSL GCCDLVLGIE
     WLITLGDITW KFDKLSMQFY AQGRKHVLRS AQLQGMKTVR RKQFGRILKE GVHISMIQLC
     NQEGALLHTL TTHGQLPVPT PKIQYILGEF EDVFQEPTQL PPVRSDHDHK IPLVQGSNPV
     NKRPYRYAKQ QKDVIDKLVK EYLNTGIIQA SNSPYASPVV LVGKKDGSWR LCVDYRELNK
     ATVKDKFPIP LVEDLLDELH GSTIYSKIDL RSGYNQVRMH PMDVHKTAFK THGGHYEYLV
     MPFGLTNAPA TFQGLMNSVF QEYLRQFLLV FFDDILIYSK SIEDHMQHLQ LVLQTMRQNN
     LFARKSKCYF AVTKVEYLGH FINAEAVSTD PSKIEAVKNW PLPETLKQLR GFLGLAGYYK
     RFVRGYGGIA KPLTELLKKD NFTWTVEAKQ AFQKSKSLLI QAPVLALPDF NMQFVLEVDA
     CGYGIGAVLM QAHHPIAFIS RALSSQQHAL STYEKELLAV VFAVQKWRHY LLNKQFIIKT
     DHRSLKYILD QRLTTSFQQK WLIKLMEFDF IIEYKEGKTN IAGDALSRKE DPTCCSVNIH
     TVSTDLLDKI QASWRTDLSL KKIINDVKTN PDSHRHYTWR NEELRRKGRL VIGNNGDLRT
     QILNWLHSSS IAGHSGINAT IQRAKSVIYW KGLTRDITEF IRKCATCQRC KYETIASLGI
     LQPLPIPDHI WQHINMDFIE GLPSSAGKQV IFVVVDRLSK AAHFIGLSHP YQASDVAQAF
     LDNIFKLHGF PETITSDRDP IFISNFWQEF MQLQGVETRL SSAYHPQTDG QSEVVNRCLE
     TYLRCMCSDT PTEWQAGRKK ILDTKRVPQT TRISDPDNFR GLHSIKATPY EVVYGKPPPA
     FLPYLPGESK NAVINRSLSK REEMLKVLKF HLRRAQDRMK QVADRHRTDR QLQMGDMVFV
     KLHPYRQVSV AARSNAKLAP KYFGPYKIID KIGQVAYKVE LPTSARIHNV FHVSQLKKYV
     GDAPTSTDLP VEPEAISLTR EPEDILDRIT VKRHGRAVTK VLVKWKNQVP EDATREYYYD
     LKQKYPAFNP
//