ID A0A151RV81_CAJCA Unreviewed; 1139 AA.
AC A0A151RV81;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Transposon Ty3-I Gag-Pol polyprotein {ECO:0000313|EMBL:KYP46454.1};
GN ORFNames=KK1_031967 {ECO:0000313|EMBL:KYP46454.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP46454.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP46454.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ483558; KYP46454.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151RV81; -.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR45835:SF103; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR45835; YALI0A06105P; 1.
DR Pfam; PF13650; Asp_protease_2; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 722..886
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 191..225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..216
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1139 AA; 129429 MW; 111E441BDCAF4812 CRC64;
MVLKQHSSTE SASSNNGHNQ PFQVRSVKLD FPRFDGSEVL QWIFKAEQFF SYYRTPDDQR
LLIASIHLDK DVVPWYQMMI REHPFHSWIA FTRALEMEFG PSPYEGPRSQ LFKLTQTNSV
QAYYVQFTAL ANRVQGVTQE ALLDCFVGGL KPDIRRDVIA QSPPSLLRTV SLAKLYEEKY
TIKPKPFSSS FFQKNQTTNT NQTTPQSLKS TSLPPLLPSP DSKPTYVKKL TSAEMQLRRD
KGICFTCDDK FSPNHRCPNK QYFVLQWEED DEPELQPEPP DVIEAVMGTG SQDHHLSYNA
LNGSSGLGTM KFQGSINGVR VQILLDSGSS DNFLQPRIAQ CLKIPVEPIP NLQVLVGNGN
SLVAEGLIRD LGVRIQGHTL KLPVYLLPVS GADLVLGAAW LATIGPHISN YSTLTLKFYL
GNQFITLYGQ KPSLPQPAQF NHMRRMQHTH AIAELFTLQF SHIDAPNDQL LYFPADMEPE
LAMLLYTYRT VFDVPSGLPP RFLGLTGYYR KFIKGYASIA APLSNLLKKE SFHWTEQTTA
AFENLKAAVT KAPVLALPDF SKLFTLETDA SGIGIGAVLN KPGKENLAAD CLSRSFFMAW
SEPKLQIVHV LKEALQADLQ LQSIKELCLQ NKAPDPHYSV HDQLLYWKGR LVIPNSHNLV
KQILYEFHTS LLGGHAGMAR TLARISAQFY WSGMQKDVKE FVQQCLVCQQ AKSATTLPAG
LLQPLPIPMQ IWEDLAMDFI TGLPLSHGFT VILVVIDRLS KYAHFFTMKT DYTSKQVAEI
FVKNVVKLHG FPKTIVSDRD KVFTSQFWQH LFKLSGTTIN LTTAYHPQSD GQSEALNKCL
EMYLRCFTHD SPKDWAQLLP WAEFWYNTAC HNSSGMTPFK VVYGRDPPKL IKYTVDQADP
VSLQEQLLTR DLTINKLKQN LHKAQGHMKK YADQKRRQLE FQIGDLVLVK LQPYRQHSVA
LRKNQKLSLR YFGPFPVIER IGLVAYKLLL PSTTKIHPVF HVSQLKPCKG EHTTPYIPFP
FTNVKVQPII QPAKILKERV IVQGQQQVPQ KLVQWQGFEN EQATWEDTTA LQQAFPDFNL
EDKVNLKGGG IVTRQNMIKE SLAEKESEEH VEEWKLRARK GSRPKITNTK LKDYHWLKE
//