ID A0A151RRQ4_CAJCA Unreviewed; 1217 AA.
AC A0A151RRQ4;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Retrotransposable element Tf2 {ECO:0000313|EMBL:KYP45237.1};
GN ORFNames=KK1_033252 {ECO:0000313|EMBL:KYP45237.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP45237.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP45237.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ483597; KYP45237.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151RRQ4; -.
DR OMA; WTTASHE; -.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR45835:SF103; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR45835; YALI0A06105P; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 882..1003
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 24..46
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1217 AA; 138874 MW; F667856807A011F3 CRC64;
MRETIVKSAE QLNEVQASVR EIHARQEEEN SEASHTILPS RPQGGHNGGR YKVDRWRKLE
IPIFDGEDAY GWTTRVERYF DLKGMSGEEK LQAVMVAMEG KALTWYQWWE FSTHQPTWED
FRVAVIRHFQ PTMVDNPFEL LLSLKQTGSV EEYREKFELY AGPLKSAEPA YLKGIFLNGL
KELIKAELKL HPTESLVDLM DCAQRVDEKN QLITKGGNST SMMSRPMRTY NSNCKVTWEP
GAKQQPHVFG TGSSTGEGSV VKPSNTFRGR SFRKLTDAEL QERSRKGLCF RCDQKFGPGH
VCAHKQLQIL VLAEGEEEGD VEGEMTAIDE NGELTHIQLS MCSIAGLTSK KTIKLWGKIG
EEQVMVLINC GASHNFISAK FVKQQKLDMY ATSIYTVEVG DGRKLDCEGV CPKLKLEIQG
LNIEQDFYVF DLGGVDVVLG MEWLASLGEI RANFRELTLR IPITSKYHTL KGDSDLTRAL
ASLKSILKAL RDQGHGFLVT YCQMQAESKE PATCPAWTKL VLEDYDSIFQ EPQGLPPPRR
QDHAIRLKEG SNIPNLRPYR YPHYQKNEIE RLIDDMLKSG IIRPSVSPYS SPIILVKKKD
GGWRFCVDYR ALNKITIPDK FLIPIIDELL DELGGAVIFS KLDLRSSYHQ IRMREEDIHK
TAFRTHEGHY ERFVRDYGKI ARPLTQLLKK DKFQWNAEAQ AALDKLKHLV SELPILTDPD
FSKTFIIETD ASNKGLGAVL MQEGRENGVV DALSRKMTFS ALSSIHFEQM ADWESEIHQD
PKLQGIIHDL IKDCNSHPGY KFQNQKLFYK GRLVLPKGSS HIPLLLQEFH NSAIGVHSGF
FRTYKRISEV VYWEGMRKDI QHHVATCETC QRNKYQALSP ARLLQPLPIP NQVWADISMD
FIEGLPKAQG KNVILVVVDR LTKYAHFLAL SHPFTAKEVA EVFITEVVKL HGIVSDRDKI
FLSHFWSELF KLVGTRLKFS TAYHPQTDGQ TEVVNRCLET YLRCLTGSKP KQWPKWLTLY
GRDPPHLLKG TTIPSTVEEV NLMTYDRDQM LHDLKENLVT AQNQMKKYAD QSRRAVSLAI
GDWVYLKLKP YRLRSLARKR NEKMSPRFYG PYQITKQIGV VAFQLALPPE SKIHPVFHVS
LLKKALSPSA TPQPLPPMLS EELELQVTPA AVKVVRNTAH GIAEVLIQWT DLPEFEATWE
PVEPIMKQFP SFTLRTR
//