ID A0A151REJ2_CAJCA Unreviewed; 1374 AA.
AC A0A151REJ2;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:KYP41064.1};
GN ORFNames=KK1_037587 {ECO:0000313|EMBL:KYP41064.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP41064.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP41064.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ483797; KYP41064.1; -; Genomic_DNA.
DR OMA; TIMVANQ; -.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 461..627
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 209..238
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1374 AA; 155414 MW; E45F26DCB3AC05C5 CRC64;
MIMAWLWNSM VPEISDTCMF LKSAKEIWEA VEQTYSKAKD AAQIYDVKVK TVAAKQGNRS
VTEYANQLKS LWMELDHYRV IKAKCSEDSA ILKEYIEQDR VYDFLVGLNP EYDQVRIQIL
GKEKVPGLNE VVAIIRSEES RRGLMLETST TESSAMIAEG GTIMVANQRK NWVPSMEKKH
EEVWCTHCNK PRHTREKCWK LHGKPPSREW GLKDGKTSSR EWGPKGGPPK KGGQGQAYIA
NGQGEESVQL NHEEIERVRS ILSKLEKPTD TPFTHYWILD SGATDHMTPL PKYFSTYSPC
PSNKKISTAD GTLITAAGQG EVQISPSMTL KNVLHVPKLS TNLISIQKLT KDLSCNVVFY
SNSCILQDKN SGRTIGHARE WNGLYYMEDP NLPTKSLISL STMTNKEKAQ LYHCRLGHPS
FRVIKVLFPS LFKNLNVESL HCEVCEFAKH KRVPFPISNK MSSFPFFLVH TDVWGPAHVP
NISGAKWFLT FIDDCTRVTW VFLLKQKSEV SSVFVQFISM IKNQFGVGIK RIRSDNAKDY
FNLVLNSFCQ KEGIIHESSC VNTPQQNGIA ERKNGHLLDQ TRALLFQNHV PKRFWGEALL
TATYLINRLP TKILNLKSPM EVLSSFYPHL HPTNKLQPRI FGCVSFVHVH SNERGKLDPR
AVKCVFLGYS TTQKGYKCFH PISKRFYVSR DVTFNEQESY FKQPHLQGEN VREEDETLMF
PNMTFGPEIG TNGIAVPEIE GRTEPAPEPA PPAPNGGKFG KNLVYSRREK AILESGNVQE
SNPPSLHEVT PSNPINSNDS NEFVSENLEA QVDQTLDLPI ALRKGTRTCT QQPLYPLSNF
LSFEKFSPTH KTFLTNLNST QTPSSVSEAL SDSKWKHAMD VEMEALNKNR TWELVTLPPG
KKPVGCKWVY AVKYRANGTI ERYKARLVAK GFTQTYGVDY LETFAPVAKM NTVRVILSLA
ASYDWDLQQF DVKNAFLHGD LEEEIYMELP PGYNGQVAAG TVCKLRKALY GLKQSPRAWF
GRFTKVMTSL GYKQSQGDHT LFIKHSVSGG VTILLVYVDD IIVSGDDKRE QQLLSECLAT
EFEIKTLGRL KYFLGIEVAH SKKGIFISQQ KYITDLLKET GKTGCRPAST PVDPNIKLGS
MEEDIAVDKE MYQRLVGRLI YLSHTRPDIA FVVSLVSQFM HQPKEAHLQA ALRIVQYLKG
TPGRGILFKR NKSVSLEAYT DADYAGSVVD RRSTTGYCTF LGGNLVTWKS KKQSVVARSS
AEAEFRAMAH GICELLWLKI ILEDLKIKWD EPMRLYCDNK SAISIAHNPV QHDRTKHIEV
DRHFIKEKLD SGMICTPYVS TQNQLADILT KGLNCTHFER IISKLGMENT YSPA
//