ID A0A151U9T1_CAJCA Unreviewed; 1355 AA.
AC A0A151U9T1;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:KYP76032.1};
GN ORFNames=KK1_020250 {ECO:0000313|EMBL:KYP76032.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP76032.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP76032.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM003603; KYP76032.1; -; Genomic_DNA.
DR Proteomes; UP000075243; Chromosome 1.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 471..637
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 165..202
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 770..799
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 170..184
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 770..788
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1355 AA; 151945 MW; 23BB0FA886F17CC8 CRC64;
MWFLGQGLYD HLTKRASEID KEVRDEWQRA DYQLVSLLWQ SIEPKLMVHF RPYKTCYDIW
KKARNVYAND IQRIYESVHG LATLRMVDND LPTYLNRAQS TIDELKLMLV SDDPQQILNK
LDNMFMVFIL QGLHKDYGSV RDQILTNPVI PTVEELIDRL IRVPSPEADP HTESESSAFI
SSTADRGSRG RGRGRGRGKG GRGNLHCTYC QRDGHTRDRC YSLHGFPSKT ANVVQSPTSA
PELKIEPEGN HPITLSTDDY QEYLQLKATK QASSSITVAH TGNSTVCLSH STSIGPWVLD
SGASDHLTGN VSLFPNLSSP KTPHHITLAD GSKVQATGIG QISPLPSLPL KSVLLVPGCP
FNLISISKLT RSLNCVITFT SDSFLIQDRS TGQTIGAGSE SHGLYYLQPS TSTICASIES
PGLIHRRLGH PSLNKLKKMV PHLSRLVSLE CESCQLGKHV RASFPNSINS RAMSPFDVIH
SDVWGPSRIP SLLGHRYYVT FIDDFSRCTW IFLMKNRSEL FNIFLSFYSE IKTQFGKVIR
ILRSDNAKEY FSDCFKSFMA SHGILHQSSC PHTPQQNGVA ERKHRHIVDT ARTLLLNANA
PPKLWGDAVL TAGYLINRMP SSVLNDQVPH SLLYPLDPLY SVHPRVFGCT CFVHDLFSGR
AKLSARAIKC VFLGYSRVQK GYRCYSPATH RFYTSADVTF FEDTPYFIAT DVSPVDSDLL
SQVLPIPHFD HSVPPTTPAT LEAPDPPALP GSDHPLHRFG ITYERRSHTV VPPNDSSIEP
CDSTPASVSS PTPAPPTSVD LPIALRKGSR STCNPHPIYN FLSYHRLSPT YYAFVSAISS
ITIPKTVQEA LTHPGWRQAM IDEMTALDSN HTWVLVPPPL EKSVVGCQWV FNVKVGPDGQ
VNRLKARLVA NGYTQVYGLD YSDTFSPVAK MASVRLFLAM AAMRHWPLFQ LDIKNAFLHG
DLEEEIYMDQ PPGFVAQGGS GLVCKLQKSL YGLKQSPRAW FGRFSKVIQE FGMIRCETDH
SVFFRRSSTH RFIYLVVYVD DIVITGDDQE GIKALKQHLF KHFQTKDLGP LRYFLGIEVA
QSKSGIAISQ RKYALDILEE TGLTDCKPVD TPMDPNVKLM PNQGEPYPDP GRYRRLVGKL
NYLTMTRPDI SFPVSVVSQF LNSPCESHWL AVVRILRYIK RSPGKGLVYN DRGHTNIVGY
SDADWAGDAS DRRSTSGYCV FMGGNLVSWK SKKQSVVARS STEAEYRAMA HTTCELLWLK
FLIQELQFCK VGHMELVCDN QSALYLSSNP VFHERTKHIE VDCHFIREKI LSDIIKTSSV
CSKDQLADIF TKSLRGPRIT YICNKLDVYD IYAPA
//