ID A0A151TN47_CAJCA Unreviewed; 1423 AA.
AC A0A151TN47;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:KYP68426.1};
GN ORFNames=KK1_022050 {ECO:0000313|EMBL:KYP68426.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP68426.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP68426.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM003606; KYP68426.1; -; Genomic_DNA.
DR OMA; QANERWT; -.
DR Proteomes; UP000075243; Chromosome 4.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR029472; Copia-like_N.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF14244; Retrotran_gag_3; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 535..707
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 801..835
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 820..835
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1423 AA; 161344 MW; 256010FB99449624 CRC64;
MTVQTNPLLD PLSPYFLPSN ENPSISLVSA PLNDHNYLSW SQSMLLVLGT KNKLAFIDGS
LARPVTAGVD QTAWDRCNKL VISWIVQSLD TSLIPSVIWM PTASQIWNDL KKRYYQGDAF
RISELLEEIY SLKQGNMSIT HYFTTLQGLW QELDHFCPIP SCTCFTTECV IRFLKGLNEQ
YSNVRSQVML MDPLPSVQKV FSMILQQERE FHGTNDNQVL AVTSNNERNN YKGSKTFKRN
KDYNTKVCSH CGRIGHLVDS CYKKHGPPLQ HKHGRIVNQY QSVSDEDTDD DQSVHSQRVV
SHNSGNMFTP EQHQALLALL QQSGSTSSHS VNQLAGPSIL SSPKPPGMIL SFPYSHNSDC
WIIDTGATDH VCKNINFFQS YRRIKPILIQ LPNGSQVSTC VSGTVLLSKT CYLTDVLFIP
NFHFNLISVS KLAKTLSCTL TFSDSDCQIQ ANHSMRMIGA AELRAGLYAM VSSPESNVVH
HCTSHFFTYQ SDLWHLRLGH LSHDKLSALK GSYPEIQCNK ISLPCEICHL AKHKRLPFPD
SLTKSENVFD LIHVDIWGPL SVASIFGHKY FLTIVDDKSR FTWIFFMKNK FETKLLLQNF
VSFVQTQFQQ NIKTIRTDNG SEFLLKDWYA KLGIVHQTSC VNTPQQNGVV ERKHQHILSM
ARALMFQSNV SKMFWNYAIG HAVHLINRLP TRFLQQNSPY YVLYSEKPDF SHLKVFGCLA
FASTLSHNRT KLEPRSRKCM FLGYSSGTKG FIMYDLKTRE TFISRDVQFY ENIFPLQKDF
SIQSTDGPVV PIAQMPLTSC DPIPSHTHDN LDETEHEHNS STLPMTNSSN SDQPNIEINI
PEIRRTSQRV KNRPGYLQDY HCTLAASKVD QSSSTARYPI SDYLPYTSYS AVQQSFVSTI
SSIIEPRSYQ DAINHDCWKE AIRAELDALD KQKTWILTDL PPNKRAVGCR WVFKVKYHAD
GSVERYKARL VAKGFTQIPG LDYIDTFSPV VRMTTIRVFL AIAAASNWSV HQLDINTAFL
HGDLVEEVYM KPPPGLILSS PNKVCKLQKS LYGLKQASRQ WNIKLTETLK LFGFVQSKSD
YSLFTKRTNI GFIAILVYVD DLIISGSDET EIMKVKRLLD KQFSIKDLGQ LSYFLGLEFS
RSDQGISVCQ RKYALELLQD TGLLASKPCS TPMDHTTRLH HDPLDLYSDP SSYRRLVGRL
IYLTHTRPDL AFAVGKLSQF MHQPNNAHFQ AARKVLRYVK ATPTKGLFFP SSSDLKLTGY
TDSDWATCPD SRRSISGFCF FLGNALVSWK SKKQNVVSRS SSEAEYRALA LGVCEAQWLH
KLLTDFQLQD LIPISLFCDN QSALYIAANP VFHERTKHVE IDCHTVRDQV QAGFIHLAPI
TSSGQLADIL TKPLLPKMFQ DFVCKLGLSN FTTPSLREGV MMT
//