ID A0A151T1T1_CAJCA Unreviewed; 1316 AA.
AC A0A151T1T1;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:KYP61022.1};
GN ORFNames=KK1_023446 {ECO:0000313|EMBL:KYP61022.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP61022.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP61022.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM003611; KYP61022.1; -; Genomic_DNA.
DR OMA; PRRICTH; -.
DR Proteomes; UP000075243; Chromosome 9.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 461..624
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 722..749
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 722..741
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1316 AA; 148206 MW; D039374D37F24F69 CRC64;
MVVSWLVHSV SPSIRQSILW MDQADDIWKD LKTRYSQGDL LRVSDLQLEA SSLKQGDLSV
TEYFTKLRIL WDELENFRPD PNCTCTIKCA CSVLTIIAQR KLEDQAMQFL RGLNDQYNNV
KSHVLLMEPP ISKIFSYVVQ QERQLLGQNF IANVHLDRTS SINAVASPTC THCNRTGHTD
AVCFRKHGFP STTNRSNKGS SNRPRRICTH CGKTGHTIEV CYQKHGFPPG SKPLSDKTAS
VNHTVTGDCK VTESTQSLES QDVRTSFPIV CSAISTPSSW ILDSGATDHV SSSLSHFSSF
SPINPIDVKL PTGQHVLATH SGTVKFTNSF YLVDVLYIPA FTYNLISISK LVSSLSCQLI
FDHHSCIIQE TNTMKKIGTV DVNEGLYSFT ASNIHHPSTN SVIVHPKCSI QPIDLWHFRM
GHLSHERLQS FPFLSVDTSF SCNTCHHAKQ KKLPFPLSHS YASQPFDLLH MDIWGPCSVT
SMHGHKYFLT IVDDHTRFTW LFLMQNKSET RQHIINFINQ VETQFDKHVK VIRTDNGLEF
SMTQYFSSKG IIHQTTCVET PQQNGIVERK HQHLLNVTRS LLFQANLPSI FWCFALMHAT
FLINCIPTPF LHNISPFEKL YGHPCDISIL CVFGCLCYSS TITSHRTKLD PRAHPCIFLG
FKPHTKGYLL FNLHTHGLLV SRNVLFHEDH FPSFTKPHSP SFSSPVPIHY NYVDYPTFPS
SSIVESSDPP TSDQHSSPPP LRRSTRPRRP PTYLQDFHGA FTSTSTAHSS TGIRHPLHSF
LSYDLLSPSF HHYVFSISSV TEPKNFAEAS KSDSWLKAMH EEIFALEANN TWVLTTLPPH
KTAIGCRWVY KVKHKADGSI DRYKARLVAK GYTQMEGLDF FDTFSPVAKL TTVRLLLSLA
AINNWHLKQL DVNNAFLHGD LNEEVYMQLP PGLTPSFPGQ VCRLQRSLYG LKQASRQWYA
RLSSFLIQHG YVPSPSDHSL FLKCSPATTT AILIYVDDIV LAGNDLTEIH HLTSLLHTTF
QIKDLGNLKY FLGLEVARNH TGIHLCQRKY ILDLLSDTGM LASKPVSTPM DYSMHLSASS
GTPLTDTAAY RRLVGRLIYL TNTRPDITYA VQQLSQFVSN PTTAHRQALF RILRYLKGTP
GSGIFLSVNS SVQLRAFSDS DWAGCPDTRR SITGFAVYLG DSLISWKSKK QITVSRSSSE
AEYRALATTT CELQWLSYLL KDFHIDPISP SILYCDNQSA LQIASNPVFH ERTKHIEIDC
HIVRDKVSTG LLKLLPVSSS QQLADILTKP LSPFVFRSHC SKLGMLNIHS QLEGGS
//