ID A0A151UHN7_CAJCA Unreviewed; 1060 AA.
AC A0A151UHN7;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Transposon Ty3-I Gag-Pol polyprotein {ECO:0000313|EMBL:KYP78758.1};
GN ORFNames=KK1_048671 {ECO:0000313|EMBL:KYP78758.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP78758.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP78758.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYP78758.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCT01059561; KYP78758.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151UHN7; -.
DR OMA; RFHEDAN; -.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR35046:SF9; CCHC-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR35046; ZINC KNUCKLE (CCHC-TYPE) FAMILY PROTEIN; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 196..375
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 712..872
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1010..1029
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KYP78758.1"
SQ SEQUENCE 1060 AA; 121810 MW; B400B84D75E5A035 CRC64;
CDVVPMEACH ILMGRPWQFD KQTLHDGLTN KITFTHKDKK FVLHPLSPSQ VIEDQVRMKA
KREQEKKKLL KIDEKEDSRE INVPSLEIVQ EESFQSKPLH KTSLFIEPSS HILMCRGTLT
CTATSSLETS LPLEVKNLLN EFDDIFPKEG PMGLPPFRGI EHQIDLVPGA SLPNRPAYRT
NPQETKEIEK QVQELLEKGW IQKSLSPCAV PVILVPKKDG KWRMCCDCRA INNITIKYRH
PIPRLDDMLD ELHGSSIFSK VDLKSGYHQI RIKEGDEWKT AFKTKFGLYE WLVMPFGLTN
APSTFMRLMN HALRDCIGKF VVVYFDDILI YSQSLSDHVD HLRQVFLVLR DNHLFANVDK
CTFCVDNVIF LGFVVSKNGV HVDPEKIKAI QEWPIPTNVS EVRSFHGLAS FYRRFVPNFS
TLASPLNELV KKDVVFEWKE KHNLAFQDLK HKLTQAPVLA LPDFSKTFEL ECDASGLGIG
AVLLQGGHPI AYFSEKLHGA TLNYPTYDKE LYALVRALQT WEHYLVTKEF VIHSDHESLK
YLKGQHKLNK RHAKWVEYLE QFPYVIKYKK GSTNVVADAL SRRHVLLNTL GSQILGFDDI
KELYEKDLDF ANFYSLCIQK PYQGYYISEG FLFKENKLCI PQGSIRKLLV RESHEGGLMG
HFGIEKTLSL LREKFFWPHM KRDVQRFCSS CIACLQAKST TKPHGLYTPL PISSSPWVDI
SMDFILGLPR TQRGKDSIFV VVDRFSKMAH FIPCHKVDDA SNIAKLFFQE IVRLHGLPKT
IVSDRDVKFL SHFWKTLWAR LGTKLLFSTT CHPQTDGQTE VVNRSLGTML RAILKGNKKS
WDDYLPHVEF AYNRVVHKTT NMSPFEIVYG FNPLTPLDLL PLPDVASFIH KEGTSRAEFV
KKLHERVRDH IQSQTEKYQK YNNKGRKEVI FKEGDWVWLH LRKDRFPSKR KSKLSPRGDG
PFQILRKINN NAYVLDLPSE YGVSSSFNVS DLSLFTGLAT LEEDALDLRS NPLQEGGDDG
GGPWAKGPTT RAMARRMHEE WAQAQERPIT LFCWALAQAH
//