GenomeNet

Database: UniProt
Entry: A0A151UHN7_CAJCA
LinkDB: A0A151UHN7_CAJCA
Original site: A0A151UHN7_CAJCA 
ID   A0A151UHN7_CAJCA        Unreviewed;      1060 AA.
AC   A0A151UHN7;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 25.
DE   SubName: Full=Transposon Ty3-I Gag-Pol polyprotein {ECO:0000313|EMBL:KYP78758.1};
GN   ORFNames=KK1_048671 {ECO:0000313|EMBL:KYP78758.1};
OS   Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX   NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP78758.1, ECO:0000313|Proteomes:UP000075243};
RN   [1] {ECO:0000313|EMBL:KYP78758.1, ECO:0000313|Proteomes:UP000075243}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX   PubMed=22057054; DOI=10.1038/nbt.2022;
RA   Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA   Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA   Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA   Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA   Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT   "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT   of resource-poor farmers.";
RL   Nat. Biotechnol. 30:83-89(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYP78758.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGCT01059561; KYP78758.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A151UHN7; -.
DR   OMA; RFHEDAN; -.
DR   Proteomes; UP000075243; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.10.20.370; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041373; RT_RNaseH.
DR   PANTHER; PTHR35046:SF9; CCHC-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR35046; ZINC KNUCKLE (CCHC-TYPE) FAMILY PROTEIN; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF17917; RT_RNaseH; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT   DOMAIN          196..375
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          712..872
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          1010..1029
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KYP78758.1"
SQ   SEQUENCE   1060 AA;  121810 MW;  B400B84D75E5A035 CRC64;
     CDVVPMEACH ILMGRPWQFD KQTLHDGLTN KITFTHKDKK FVLHPLSPSQ VIEDQVRMKA
     KREQEKKKLL KIDEKEDSRE INVPSLEIVQ EESFQSKPLH KTSLFIEPSS HILMCRGTLT
     CTATSSLETS LPLEVKNLLN EFDDIFPKEG PMGLPPFRGI EHQIDLVPGA SLPNRPAYRT
     NPQETKEIEK QVQELLEKGW IQKSLSPCAV PVILVPKKDG KWRMCCDCRA INNITIKYRH
     PIPRLDDMLD ELHGSSIFSK VDLKSGYHQI RIKEGDEWKT AFKTKFGLYE WLVMPFGLTN
     APSTFMRLMN HALRDCIGKF VVVYFDDILI YSQSLSDHVD HLRQVFLVLR DNHLFANVDK
     CTFCVDNVIF LGFVVSKNGV HVDPEKIKAI QEWPIPTNVS EVRSFHGLAS FYRRFVPNFS
     TLASPLNELV KKDVVFEWKE KHNLAFQDLK HKLTQAPVLA LPDFSKTFEL ECDASGLGIG
     AVLLQGGHPI AYFSEKLHGA TLNYPTYDKE LYALVRALQT WEHYLVTKEF VIHSDHESLK
     YLKGQHKLNK RHAKWVEYLE QFPYVIKYKK GSTNVVADAL SRRHVLLNTL GSQILGFDDI
     KELYEKDLDF ANFYSLCIQK PYQGYYISEG FLFKENKLCI PQGSIRKLLV RESHEGGLMG
     HFGIEKTLSL LREKFFWPHM KRDVQRFCSS CIACLQAKST TKPHGLYTPL PISSSPWVDI
     SMDFILGLPR TQRGKDSIFV VVDRFSKMAH FIPCHKVDDA SNIAKLFFQE IVRLHGLPKT
     IVSDRDVKFL SHFWKTLWAR LGTKLLFSTT CHPQTDGQTE VVNRSLGTML RAILKGNKKS
     WDDYLPHVEF AYNRVVHKTT NMSPFEIVYG FNPLTPLDLL PLPDVASFIH KEGTSRAEFV
     KKLHERVRDH IQSQTEKYQK YNNKGRKEVI FKEGDWVWLH LRKDRFPSKR KSKLSPRGDG
     PFQILRKINN NAYVLDLPSE YGVSSSFNVS DLSLFTGLAT LEEDALDLRS NPLQEGGDDG
     GGPWAKGPTT RAMARRMHEE WAQAQERPIT LFCWALAQAH
//
DBGET integrated database retrieval system