GenomeNet

Database: UniProt
Entry: A0A151UG51_CAJCA
LinkDB: A0A151UG51_CAJCA
Original site: A0A151UG51_CAJCA 
ID   A0A151UG51_CAJCA        Unreviewed;       794 AA.
AC   A0A151UG51;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 18.
DE   SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:KYP78285.1};
GN   ORFNames=KK1_045812 {ECO:0000313|EMBL:KYP78285.1};
OS   Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX   NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP78285.1, ECO:0000313|Proteomes:UP000075243};
RN   [1] {ECO:0000313|EMBL:KYP78285.1, ECO:0000313|Proteomes:UP000075243}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX   PubMed=22057054; DOI=10.1038/nbt.2022;
RA   Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA   Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA   Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA   Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA   Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT   "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT   of resource-poor farmers.";
RL   Nat. Biotechnol. 30:83-89(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYP78285.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGCT01047044; KYP78285.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A151UG51; -.
DR   OMA; DYCPNNR; -.
DR   Proteomes; UP000075243; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR025724; GAG-pre-integrase_dom.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR   Pfam; PF13976; gag_pre-integrs; 1.
DR   Pfam; PF14223; Retrotran_gag_2; 1.
DR   Pfam; PF00665; rve; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT   DOMAIN          391..557
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
SQ   SEQUENCE   794 AA;  92567 MW;  B6890D1661A3909D CRC64;
     MRIFIEAIDI AVWDAIENGP YIPMTKDGDG KREKHWSEWS DDEKKRAQYD YRAKNIITSA
     LSIDEFFRIS QCKSAKEMWD TLQVTHEGTS DVKRSRKHTL IREYELLRMN HGESISDFQK
     RFTHLINHLV DLGRKFEEEE LNLKVLQCLD RSWQAKVTTI EESKDLTSLT LATLFGKLRT
     FFRKMNSKTK RENECLRAKS LLWYLDSGCS RHMTGDLSKF SSLKLKNEGF VTYGDNKKGK
     ILGHGNIGNS SFSTLIENVL LVEGLKHNLL SISQLSDKGF KIEFDNTCCL IYDKLTKEIR
     CIGQTIDNIY MLDLEHSITI SNTKCLITKE NNIWFWHRRA THIHMDHLNK LSRNELVIGL
     PKLKFNKDKL CDACQKGKQV KASFKSKNLI STSIPLQLIH TDFFGPSRTM SLGGNYYGLV
     MVDDYSRFTW VMFLANKSEA FNGFKKFAKL LQNEKNTNIT SIRSDHGGEF QNILFQKFCE
     EHGINHNFSA PRTPQQNGVV ERKNRSLEEL ARTMLNETKL PKYFWADAIN TTCHVLNKVL
     IRPILKRTPY EIYNGRKPNI SYFRVFGCKY FVLNNGKEQL CKFNATTDET IFLGYSKNRK
     AYRVYNKRTL VVGESVHVVF NETNKQETKQ TEIEDLTDLL DQPPLENEQS EMAKESESME
     TIEKSREQLP KEWKTSKDLS IENIIGNIGK GVHDSITTRR SIKNICNIMD FVSQVEPKTI
     DEALYDEHWL MTMQEELNQF EKNEVYKIID YSRDSLYSTS QIIDYCPNNR LFQRLLEFLF
     SQIIDYCPNN RLFS
//
DBGET integrated database retrieval system