GenomeNet

Database: UniProt
Entry: A0A151SGE0_CAJCA
LinkDB: A0A151SGE0_CAJCA
Original site: A0A151SGE0_CAJCA 
ID   A0A151SGE0_CAJCA        Unreviewed;       464 AA.
AC   A0A151SGE0;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 18.
DE   SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:KYP53808.1};
GN   ORFNames=KK1_024382 {ECO:0000313|EMBL:KYP53808.1};
OS   Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX   NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP53808.1, ECO:0000313|Proteomes:UP000075243};
RN   [1] {ECO:0000313|EMBL:KYP53808.1, ECO:0000313|Proteomes:UP000075243}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX   PubMed=22057054; DOI=10.1038/nbt.2022;
RA   Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA   Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA   Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA   Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA   Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT   "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT   of resource-poor farmers.";
RL   Nat. Biotechnol. 30:83-89(2012).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KQ483411; KYP53808.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A151SGE0; -.
DR   Proteomes; UP000075243; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR025724; GAG-pre-integrase_dom.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR036875; Znf_CCHC_sf.
DR   PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR   Pfam; PF13976; gag_pre-integrs; 1.
DR   Pfam; PF14223; Retrotran_gag_2; 1.
DR   Pfam; PF00665; rve; 1.
DR   SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT   DOMAIN          370..464
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          82..121
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        103..121
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KYP53808.1"
SQ   SEQUENCE   464 AA;  52496 MW;  34C96B5C4F33A9A8 CRC64;
     LLKNFVELKY KEGTPISNHL SEFQGRYDQL AGVGIKFDDD LLGLFLLNSL PDSWETFRVS
     MISATPNGDI SLQMAKRGAL NEEMRRKTRG TSSYSEVLVT ENMGRSQKKE QKSGRDKSRS
     KSISRYKNVE CHYCHKIGHI QKNCFLWKKE SKDKKGKHRE RDHNDGDRVT TATCGDLVCL
     RDYDIVNLVS DESMWIVDSG ATLHVTSRKE FFTSYTSGDF GVLKMGNDGV SKVIGVGDVC
     LQTNMGMQLL LKGVKHVPDV RFNLISVQVL DDGGYDNHFG SGKWKLTKGN LIVAKGEKNS
     KLYWTKALVA KDSVNAMYME SSLWHRRLGH ISEKGMNCLA KKDMLLGLKN VELEKCSHCM
     AGKQARVSFK KYPPSRKSEL LELVHSDVCG PLKVKSFNGA LYFVTFIDDC SRKLWVYALK
     TKDQVLEKFK EFHALVERQS GKKLKCIRTD NGGEYCGPVD VYCK
//
DBGET integrated database retrieval system