GenomeNet

Database: UniProt
Entry: A0A151RV81_CAJCA
LinkDB: A0A151RV81_CAJCA
Original site: A0A151RV81_CAJCA 
ID   A0A151RV81_CAJCA        Unreviewed;      1139 AA.
AC   A0A151RV81;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 28.
DE   SubName: Full=Transposon Ty3-I Gag-Pol polyprotein {ECO:0000313|EMBL:KYP46454.1};
GN   ORFNames=KK1_031967 {ECO:0000313|EMBL:KYP46454.1};
OS   Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX   NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP46454.1, ECO:0000313|Proteomes:UP000075243};
RN   [1] {ECO:0000313|EMBL:KYP46454.1, ECO:0000313|Proteomes:UP000075243}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX   PubMed=22057054; DOI=10.1038/nbt.2022;
RA   Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA   Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA   Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA   Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA   Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT   "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT   of resource-poor farmers.";
RL   Nat. Biotechnol. 30:83-89(2012).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KQ483558; KYP46454.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A151RV81; -.
DR   Proteomes; UP000075243; Unassembled WGS sequence.
DR   GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd00303; retropepsin_like; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 1.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR001969; Aspartic_peptidase_AS.
DR   InterPro; IPR016197; Chromo-like_dom_sf.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR005162; Retrotrans_gag_dom.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR45835:SF103; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR45835; YALI0A06105P; 1.
DR   Pfam; PF13650; Asp_protease_2; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF03732; Retrotrans_gag; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 1.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF54160; Chromo domain-like; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS00141; ASP_PROTEASE; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW   Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW   RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT   DOMAIN          722..886
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          191..225
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        191..216
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1139 AA;  129429 MW;  111E441BDCAF4812 CRC64;
     MVLKQHSSTE SASSNNGHNQ PFQVRSVKLD FPRFDGSEVL QWIFKAEQFF SYYRTPDDQR
     LLIASIHLDK DVVPWYQMMI REHPFHSWIA FTRALEMEFG PSPYEGPRSQ LFKLTQTNSV
     QAYYVQFTAL ANRVQGVTQE ALLDCFVGGL KPDIRRDVIA QSPPSLLRTV SLAKLYEEKY
     TIKPKPFSSS FFQKNQTTNT NQTTPQSLKS TSLPPLLPSP DSKPTYVKKL TSAEMQLRRD
     KGICFTCDDK FSPNHRCPNK QYFVLQWEED DEPELQPEPP DVIEAVMGTG SQDHHLSYNA
     LNGSSGLGTM KFQGSINGVR VQILLDSGSS DNFLQPRIAQ CLKIPVEPIP NLQVLVGNGN
     SLVAEGLIRD LGVRIQGHTL KLPVYLLPVS GADLVLGAAW LATIGPHISN YSTLTLKFYL
     GNQFITLYGQ KPSLPQPAQF NHMRRMQHTH AIAELFTLQF SHIDAPNDQL LYFPADMEPE
     LAMLLYTYRT VFDVPSGLPP RFLGLTGYYR KFIKGYASIA APLSNLLKKE SFHWTEQTTA
     AFENLKAAVT KAPVLALPDF SKLFTLETDA SGIGIGAVLN KPGKENLAAD CLSRSFFMAW
     SEPKLQIVHV LKEALQADLQ LQSIKELCLQ NKAPDPHYSV HDQLLYWKGR LVIPNSHNLV
     KQILYEFHTS LLGGHAGMAR TLARISAQFY WSGMQKDVKE FVQQCLVCQQ AKSATTLPAG
     LLQPLPIPMQ IWEDLAMDFI TGLPLSHGFT VILVVIDRLS KYAHFFTMKT DYTSKQVAEI
     FVKNVVKLHG FPKTIVSDRD KVFTSQFWQH LFKLSGTTIN LTTAYHPQSD GQSEALNKCL
     EMYLRCFTHD SPKDWAQLLP WAEFWYNTAC HNSSGMTPFK VVYGRDPPKL IKYTVDQADP
     VSLQEQLLTR DLTINKLKQN LHKAQGHMKK YADQKRRQLE FQIGDLVLVK LQPYRQHSVA
     LRKNQKLSLR YFGPFPVIER IGLVAYKLLL PSTTKIHPVF HVSQLKPCKG EHTTPYIPFP
     FTNVKVQPII QPAKILKERV IVQGQQQVPQ KLVQWQGFEN EQATWEDTTA LQQAFPDFNL
     EDKVNLKGGG IVTRQNMIKE SLAEKESEEH VEEWKLRARK GSRPKITNTK LKDYHWLKE
//
DBGET integrated database retrieval system