GenomeNet

Database: UniProt
Entry: A0A151SDY6_CAJCA
LinkDB: A0A151SDY6_CAJCA
Original site: A0A151SDY6_CAJCA 
ID   A0A151SDY6_CAJCA        Unreviewed;      1305 AA.
AC   A0A151SDY6;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 20.
DE   SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:KYP53032.1};
GN   ORFNames=KK1_025032 {ECO:0000313|EMBL:KYP53032.1};
OS   Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX   NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP53032.1, ECO:0000313|Proteomes:UP000075243};
RN   [1] {ECO:0000313|EMBL:KYP53032.1, ECO:0000313|Proteomes:UP000075243}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX   PubMed=22057054; DOI=10.1038/nbt.2022;
RA   Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA   Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA   Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA   Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA   Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT   "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT   of resource-poor farmers.";
RL   Nat. Biotechnol. 30:83-89(2012).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KQ483417; KYP53032.1; -; Genomic_DNA.
DR   Proteomes; UP000075243; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR025724; GAG-pre-integrase_dom.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR013103; RVT_2.
DR   InterPro; IPR001878; Znf_CCHC.
DR   InterPro; IPR036875; Znf_CCHC_sf.
DR   PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR   Pfam; PF13976; gag_pre-integrs; 1.
DR   Pfam; PF14223; Retrotran_gag_2; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF07727; RVT_2; 1.
DR   Pfam; PF00098; zf-CCHC; 1.
DR   SMART; SM00343; ZnF_C2HC; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50158; ZF_CCHC; 1.
PE   4: Predicted;
KW   Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW   Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW   Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT   DOMAIN          229..243
FT                   /note="CCHC-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50158"
FT   DOMAIN          475..641
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          181..229
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          709..754
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1305 AA;  148718 MW;  83B0A20004D62D5E CRC64;
     MAAKFEIEKF NGNNFSLWKL KIRAILRKDN CLKAIEGRPS DITDEKWREM DDNAVANLHL
     AMADSVLSSI AEKTTAKEIW DTLVKLYEVK SLHTRIFLKR KLYTLRMSES TAVTDHINNL
     NTLFAQLSTA DFNIVENERA ELLLQSLPDS YDQLIINITN NNVVGRLSFE DVAGAILEEE
     SRRKNKEDRL ESSKQAEALT MMRGRSMERG SSGSQNHGRS KSRGRKNLKC YNCGMRGHMK
     KDCWHKKSGG GKNSEASTSQ GCVASTQEDG EILYSEAISS RGEKQLHECW ILDSGATWHM
     TPNRDWFCTY ESISGGSVFM GNDHALEIAG VGTIKLKMYD GTIRTIQGVR HVKGLKKNLL
     SIGQLDDLKC KIHVEGGILK VVKGNLVVMK AEKITSNLYL LLGETLQEAD ASVAAISQEE
     ATMMWHRRLG HMSERGLKIL AERNLLSGLK MVTLPFCEHC VTSKQHRLQF AKVTTRSKHI
     LDLIHSDVWE SPEISMGGAK YFVSFIDDYS RRLWVYPIKK KSDVFPVFKE FKAQVELDTG
     KRIKCLRTDN GGEYIDGDFL AFCKQEGIKR QFTVAHTPQQ NGVAERMNRT LLERTRAMLR
     TAGLAKSFWA EAVKTACYLI NRSPSTAIGL KTPMEMWSGK PSNYSSLHVF GCPVYVMYNS
     QERTKLDPKS RKCIFLGYAD NVKGYRLWDP TARKVVVSRD VVFAETELQK EQENDSTIKD
     TSIVEIGGKS KKDDTSEAEQ EHEEQEPDEA NDEEVQQIRR ERRRPSWHSD YVMASHDAYC
     LLTEEGEPST FQEALRSSDV SQWMAAMHEE IEALHRNKTW ELVDLPKGRK AIGCKWVYKI
     KRDGNDQVER YRARLVVKGY AQKAGIDFNE IFSPVVRLTT IRVVLAMCAA FNLHLEQLDV
     KTAFLHGELQ EEIYMLQPEG FKEQGKENLV CRLTKSLYGL KQAPRCWYKR FDSFIISLGY
     NRLSSDHCTY YNRFDDNDFI ILLLYVDDML VVGPNKDQIQ ELKAQLAREF DMKDLGPANK
     ILGMQIHRDR KNRRIWLSQK NYLLKVLRRF NMQDCKPIST PLPVNYKLSS SMSPSNEAER
     MEMSRVPYAS AVGSLMYAMI CTRPDIAQAV GAVSRFMADP GREHWSIVKR ILRYIKGTSD
     VALCFEGSEF VVRGYVDSDF AGDLDKRKST TGYVFTLAGG AVSWLSKLQT VVALSTTEAE
     YMAATQACKE AIWIQRLLEE LGHKQEKITV YCDSQSALHI ARNPAFHSRT KHIGVQYHFV
     REVVEEGKVD MQKIHTKDNI ADVMTKPVNT DKFTWCRSLF GLLKT
//
DBGET integrated database retrieval system