ID A0A151R9M2_CAJCA Unreviewed; 790 AA.
AC A0A151R9M2;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 22-FEB-2023, entry version 23.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon 17.6 {ECO:0000313|EMBL:KYP39237.1};
GN ORFNames=KK1_039482 {ECO:0000313|EMBL:KYP39237.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP39237.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP39237.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ483924; KYP39237.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151R9M2; -.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR24559:SF425; RT_RNASEH DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 2.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 218..234
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 239..254
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 500..679
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 164..190
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 165..190
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 790 AA; 89825 MW; 7F2CD42E26BA00D8 CRC64;
MNWLTEIEKI FNVMDCPLTQ KVKLATFMLT ADAHFWWEGA LRRIIDGGVH LNWDNFKRVF
LEKYFPDDVR SLKEMEFLEH KQGNDMVEAY VTKFDALVRY YTHYHGEGGE RAKCIKFVNG
LRPEVKTVIN YQEIYHFPTL VNKCSIYDRD NRARAAFYKG AGGPTRTVNP STSGRSKPYS
TPTRFQGSMA TTNRSKPFIR ESTVNTTGSV GGSTSSGRCG KCGRVGHNKS ECRNKEITCF
NCNGKGHIST QCPEPPRTRV TGSGSQVERP KTMGRVFALS GVEAAWSKNL IQGTCFIAET
PFVVLFDSGA THSFISISCV QKLNLPVSLL NFDLVVETPT NGPMTTFSVC LKCPLTISDR
QFLIDLICLP LSRLNVILGM DWLSSHHVLL NCFDKSISFG ESNSTEFLSA ADIKTCLKEN
ERVYMILASL TIETDSKLDE IPLVREFPKV FPNDVSSLPP EREIEFSIDL VSGTSPISIA
PYRMSPKELV ELKKQIEELQ EKQFIRPSVS PWGAPVLLVK KKDGSMRLCV DYRQLNKVTI
KNKYPLPRID DLMDQLDGAC VFSKIDLRPE YHQIRVRAED VPKTAFRTRY GHYKYLVMPF
GVTNAPGVFM DYMNRIFHPY LDRFVVVFID DILVYSKTRE EHAEHLRIVL QTLKEKQLYA
KLSKCEFWLE SVNFLGHVIS KGGIVVDPAK VEAVLEWKAP KSVSEIRSFL RLAGYYRRFI
ENFSRIALPL TKLTKKNQPF VWDSRCEESF LELKRRLTSA PVLVLPDPSK TFKVFCDASK
LGLGGVLMQE
//