ID A0A151REI5_CAJCA Unreviewed; 951 AA.
AC A0A151REI5;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=Transposon Ty3-I Gag-Pol polyprotein {ECO:0000313|EMBL:KYP40889.1};
GN ORFNames=KK1_037746 {ECO:0000313|EMBL:KYP40889.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP40889.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP40889.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ483807; KYP40889.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151REI5; -.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR35046:SF9; CCHC-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR35046; ZINC KNUCKLE (CCHC-TYPE) FAMILY PROTEIN; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 632..811
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 306..333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 371..393
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 501..523
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 42..69
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 306..331
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 504..519
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 951 AA; 110672 MW; 2E6B0A3C0917C679 CRC64;
MSNPSQSDSD LESPKSFSQI VKEVEALKLW RKQEVILKNK EKIEREVQLV ALEEEIHKIK
QQEVKLLEKL NKKKSHKSSH GSISQEDVGS LNVDEYYQPT PRRIIREPKV RESQVDLPPF
HGKEDVDAYL DWEMKVEQIF TCHQVGEERK FPLATLAFQG QAMYWWTTLV RERRLHNDPL
IEYWNDLRSA MRIRHIPSYY SRELMDKLQR LQQKNLSVEE YRQKMELYLM RAGIREEERL
TIARFFSGLN FDIRDRVELL PYRDLDDLVQ LCIRVEQQHL RKNSFKKEKT QSNSYVKKDY
KREGQFSKYD SSKNSSKGQE KEKDREKNKN AITSSSKSSE IKCFKCLGRS HIASECPNKK
VMILRGQDIY SSHDESSSTT SSDSETSEED HQVERAYPYD GQLLMIRRLL GSQPNESHIS
QRENIFHIRC KIFDKACSLI VDSGSCCNCC STRLIEKLDL TPIPHPKPYQ LHWLNEDGDI
IVDKQMKVKF SIGNYEDHVK KKKKVEKKKE KKHKSLKKKK EEKPLPLEGI QQEEILKQTL
LVEKSSYILL CRSMLRCHNL NLGPSSLPIE VSQLLKEFDD VFPSEGPKGL PPFRDIEHQI
DFVPGASLPN RLAYRTNPQE TKEIENQVQE LLDKGWVQKS LSPCVVLVLL VPKKDGKWRM
CCDCRAINNI TIKYRHPIPR LDDMLDELHG ATIFSKIDLK SGYHQIRIKE GDEWKIAFNT
KFGLYEWLFM PFGLTSAPST FMRLMNHVLR ECIGKFVVVY FDDILIYSRS QGEHLGHLRE
VLLILRNYHL FANVKKFTFC VDYVVFLGFI VSKEGVRVDP EKIKVIQEWP QPKNVSEVRS
FHGLASFYRR FVPNFSSIAS PLNELVKKDV PFTFNVSDLK PFVGASDIED ESLDSRMNPF
QKGSNDGRAW TKGPTTRVMA RRLLEDLTAL ELSGPNGLGG PKVLFTWALL E
//