ID A0A151SZ90_CAJCA Unreviewed; 795 AA.
AC A0A151SZ90;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon 297 family {ECO:0000313|EMBL:KYP60112.1};
GN ORFNames=KK1_015560 {ECO:0000313|EMBL:KYP60112.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP60112.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP60112.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM003612; KYP60112.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151SZ90; -.
DR Proteomes; UP000075243; Chromosome 10.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR032567; LDOC1-rel.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR15503; LDOC1 RELATED; 1.
DR PANTHER; PTHR15503:SF22; TRANSPOSON TY3-I GAG POLYPROTEIN; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 31..119
FT /note="Retrotransposon gag"
FT /evidence="ECO:0000259|Pfam:PF03732"
FT DOMAIN 516..595
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|Pfam:PF00078"
FT DOMAIN 616..710
FT /note="Reverse transcriptase/retrotransposon-derived
FT protein RNase H-like"
FT /evidence="ECO:0000259|Pfam:PF17919"
FT REGION 158..181
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 795 AA; 91188 MW; F130A9C27C0F54A3 CRC64;
MFNGEDPVGW ITRAEIYFRV QETSEVVKVS LAQLCMEGGT IHFFNALLND QGDLSWEDLK
RELLDRYGGL GEGSVFKQLT AIRQNGSVDE YIQKFESLVA QILKLHDEQY FAYFIHGLQD
DIRSRLQSLH VANPLTRGRM MNTARAVEME LANRSNRWQG RNNTRGGGGS HLGYRTQTSV
SEGVGRNKIG QRDRGTKHLA YQDLLDRRQR GLCYKCGGPY GPLHQCPVKQ LRLILVDDEL
QVEYERDDNK VEQLKEEDKS ESEGECSSIC LHRLEKERNH PLNTMKLRGQ VRGIPLFVLV
DSGATHNFIS KKLVDVMGWS QESTKRMKIL MGDGHKSETS GVCRGLRVET TVGEFTVDAF
LFELGDIDMI LGMSWLVSLG EMVVDWNKQR MKIVTPHGTK VLEGALRGEP LLASLSETIP
KDEQIGGVEL KQEQQEALNE VLKRFRDVFV EPKGLPPIRA KEHAIILKAG QEPINVRLYR
YPYHQKNEIE HQVRELLTNG HIRHSQSEYS SPVILVKKKK NQWRMCVNYR APNKATILDK
FPIPVIEELP DELHGASFFS KLDLKSGYHQ VRMRREDIPK TAGFLGLTEY YRKFIRDYGK
VARPLTDLTK KDGFGWNEQA QRAFDELKKK VTTAPVLVLP NFEKEFELEC DASRMGIGAI
PMQERRPVAY FSKALGEKNL TKSAYEKELM AVALAIQHWR PYLLGRKFKV YSDQKSLRQL
MQQRVTTGSQ QNWLAKLLGY NFDIMYKPGV ENKGADALSR SMGEVELRAM SYQPIWLDYK
ELIAETHAED KWMLE
//