ID A0A151QNH5_CAJCA Unreviewed; 1340 AA.
AC A0A151QNH5;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:KYP31826.1};
DE EC=1.2.1.27 {ECO:0000313|EMBL:KYP31826.1};
GN ORFNames=KK1_047663 {ECO:0000313|EMBL:KYP31826.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP31826.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP31826.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ485638; KYP31826.1; -; Genomic_DNA.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0004491; F:methylmalonate-semialdehyde dehydrogenase (acylating, NAD) activity; IEA:UniProtKB-EC.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Oxidoreductase {ECO:0000313|EMBL:KYP31826.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 261..275
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 500..681
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 204..248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 211..245
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1340 AA; 152572 MW; 9F6FB672F9AE2476 CRC64;
MGSTGTNFPA NLPVLNGKNW DRWRVQMKAI LGYQEVAEIV EEGYPTLTKD STDAQKALYR
ENKKKDCKAT FLIHQCVDEA HFEKIAGAAT SQEAWKILEK CSEGAEQLKK VRLQTMRRQY
ELMQMENNEK IAEFFNRIIT HTNAMKNCGE KITDQTIVEK ILRTLDPKFD HIVVAIEESK
KLEELKVEEL QGSLEAHEQR LIERGSVKSD DHQALQAQTS KKGRYNSKGN FRGRGQNSNR
RGSFSNWRGG KKKVIDRKRI KCFNCNRIGH FSAECEAAPG RTDQRGSQSH GDYQAHMAKE
DNEANLEEQP LMLMMITNPE SYNNEEWYID SGCSNHMTGH RDWFVNFDPK KKSTVKFADN
RATQVEGSGN VLVKREDGRQ TVITEVLYVP GMTTNLISLG QLLEKGCSVN SVKGFLEIYD
KTKRLVMKAP LAKNRTFKVS LNTIESQCLS AAMLSDDSWL WHRRLGHLNF RDLSLLKSKE
MLTGLPSIKI PKKICDNCLI SKQPRNSFSN FTASKANEVL HVVYSDVCGP IDTPSLGGNR
YFVSFVDDLS RKAWLYLIKA KSDVFSIFKD FKALVEKQSG KCIKILRTDG GGEFTSGEFE
GFCKEHGIVH EVTAPYTPQH NGIAERRNRT ILNMVRSMLK EKNLPHSFWG EAAMTAVYVL
NRCPTKRLGS MVPEEAWSGS KPSVKHLRIF GSLCYRHVPD QRRKKLDDKS EAMIFVGYNS
TGSYKLFNPK NQQVLFSRDV YFDESSSWAE FQSTSDIIPK FQFEWKDEDL VGETHQEVVN
SELQMVADRP TRVRSFPLRL SDYQIYHDSA ITEEGDLVQH MALLTDMEPI TFEEAISKEV
WRSAMEEELK SIEKNDTWEM VNLPQNKKAI AVKWVFKTKF KSDGSIAKHK ARLVAKGFMQ
KEGIDYSEVF APVARLETVR LIVALASWRN WKLWQLDVKS AFLNGPLDEE VFVTQPPGFI
CKGKELKVLR LKKALYGLKQ APRAWNKRID SFLTGFGFQK CSVEHGVYIK TVSETEILVL
CLYVDDLLIT GSSLTAIESL KQGLKSEFEM TDLGILSYFL GIEFAYTEKG IFMHQRKYMS
EVLKRFKMLG CKPAETPAEL NVKLDKSEDE GSVDGTMFRQ IVGSLRFICH SRPEIAFSVG
LVSRFMSDPR QSHLVAAKRI MRYLRGTLSY GILFPHHTKG DDSLHLVAYS DSDWCGDLVD
RRSTMGQVFL LSGSPISWNS KKQTVVALST CEAEYIAACA AACQALWISS LLKELKMFTG
EAVDLLVDSK SAIDLAKNPV SHGRSKHIDT KFHFLRDQVS KGRIRLQHCR SEKQLADIMT
KSMKSERFKE LREFLNVVSL
//