ID A0A151UH81_CAJCA Unreviewed; 958 AA.
AC A0A151UH81;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Retrotransposable element Tf2 {ECO:0000313|EMBL:KYP78647.1};
GN ORFNames=KK1_047294 {ECO:0000313|EMBL:KYP78647.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP78647.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP78647.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYP78647.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCT01056453; KYP78647.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151UH81; -.
DR OMA; MESERTI; -.
DR Proteomes; UP000075243; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR45835:SF105; IPP TRANSFERASE; 1.
DR PANTHER; PTHR45835; YALI0A06105P; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 2.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 213..229
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 234..249
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 581..748
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 248..267
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 958 AA; 108648 MW; CD0D6034C58DB0E1 CRC64;
MEIEKIFNAM ECPLAQKVRL ATFMLTTDAH FWWEGALQRM IDGGVQLNWD NFKRVFLEKY
LSGDVRSQKE VEFLELKQGN DSVAEYAAKF DALFRYCTLY HGEGGERAKC IKFVNGLCPE
IKIAINYQEM YHYPTLVNKS RIYDRDNRAR AIFYKGAGGP MRAASSSAPG KSKPYSAPAK
IQGNHAESRF VAGNSFVSGA ASVGGSVSTP TSRCKKCGLR GHEHYNCPDK EITCFNCNGK
GHISTQCPQP PRQRMMGTGA SSQPKRPKTT GRVFALSSAE AAQSDNLIQG ICFIAENPFV
VLFDSGATHS FISFACVQKL KLPVSYLSYD LVVETPTDGP ITTSSICLKC PIVIFERHFL
VDLICLPLSQ LDIILGMEWL SSNHVLLNCF EKSISFGESK CGEFLSANKI KTCLKENEKV
YMILACLTLE KDHEISEIPL VSANVVADAL SRKSLRIIKL GALRVTNILR DEIREGQKVD
PFLLSLVEKL NQGVESEFRV GVDGVLRFKD RLCIPSVPEL KRAILEEGHR SSLSIHPGAT
KMYQDLRKMF WWPKMKREVE EFVYACLICQ KAKVEHQRPS GLLQPLDVPV WKWDSISMDF
VVGLPRTVKN LDAIWVIVDR LTKSAHFIPI NIRYPLERLT KLYIGEIVRL HGVPTSIVSD
RDPRFTSRFW ESLHKALGTK LRLSSAYHPQ TDGQTERTIQ SLEDLLRACI LEHGGSWDSF
LPLIEFTYNN SYHSSIGMAP YEALYGRRCR TPLCWVEPGE NAILGPEIVQ QTTEKVRMIQ
DKMRASQSRQ KSYHDKRRKD LEFKEGDHVF LKVTLWTGVG RALKSRKLTP RFIGPYQVLK
RIGDVAYQIA LPPSLSNLHN VFHVSQLRKY IHDPSHIVES DNIQLKDNLT YETIPLRIED
KRVKQLRGKE IPLVKVVWGG NVEESATWEL ESQMQETYPF LFASGNFEDE ISKRRGEL
//