ID A0AA88XK92_9ASTE Unreviewed; 445 AA.
AC A0AA88XK92;
DT 27-MAR-2024, integrated into UniProtKB/TrEMBL.
DT 27-MAR-2024, sequence version 1.
DT 28-JAN-2026, entry version 8.
DE RecName: Full=Integrase zinc-binding domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=RJ639_028966 {ECO:0000313|EMBL:KAK3040520.1};
OS Escallonia herrerae.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Escalloniales; Escalloniaceae; Escallonia.
OX NCBI_TaxID=1293975 {ECO:0000313|EMBL:KAK3040520.1, ECO:0000313|Proteomes:UP001188597};
RN [1] {ECO:0000313|EMBL:KAK3040520.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=UCBG64.0493 {ECO:0000313|EMBL:KAK3040520.1};
RC TISSUE=Leaf {ECO:0000313|EMBL:KAK3040520.1};
RA Chanderbali A., Dervinis C., Anghel I., Soltis D., Soltis P., Zapata F.;
RT "Draft genome assemblies for two species of Escallonia (Escalloniales).";
RL Submitted (DEC-2022) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAK3040520.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JAVXUP010000052; KAK3040520.1; -; Genomic_DNA.
DR AlphaFoldDB; A0AA88XK92; -.
DR Proteomes; UP001188597; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0016779; F:nucleotidyltransferase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd05483; retropepsin_like_bacteria; 1.
DR FunFam; 1.10.340.70:FF:000001; Retrovirus-related Pol polyprotein from transposon gypsy-like Protein; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR034122; Retropepsin-like_bacterial.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR050951; Retrovirus_Pol_polyprotein.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR PANTHER; PTHR37984:SF5; PROTEIN NYNRIN-LIKE; 1.
DR Pfam; PF13650; Asp_protease_2; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
PE 4: Predicted;
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022759};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP001188597};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 21..95
FT /note="Retrotransposon gag"
FT /evidence="ECO:0000259|Pfam:PF03732"
FT DOMAIN 381..432
FT /note="Integrase zinc-binding"
FT /evidence="ECO:0000259|Pfam:PF17921"
FT REGION 124..182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 205..229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..172
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 206..221
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 445 AA; 50301 MW; CA33F9ED26166E14 CRC64;
MGSCLPSWSR MSTLLTAEIE YGSCVINTWD ILKRELKSQF FFSENTAFNV RKALLECKHT
SSVGEYCQAF SALMLDISDM SAVERLFFFM EGLKPWARTE LNRRRVNNLN EAIIAVDSLS
YYNSEPQRPP QRGNLSRSIG GKKLGGQAPN QSWGSKSSWV QRGAQSGSQS RHPPTDEQTD
VQDYEEEDAV GAFPQWCNAV TTQVRNPEES LTEEKPKDMP PKKKSNVPGK GLMYVDVKVN
GKAIRAMVDT GATHNYISST DVERLGLTLE KGCGRARWQE LLAEFNFMLE YRAGSTNNVA
DALNRRAELD QVALMAMNAI VGSDSRVAIN IGKKIRKALT RDPVAQQLLK LIERGKTRQF
WQEDGLLMTK GRRVYVPRVD DLRRTLIREC HDTLWAGHAG VERTLALLQQ GYYWPQMGDK
VQEYVKTCLT CQDKVERKKK AGLLQ
//