ID V4MEK3_EUTSA Unreviewed; 848 AA.
AC V4MEK3;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=Homing endonuclease LAGLIDADG domain-containing protein {ECO:0000259|Pfam:PF03161};
GN ORFNames=EUTSA_v10022548mg {ECO:0000313|EMBL:ESQ50953.1};
OS Eutrema salsugineum (Saltwater cress) (Sisymbrium salsugineum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema.
OX NCBI_TaxID=72664 {ECO:0000313|EMBL:ESQ50953.1, ECO:0000313|Proteomes:UP000030689};
RN [1] {ECO:0000313|EMBL:ESQ50953.1, ECO:0000313|Proteomes:UP000030689}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23518688; DOI=10.3389/fpls.2013.00046;
RA Yang R., Jarvis D.E., Chen H., Beilstein M.A., Grimwood J., Jenkins J.,
RA Shu S., Prochnik S., Xin M., Ma C., Schmutz J., Wing R.A.,
RA Mitchell-Olds T., Schumaker K.S., Wang X.;
RT "The Reference Genome of the Halophytic Plant Eutrema salsugineum.";
RL Front. Plant Sci. 4:46-46(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI517392; ESQ50953.1; -; Genomic_DNA.
DR RefSeq; XP_006409500.1; XM_006409437.1.
DR AlphaFoldDB; V4MEK3; -.
DR STRING; 72664.V4MEK3; -.
DR EnsemblPlants; ESQ50953; ESQ50953; EUTSA_v10022548mg.
DR Gramene; ESQ50953; ESQ50953; EUTSA_v10022548mg.
DR KEGG; eus:EUTSA_v10022548mg; -.
DR eggNOG; KOG4197; Eukaryota.
DR OMA; MESYAQR; -.
DR Proteomes; UP000030689; Unassembled WGS sequence.
DR GO; GO:0009507; C:chloroplast; IEA:GOC.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0010239; P:chloroplast mRNA processing; IEA:EnsemblPlants.
DR GO; GO:0000373; P:Group II intron splicing; IEA:EnsemblPlants.
DR GO; GO:0048564; P:photosystem I assembly; IEA:EnsemblPlants.
DR GO; GO:0006388; P:tRNA splicing, via endonucleolytic cleavage and ligation; IEA:EnsemblPlants.
DR Gene3D; 3.10.28.10; Homing endonucleases; 2.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 3.
DR InterPro; IPR027434; Homing_endonucl.
DR InterPro; IPR004860; LAGLIDADG_2.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 2.
DR PANTHER; PTHR47539; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN OTP51, CHLOROPLASTIC; 1.
DR PANTHER; PTHR47539:SF1; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN OTP51, CHLOROPLASTIC; 1.
DR Pfam; PF03161; LAGLIDADG_2; 1.
DR Pfam; PF01535; PPR; 4.
DR SUPFAM; SSF81901; HCP-like; 1.
DR SUPFAM; SSF55608; Homing endonucleases; 1.
DR PROSITE; PS51375; PPR; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000030689};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 455..489
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 524..558
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT DOMAIN 626..788
FT /note="Homing endonuclease LAGLIDADG"
FT /evidence="ECO:0000259|Pfam:PF03161"
FT REGION 806..834
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 813..834
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 848 AA; 97254 MW; 43CAA2D12CEEE6B0 CRC64;
MTKTNGRDAT TMIVSGARGF SSPLSVASSS SAAVTVNVTS FNVSSLSFNL NIIPCYLVRH
CSSSSSILLR PLSFSLLRHR SIYSRRSLRR LSFHGNQKPS FSQANSTAQR SVTFFEHVAG
IKESREKTID FGDLESARND ARNFATRRVE TDVEVRELED LPEEWRRSKL AWLCKEAPSH
KAVTLVRLLN AQKKWVRQED ATYIALHCMR IRENETGFRV YRWMTQQNWY VFDFGLATKL
ADFLGKERKF TKCREVFDDI LNQGRVPSES TFHILVVAYL SSSEQGCLEE ACSVYNRMIQ
LGGYKPRLSL HNSLFRALVS KQGVPSSDDL KQAEFIFHNV LTTGLELQKD VYTGLIWLHS
CQDEVDIDRI NSLREEMKEA DFRESKEVVV SLLRAYAKEG SVEEVEKTWL ELLDLDCGIP
SQAFVYKMEA YSKVGDSAKA LEIFREMEKY LGGATVSGYH KIIEVLCKVQ QVELAESLLK
EFVESGKKPL LPSYIEIVKM YFVLGLHEKL EMAFVECLEK CQPSQTIYNI YLDSLVKIGS
LEKAGDVFDE MKSNGTINVN ARSCNTLLKG YLDSENHVKA KKIYELMSLK KYEIEPPLME
KLDYILSLVR KEVKKPLSMK LSKEQREVLV GLLLGGLQIE SDKEMKSHKI KFEFRDNSQA
HVILRQHIHD QFREWLAPSS DLQEDIPFNF SSVSHSYFGF YAEHFWPKGR SEIPKLIHRW
LSPHSLAYWY MYSGFKTSSG DIILRLKGSL EGVEKVVKAL RGKSMECRVK KKGKVFWIGL
QGTNSAWFWK LIEPHVLEEM KDHLKPASES MNNDGDEEQS INFDTSSDNS SDDRIDYTYQ
MKVNNVEV
//