ID A0A0K9PUF6_ZOSMR Unreviewed; 992 AA.
AC A0A0K9PUF6;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=Pre-mRNA splicing factor {ECO:0000313|EMBL:KMZ71870.1};
GN ORFNames=ZOSMA_173G00220 {ECO:0000313|EMBL:KMZ71870.1};
OS Zostera marina (Eelgrass).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Zosteraceae; Zostera.
OX NCBI_TaxID=29655 {ECO:0000313|EMBL:KMZ71870.1, ECO:0000313|Proteomes:UP000036987};
RN [1] {ECO:0000313|Proteomes:UP000036987}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Finnish {ECO:0000313|Proteomes:UP000036987};
RX PubMed=26814964; DOI=10.1038/nature16548;
RA Olsen J.L., Rouze P., Verhelst B., Lin Y.-C., Bayer T., Collen J.,
RA Dattolo E., De Paoli E., Dittami S., Maumus F., Michel G., Kersting A.,
RA Lauritano C., Lohaus R., Toepel M., Tonon T., Vanneste K., Amirebrahimi M.,
RA Brakel J., Bostroem C., Chovatia M., Grimwood J., Jenkins J.W.,
RA Jueterbock A., Mraz A., Stam W.T., Tice H., Bornberg-Bauer E., Green P.J.,
RA Pearson G.A., Procaccini G., Duarte C.M., Schmutz J., Reusch T.B.H.,
RA Van de Peer Y.;
RT "The genome of the seagrass Zostera marina reveals angiosperm adaptation to
RT the sea.";
RL Nature 530:331-335(2016).
CC -!- SIMILARITY: Belongs to the CEF1 family.
CC {ECO:0000256|ARBA:ARBA00010506}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KMZ71870.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LFYR01000653; KMZ71870.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0K9PUF6; -.
DR STRING; 29655.A0A0K9PUF6; -.
DR OMA; KMGMAGE; -.
DR OrthoDB; 131128at2759; -.
DR Proteomes; UP000036987; Unassembled WGS sequence.
DR GO; GO:0000974; C:Prp19 complex; IBA:GO_Central.
DR GO; GO:0005681; C:spliceosomal complex; IBA:GO_Central.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IBA:GO_Central.
DR CDD; cd00167; SANT; 1.
DR CDD; cd11659; SANT_CDC5_II; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR047242; CDC5L/Cef1.
DR InterPro; IPR021786; Cdc5p/Cef1_C.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR047240; SANT_CDC5L_II.
DR PANTHER; PTHR45885; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR PANTHER; PTHR45885:SF1; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR Pfam; PF11831; Myb_Cef; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51294; HTH_MYB; 2.
DR PROSITE; PS50090; MYB_LIKE; 2.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000036987};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 2..57
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 2..53
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 54..103
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 58..107
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 113..150
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 934..956
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 969..992
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 848..904
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 934..954
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 992 AA; 113475 MW; F9338FFAF7D41FCB CRC64;
MRIMIKGGVW KNTEDEILKA AVMKYGKNQW ARISSLLVRK SAKQCKARWY EWLDPSIKKT
EWTREEDEKL LHLARLMPTQ WRTIAPIVGR TPSQCLERYE KLLDAACTKD ENYEANEDPR
KLRPGEIDPN PESKPARPDP VDMDEDEKEM LSEARARLAN TRGKKAKRKA REKQLEEARR
LASLQKRREL KAAGINNRKR RKRKGIDYNA EIPFEKRPPP GFFDTVDEER VVEQPRFPTT
IEELEGRRRV DIENQLRKQD IAKNKIAERQ DAPSAIMQAN KLNDPEAVRK RAKLMLPAPQ
ISDHELEEIA KMGYASDLLS GNEYIAEGSG ATRALLANYS QTPQRGMTPF RTPQRTPLAK
EDSVMMEAEN LARLRESQTP LLGGENPELH PSDFSGITPR KREIQTPNPM ATPLISPGPI
DLTPRIGMTP SRDGYSFGMT PRGTPIRDQL RINEDMDLQD SAKHEFRKQV EMRRNLRSGL
SSLPQPKNDY QIVINPIPEE KEEPEIKIEE DMSDKLARER AEEQARQDAL LKKRSKVLQR
ELPRPPAISM EIIRNLLIKG GEDKSSFVPP TLFEQADEMI NMELLTLLEH DNAKYPLDKK
AEKVKNKGSK RMANGKSFTS IPEIEDFDNN ELQEADFLIK EEVLFLREAM GHENESLDDF
IKARDSCQDD LMYFPSRGTY GLASVTSNNE KVEALHYEFE NIKKRLDDEA KKATKFEQKI
KVLTHGHQTF AGKLWFEIEA TFKQMDMAAS ELECFEILQK QEKVAASYRI KSIKEELNRQ
KDLEVILQNR YGKLLAEQSR TQKLVDDHRA HLQIEEEIAA NNKVIEEELA ANNRAIEEEL
AARNLALEDE SAAKNQSVEE EMSQAVEEEI VEVERVSVHV QNVEEELAAK NQVVEEEMTQ
AVEEQITRAV EEQTNAGESV PVYVYANIED DNPVKKQVDE KPEFDGDSAS KAETESCLEE
ISLAAATKKV TCNKQSVENE TTTNDTIGEP NL
//