ID A0A2T9YLG6_9FUNG Unreviewed; 805 AA.
AC A0A2T9YLG6;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 13-SEP-2023, entry version 23.
DE RecName: Full=Pre-mRNA-splicing factor CEF1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=BB559_003404 {ECO:0000313|EMBL:PVU93159.1};
OS Furculomyces boomerangus.
OC Eukaryota; Fungi; Fungi incertae sedis; Zoopagomycota; Kickxellomycotina;
OC Harpellomycetes; Harpellales; Harpellaceae; Furculomyces.
OX NCBI_TaxID=61424 {ECO:0000313|EMBL:PVU93159.1, ECO:0000313|Proteomes:UP000245699};
RN [1] {ECO:0000313|EMBL:PVU93159.1, ECO:0000313|Proteomes:UP000245699}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AUS-77-4 {ECO:0000313|EMBL:PVU93159.1,
RC ECO:0000313|Proteomes:UP000245699};
RX PubMed=29764946;
RA Wang Y., Stata M., Wang W., Stajich J.E., White M.M., Moncalvo J.M.;
RT "Comparative Genomics Reveals the Core Gene Toolbox for the Fungus-Insect
RT Symbiosis.";
RL MBio 9:e00636-e00618(2018).
CC -!- SIMILARITY: Belongs to the CEF1 family.
CC {ECO:0000256|ARBA:ARBA00010506}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PVU93159.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MBFT01000332; PVU93159.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2T9YLG6; -.
DR STRING; 61424.A0A2T9YLG6; -.
DR Proteomes; UP000245699; Unassembled WGS sequence.
DR GO; GO:0000974; C:Prp19 complex; IEA:InterPro.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00167; SANT; 1.
DR CDD; cd11659; SANT_CDC5_II; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR047242; CDC5L/Cef1.
DR InterPro; IPR021786; Cdc5p/Cef1_C.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR047240; SANT_CDC5L_II.
DR PANTHER; PTHR45885; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR PANTHER; PTHR45885:SF1; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR Pfam; PF11831; Myb_Cef; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51294; HTH_MYB; 2.
DR PROSITE; PS50090; MYB_LIKE; 2.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000245699};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 2..57
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 2..53
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 54..103
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 58..107
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 111..130
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 138..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 210..253
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 223..253
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 805 AA; 90994 MW; FBDEC09618978B75 CRC64;
MRVIVKGGVW KNTEDEILKA AVMKYGKNQW ARISSLLVRK TPKQCKARWF EWLDPSIKKT
EWSKDEDEKL LHLAKLMPTQ WRTIAPIVGR TPAQCLERYQ KLLDDAEAHM SGTKDDLSLQ
GPQGGEFGDI TAEDIRRLRP GEIDPDPESK PARPDPVDMD EDEKEIRELK AAGIEIKKTK
KRKHMDYNTD IPFEKRPAPG FYETYDELKS KGLTQGPGSK KLQNIEPKKR WEREEDALKK
QQEKKKKKDG KDGSDAIAFV RSKLDAAILE KEQLERLSSR KKLILPQPQV SDAELETIAR
IAQQGEQAQR FVESDNIASQ ILLGDYSSHG RQSITARTPQ LPAQTDSVMA EAINLVGLTS
QQTPLLGEEN TPFHTGSGTG FEGITPASRT VQTPNPLMTP LRMNIDGSST VGKGEYTNRV
PRTPYHDELG LNTPLLGIDS VSGTPNINEI NKKRLQKSLS AKLSSLPAPK NQFEIVIPDL
KDIEASELSK NSRVEEEYIE DREIAEKKLA MKRAEEDKKR LERRSTPVKM DLPRPLFFDG
KSLKRLNSFN KLKSSLSKSI DKDDISAALE LISLEMAHLI TSDGSEHPYI ATKYNTSRQS
ITPITKLPDQ GNSTIIGDQE IIDARQLIQK EYEILKEELN LDSKKTDGGN NESLTVDQIS
TLPINTESKH VFVPSKQSFI SMSDISSEDY IESCKNTISQ LHDQMVKDAT KATKMEKKQN
ILLGGYLERS KVLSKKIMES FKELEDSKLA QNSFRFLQAC EQTSIPIRIQ TLQEQVGRLG
IIENDLQQKY KDLFDRYNSL HVAEN
//