GenomeNet

Database: UniProt
Entry: G9P1K2_HYPAI
LinkDB: G9P1K2_HYPAI
Original site: G9P1K2_HYPAI 
ID   G9P1K2_HYPAI            Unreviewed;       777 AA.
AC   G9P1K2;
DT   22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT   22-FEB-2012, sequence version 1.
DT   27-MAR-2024, entry version 58.
DE   RecName: Full=Pre-mRNA splicing factor CEF1 {ECO:0008006|Google:ProtNLM};
GN   ORFNames=TRIATDRAFT_33700 {ECO:0000313|EMBL:EHK43336.1};
OS   Hypocrea atroviridis (strain ATCC 20476 / IMI 206040) (Trichoderma
OS   atroviride).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC   Hypocreomycetidae; Hypocreales; Hypocreaceae; Trichoderma.
OX   NCBI_TaxID=452589 {ECO:0000313|EMBL:EHK43336.1, ECO:0000313|Proteomes:UP000005426};
RN   [1] {ECO:0000313|EMBL:EHK43336.1, ECO:0000313|Proteomes:UP000005426}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 20476 / IMI 206040 {ECO:0000313|Proteomes:UP000005426};
RX   PubMed=21501500; DOI=10.1186/gb-2011-12-4-r40;
RA   Kubicek C.P., Herrera-Estrella A., Seidl-Seiboth V., Martinez D.A.,
RA   Druzhinina I.S., Thon M., Zeilinger S., Casas-Flores S., Horwitz B.A.,
RA   Mukherjee P.K., Mukherjee M., Kredics L., Alcaraz L.D., Aerts A., Antal Z.,
RA   Atanasova L., Cervantes-Badillo M.G., Challacombe J., Chertkov O.,
RA   McCluskey K., Coulpier F., Deshpande N., von Doehren H., Ebbole D.J.,
RA   Esquivel-Naranjo E.U., Fekete E., Flipphi M., Glaser F.,
RA   Gomez-Rodriguez E.Y., Gruber S., Han C., Henrissat B., Hermosa R.,
RA   Hernandez-Onate M., Karaffa L., Kosti I., Le Crom S., Lindquist E.,
RA   Lucas S., Luebeck M., Luebeck P.S., Margeot A., Metz B., Misra M.,
RA   Nevalainen H., Omann M., Packer N., Perrone G., Uresti-Rivera E.E.,
RA   Salamov A., Schmoll M., Seiboth B., Shapiro H., Sukno S.,
RA   Tamayo-Ramos J.A., Tisch D., Wiest A., Wilkinson H.H., Zhang M.,
RA   Coutinho P.M., Kenerley C.M., Monte E., Baker S.E., Grigoriev I.V.;
RT   "Comparative genome sequence analysis underscores mycoparasitism as the
RT   ancestral life style of Trichoderma.";
RL   Genome Biol. 12:R40.1-R40.15(2011).
CC   -!- SIMILARITY: Belongs to the CEF1 family.
CC       {ECO:0000256|ARBA:ARBA00010506}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EHK43336.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ABDG02000026; EHK43336.1; -; Genomic_DNA.
DR   RefSeq; XP_013940603.1; XM_014085128.1.
DR   AlphaFoldDB; G9P1K2; -.
DR   STRING; 452589.G9P1K2; -.
DR   GeneID; 25783882; -.
DR   eggNOG; KOG0050; Eukaryota.
DR   HOGENOM; CLU_009082_0_0_1; -.
DR   OMA; KMGMAGE; -.
DR   OrthoDB; 131128at2759; -.
DR   Proteomes; UP000005426; Unassembled WGS sequence.
DR   GO; GO:0000974; C:Prp19 complex; IEA:InterPro.
DR   GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR   CDD; cd00167; SANT; 1.
DR   CDD; cd11659; SANT_CDC5_II; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR   InterPro; IPR047242; CDC5L/Cef1.
DR   InterPro; IPR021786; Cdc5p/Cef1_C.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017930; Myb_dom.
DR   InterPro; IPR001005; SANT/Myb.
DR   InterPro; IPR047240; SANT_CDC5L_II.
DR   PANTHER; PTHR45885; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR   PANTHER; PTHR45885:SF1; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR   Pfam; PF11831; Myb_Cef; 1.
DR   Pfam; PF13921; Myb_DNA-bind_6; 1.
DR   SMART; SM00717; SANT; 2.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS51294; HTH_MYB; 2.
DR   PROSITE; PS50090; MYB_LIKE; 2.
PE   3: Inferred from homology;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW   mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005426};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT   DOMAIN          1..56
FT                   /note="HTH myb-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51294"
FT   DOMAIN          1..52
FT                   /note="Myb-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50090"
FT   DOMAIN          53..102
FT                   /note="Myb-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50090"
FT   DOMAIN          57..106
FT                   /note="HTH myb-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51294"
FT   REGION          110..192
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          250..319
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          507..532
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          737..771
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        126..166
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        250..286
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   777 AA;  86501 MW;  C8AB328E7136437E CRC64;
     MPVVKGGVWT NIEDEILKAS VSKYGLNQWA RVSSLLARKT AKQCKARWNE WLDPSIKKIE
     WSKEEDEKLL HLAKIMPTQW RTIAPIVGRT ANQCLERYQK LLDEAEARES SGLGLMGPDG
     GETQAPSADD VRRLRPGELD PDPETKPARP DTIDLDEDEK EMLSEARARL ANTQGKKAKR
     KARERQQEES RRLATLQKRR ELKTAGINIK VTTRKQGEMD YNADIPFERK AAAGFYDTSE
     EKVKNDLQRA AFDPRKQQLA SKRKGDGDED NERKRRKNDK EGISESQKAA IKAGQMQRIR
     EAEQSSKRRP LNLPAPQVGD GELEDIVKMG KMGEAANSLA RESDNDATRG FVNSYSTLNT
     NAPIRTPRAP AQEDHIANEI RNIRALNDTQ SALLGGENTP LHQGAGSTGF EGIAPRKHVM
     ATPNPLATPL RNGGANGAAP GQTPMRTPRD TFALNQEDGM SMTGATPRDI RNREMAMRNQ
     LRAGLAALPK PKDTEWEFEI PDEQKETVAA DDAMEEDAAD RDRRERERRE AAEALERRRR
     TQVMQRGLPR PVVVDLTDLM KRAKAIDDPS AALIAAETAA LMAHDAIKFP LSGSQVKGKP
     SPLAQIDDSS LADARLRIIS ETKPLPSFED IQAAFESRAN GDSLLLGLGC YNDDEDEQDA
     AMRAAFDSVQ DSIMASAEED AKLEKKLTLH LGGYQKRQKM LKDKVSDAAD ALDKAKVALS
     GFKILAINED TAINRRLASL RDEVNFISRR EREAQEEYQK AKEELEALRS ANINGYH
//
DBGET integrated database retrieval system