ID A0A428QZ22_9HYPO Unreviewed; 982 AA.
AC A0A428QZ22;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=CCAAT-binding factor domain-containing protein {ECO:0000259|Pfam:PF03914};
GN ORFNames=CEP54_001715 {ECO:0000313|EMBL:RSL70565.1};
OS Fusarium duplospermum.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Nectriaceae; Fusarium;
OC Fusarium solani species complex.
OX NCBI_TaxID=1325734 {ECO:0000313|EMBL:RSL70565.1, ECO:0000313|Proteomes:UP000288168};
RN [1] {ECO:0000313|EMBL:RSL70565.1, ECO:0000313|Proteomes:UP000288168}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NRRL62584 {ECO:0000313|EMBL:RSL70565.1,
RC ECO:0000313|Proteomes:UP000288168};
RA Stajich J.E., Carrillo J., Kijimoto T., Eskalen A., O'Donnell K.,
RA Kasson M.;
RT "Comparative genomic analysis of Ambrosia Fusarium Clade fungi.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the CBF/MAK21 family.
CC {ECO:0000256|ARBA:ARBA00007797}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RSL70565.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NKCI01000009; RSL70565.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A428QZ22; -.
DR STRING; 1325734.A0A428QZ22; -.
DR Proteomes; UP000288168; Unassembled WGS sequence.
DR GO; GO:0043231; C:intracellular membrane-bounded organelle; IEA:UniProt.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR005612; CCAAT-binding_factor.
DR InterPro; IPR040155; CEBPZ/Mak21-like.
DR PANTHER; PTHR12048; CCAAT-BINDING FACTOR-RELATED; 1.
DR PANTHER; PTHR12048:SF0; CCAAT_ENHANCER-BINDING PROTEIN ZETA; 1.
DR Pfam; PF03914; CBF; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000288168}.
FT DOMAIN 591..744
FT /note="CCAAT-binding factor"
FT /evidence="ECO:0000259|Pfam:PF03914"
FT REGION 1..76
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 126..180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 503..543
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 690..716
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 875..959
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 26..76
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 134..172
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 694..716
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 911..937
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 939..959
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 982 AA; 109197 MW; 40AD0E084046E44B CRC64;
MAKPSKKGKG PRRSLNGSGL DEKSLDKLTT SIDKKLTSND HKRKQPPTKA AGDQHQKRQR
NSEGSASDKK GSKIDNKTLL EEIKSLGGDE RDLELIQDID SEDEAVAQDS KAVDKSLKDE
LAAFSKQLGF EEVEVSEASD DEEEAEEEED DDEEEGDEDE DEFEEDEEEE EETKDVIPKK
VGNLAFEPRA DWHAAELRKL PGPNSDGPSS FKGSIEALKQ YAQQLLEEDA AKYRTSVFAS
SSHKFLSTIM SSGTLTDKVS ALTLAVQESP VHNIRAFDAL MSLASKKSRG QALGAIGALV
DLLGPGTLLP ADRRLRTFQT QPGLLGTLQR TPAATWTSDK PLPGKITSSH LISWIYEDWL
KETYFRIIQL LESWCSDEID YSRTRALDFV YGLLKEKPEQ EANLLRLLVN KLGDRDRKIS
SRASYLLLQM QNSHPGMKPI IVRTIEQEIL LHPSQDHRSK YYAINTLNQT ILGTKEPALA
ETLMRIYFEL FTVILKTGSL GITPTESKPA EEPEDEEAIK KKLARRRKPR GKPNKPVATE
PETEAADKLV SAILTGVNRA APFMVGNEAA MESHLDTLFK IAHSANFNTG IQALLLIQQI
SSSKNLANDR FYKTLYESLL DPRLITSSKQ ALYLNLLLRA LKNDVDSRRV KAFAKRMVQI
AGLHQPAFTC GLLYVISHLR ETFPDLSTLL DEPEESSLDD KPGSERPVYD GRKRDPEYSN
ANQSCLWEVI PLQGHYHPSV TLYASSIVSP NEKAQKPDLD SHSLIRFLDK FVYRNPKAAD
ASKGVSIMQP LRAAKDLGDI WLGSRGPAAS TPSVNSAAFW KKKAEDVAAE DIFFHEYFQQ
VDKESRETKK KAQAEGDGEE NDEQEDEIWK ALVSTQPGVD PDDEGSDVGF DLDDSDMASE
AGSSPALSLD SDMGEDDDDM SVDIEGSDDE MGGAMLSEDE DGFEVKEAKS EKPKSRRRQL
KDLPMFASVD DYAELLAGEE DM
//