ID A0A428QS89_9HYPO Unreviewed; 979 AA.
AC A0A428QS89;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE RecName: Full=Transcription factor domain-containing protein {ECO:0000259|SMART:SM00906};
GN ORFNames=CEP53_002763 {ECO:0000313|EMBL:RSL68111.1};
OS Fusarium sp. AF-6.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Nectriaceae; Fusarium;
OC Fusarium solani species complex.
OX NCBI_TaxID=1325737 {ECO:0000313|EMBL:RSL68111.1, ECO:0000313|Proteomes:UP000287544};
RN [1] {ECO:0000313|EMBL:RSL68111.1, ECO:0000313|Proteomes:UP000287544}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NRRL62590 {ECO:0000313|EMBL:RSL68111.1,
RC ECO:0000313|Proteomes:UP000287544};
RA Stajich J.E., Carrillo J., Kijimoto T., Eskalen A., O'Donnell K.,
RA Kasson M.;
RT "Comparative genomic analysis of Ambrosia Fusariam Clade fungi.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the HpcH/HpaI aldolase family.
CC {ECO:0000256|ARBA:ARBA00005568}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RSL68111.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NKCJ01000038; RSL68111.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A428QS89; -.
DR STRING; 1325737.A0A428QS89; -.
DR Proteomes; UP000287544; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0016829; F:lyase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR CDD; cd12148; fungal_TF_MHR; 1.
DR Gene3D; 3.20.20.60; Phosphoenolpyruvate-binding domains; 1.
DR InterPro; IPR005000; Aldolase/citrate-lyase_domain.
DR InterPro; IPR015813; Pyrv/PenolPyrv_Kinase-like_dom.
DR InterPro; IPR040442; Pyrv_Kinase-like_dom_sf.
DR InterPro; IPR007219; Transcription_factor_dom_fun.
DR PANTHER; PTHR30502; 2-KETO-3-DEOXY-L-RHAMNONATE ALDOLASE; 1.
DR PANTHER; PTHR30502:SF0; PHOSPHOENOLPYRUVATE CARBOXYLASE FAMILY PROTEIN; 1.
DR Pfam; PF04082; Fungal_trans; 1.
DR Pfam; PF03328; HpcH_HpaI; 1.
DR SMART; SM00906; Fungal_trans; 1.
DR SUPFAM; SSF51621; Phosphoenolpyruvate/pyruvate domain; 1.
PE 3: Inferred from homology;
KW Lyase {ECO:0000256|ARBA:ARBA00023239};
KW Reference proteome {ECO:0000313|Proteomes:UP000287544}.
FT DOMAIN 585..658
FT /note="Transcription factor"
FT /evidence="ECO:0000259|SMART:SM00906"
FT REGION 317..419
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 871..910
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 328..350
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 362..419
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 979 AA; 108257 MW; 78EC43534C96C559 CRC64;
MASVQYTNSL LKCVARKEVC KSFGIKLLTS PQIVQTARHA GFDSLFIDLE HAWLTLAEAS
NLCNVGHLAG ITPFVRVPHQ CGNGFVQRVL DGGAMGVIFP HIESADEAKA AVKISKYPPY
GCRSMTGAMP LFNMRPTPLK EAIEFGNNSG STVFAMIESK NAVNNSEEIA AVEGVDVLLI
GSFDLSIDLG VGGNWDSKEY RTSVEKVSQV CRKHNKIFGV AGIYDNPTLH EWFINTLGAR
FMLVQQDLSL IAGGGQRAVR AIPLLSYMPL KKGQMLWNRA VSVLLEARTI MLDQPTWSTA
NILSDIAELE SRLARYESEG MEPLPNNGTR PRSESSGLLD PGSPTSGLSL GSPVLEADMP
MAQDDQSTTP RSITSQIPNT SNEQHSIPET QTQTPSFETR SHHHTSTIAA ESSLSSSNEF
GRKVHEVLTN SGPSSARTIP ISPNPVQPMV NHSSPFRVST QVIPQLPSEE EAFQHLETVG
FYIGQTQCHY DLRGLTDRIG WLYENMHHPQ THELWYMQVL LVLAIGQIFK ADGEEEGNLP
GTAFFEFVEQ NLPTASAQYR LGRLAVEVNA LMAMYLQMAN RKEEAYLYIN TALRLAILHG
YHQKDSERNL LRSEKAQINR LWWTVYMQER RLAAANGKPS GIIDSVISMS LPSEAPGFPT
GTAIRTNIKI AQVTGQVITI LYGTKFKKEQ DFVSHAQQII KSLADIAKEI PSEQSLSLCG
NSELALRTSA SLHLMLYQAT LLTIRPLMLH AAQMILSGQP CNELEGGSLD TLSKTCSEAA
RRLLEVIIAL KRKGILPIFG FFDCDAIFSA AFIMLLTMIF DSACEPSQRL NPTPGLKDAM
DMLHYMAEHG NTFARQRFQE VQSVRDHLSA ALNSREANTP TSTTNRTTSV SEQQLGSDTS
GVRSAQPSTH EYPSWYPPMW DMSDQWLHSM DLNGELEDLP LGDSFDQYQS LLNDPDWSLT
GQDVGDFAEL RRHVLRLNP
//