ID A0A2C5WSL3_9PEZI Unreviewed; 1177 AA.
AC A0A2C5WSL3;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Nitrogen regulatory protein areA {ECO:0000313|EMBL:PHH52049.1};
GN Name=AREA {ECO:0000313|EMBL:PHH52049.1};
GN ORFNames=CFIMG_004306RA {ECO:0000313|EMBL:PHH52049.1};
OS Ceratocystis fimbriata CBS 114723.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Microascales; Ceratocystidaceae; Ceratocystis.
OX NCBI_TaxID=1035309 {ECO:0000313|EMBL:PHH52049.1, ECO:0000313|Proteomes:UP000222788};
RN [1] {ECO:0000313|EMBL:PHH52049.1, ECO:0000313|Proteomes:UP000222788}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 114723 {ECO:0000313|EMBL:PHH52049.1,
RC ECO:0000313|Proteomes:UP000222788};
RX PubMed=23931120; DOI=10.1016/j.funbio.2013.06.004;
RA Simpson M.C., Wilken P.M., Coetzee M.P., Wingfield M.J., Wingfield B.D.;
RT "Analysis of microsatellite markers in the genome of the plant pathogen
RT Ceratocystis fimbriata.";
RL Fungal Biol. 117:545-555(2013).
RN [2] {ECO:0000313|EMBL:PHH52049.1, ECO:0000313|Proteomes:UP000222788}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 114723 {ECO:0000313|EMBL:PHH52049.1,
RC ECO:0000313|Proteomes:UP000222788};
RX PubMed=24563841; DOI=10.5598/imafungus.2013.04.02.14;
RA Wilken P.M., Steenkamp E.T., Wingfield M.J., de Beer Z.W., Wingfield B.D.;
RT "IMA Genome-F 1: Ceratocystis fimbriata: Draft nuclear genome sequence for
RT the plant pathogen, Ceratocystis fimbriata.";
RL IMA Fungus 4:357-358(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PHH52049.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APWK03000077; PHH52049.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2C5WSL3; -.
DR STRING; 1035309.A0A2C5WSL3; -.
DR OrthoDB; 318925at2759; -.
DR Proteomes; UP000222788; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 3.30.50.10; Erythroid Transcription Factor GATA-1, subunit A; 1.
DR InterPro; IPR013860; AreA_GATA.
DR InterPro; IPR039355; Transcription_factor_GATA.
DR InterPro; IPR000679; Znf_GATA.
DR InterPro; IPR013088; Znf_NHR/GATA.
DR PANTHER; PTHR10071:SF281; GATA-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR10071; TRANSCRIPTION FACTOR GATA FAMILY MEMBER; 1.
DR Pfam; PF08550; DUF1752; 1.
DR Pfam; PF00320; GATA; 1.
DR PRINTS; PR00619; GATAZNFINGER.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR PROSITE; PS00344; GATA_ZN_FINGER_1; 1.
DR PROSITE; PS50114; GATA_ZN_FINGER_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nitrate assimilation {ECO:0000256|ARBA:ARBA00023063};
KW Reference proteome {ECO:0000313|Proteomes:UP000222788};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00094}.
FT DOMAIN 875..928
FT /note="GATA-type"
FT /evidence="ECO:0000259|PROSITE:PS50114"
FT REGION 56..91
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 115..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 204..224
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 302..358
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 430..562
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 714..880
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 929..1107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1146..1166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 56..81
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 128..154
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 207..224
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 302..329
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 430..444
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 447..463
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..514
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 545..562
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 722..744
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 768..832
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 859..880
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 932..993
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1000..1014
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1053..1107
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1177 AA; 122935 MW; F6C577CEC49F7E98 CRC64;
MSTAIDSYRF TPMAESMDLF GATSRSAAQT PTSLSLSAFD LEFDYADDTQ LHASAQSHHM
LTASSSESAS VSAPTSTKHV HDSQSYNHRQ SHKHFDMAAA TLHLFDAGMP PMRSLQAAAP
SSLSPLPPHP NQTLSATTDD DDGSSKNAQS HKSLSPPSSP PPSLDELAMC VWKFFSQTKQ
SLPQQQRIEN MSWRMMHAGL RRLSMQAHAS KRSSSSMSSG TAVGSSTLSA HSVASTSLSS
NHSLSFHGPS GIAQQLQKTS DNYHSMQSGS SSERAAVAVD SMNLDDFICD SIAATPIPSS
HVITQSSLSP NSNKGGDSSL DNQGQGTHAL ASAIPIKPRK PASVSASASD SASGAGAGAG
IGSVSDDFAD AAAPVANSGS MIMSGESQAH GSLQTQLHQS AQNFVPQSLP YPLNHHGRLK
DEFNYVTRHH RKTSVDDRRT RKRPANFSPH VSATTSLDSS NNLEPESELH EYSLDNTSTA
SAPGTGTGTG TSTGTGSAAT STVMGTQQQQ QQQHQQHHHH SHHSSTTSVP FPLDTYPMEN
DPIITSAGPY QQAFPFSPST SPLMQHGPFS NAFGSINANG LNNRTEYYAP QTSNFQAPNN
STSHSIGETD PFFFGNIENR HHGSIRPTSH GHGISNSVNS QYIYNSTNNP MFPAAESTTT
FGSSFTGIDP SHVFPDQSGR SPSTISMPTD TMFSNFAADS DEEENTAFPD RLVMAPEFSP
GSSLEETGTT STSSAMNWDP TLPGQFQTHA ARFPGGPPRK NTMSSEADKP DWDSSSAPRS
QPIHGTTSTT TTTTNSSNRR SKALRAAPGT TSSQSRRNPF APSNSNSPPA DAPTSLPGGF
MSAQPSRQSS PPPPPKSAST TNLQTNSSSG ATADNSSPTT CTNCFTQTTP LWRRNPEGQP
LCNACGLFLK LHGVVRPLSL KTDVIKKRNR GTGAATPSST STRAGKKSGL SSRKNSTLAL
SSTPAAATVV TGATASTPPS NMRPGSATND GDSPGASAAG GKTAGSTPNG FPANVNNKVA
IPPASQPAAR VGTAQNRTST TSSKRQRRHS KSAGSDLVSA VSMPPLSSTS LSSTTTVTAM
SGSNSTQHHH SQSMDIDSPG SSDSADLMRT FSASHGLGPS IGSPGSLGLT SAFGLGNANR
SIVSPTVSGT SAAGQQQNIL GSGSAGPQEW EWLTMSL
//