ID A0A512ULK8_9ASCO Unreviewed; 672 AA.
AC A0A512ULK8;
DT 16-OCT-2019, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2019, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE RecName: Full=GATA-type domain-containing protein {ECO:0000259|PROSITE:PS50114};
GN ORFNames=JCM33374_g5631 {ECO:0000313|EMBL:GEQ71945.1};
OS Metschnikowia sp. JCM 33374.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Metschnikowiaceae; Metschnikowia.
OX NCBI_TaxID=2562755 {ECO:0000313|EMBL:GEQ71945.1, ECO:0000313|Proteomes:UP000319698};
RN [1] {ECO:0000313|EMBL:GEQ71945.1, ECO:0000313|Proteomes:UP000319698}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JCM 33374 {ECO:0000313|EMBL:GEQ71945.1,
RC ECO:0000313|Proteomes:UP000319698};
RA Hirao A.S., Imai R., Endoh R., Ohkuma M., Degawa Y.;
RT "Draft genome sequence of a novel Metschnikowia sp. strain JCM 33374, a
RT nectar yeast isolated from a bumblebee.";
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GEQ71945.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BIMT01000137; GEQ71945.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A512ULK8; -.
DR STRING; 2562755.A0A512ULK8; -.
DR Proteomes; UP000319698; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 3.30.50.10; Erythroid Transcription Factor GATA-1, subunit A; 1.
DR InterPro; IPR013860; AreA_GATA.
DR InterPro; IPR039355; Transcription_factor_GATA.
DR InterPro; IPR000679; Znf_GATA.
DR InterPro; IPR013088; Znf_NHR/GATA.
DR PANTHER; PTHR10071:SF281; BOX A-BINDING FACTOR-RELATED; 1.
DR PANTHER; PTHR10071; TRANSCRIPTION FACTOR GATA FAMILY MEMBER; 1.
DR Pfam; PF08550; DUF1752; 1.
DR Pfam; PF00320; GATA; 1.
DR PRINTS; PR00619; GATAZNFINGER.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR PROSITE; PS00344; GATA_ZN_FINGER_1; 1.
DR PROSITE; PS50114; GATA_ZN_FINGER_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00094};
KW Reference proteome {ECO:0000313|Proteomes:UP000319698};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00094};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00094}.
FT DOMAIN 498..551
FT /note="GATA-type"
FT /evidence="ECO:0000259|PROSITE:PS50114"
FT REGION 81..107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 164..298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 419..498
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 545..629
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 164..193
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 241..298
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 419..436
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 437..451
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 557..580
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 593..607
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 672 AA; 73684 MW; B01338C15D0F92F6 CRC64;
MYKNAKNLIP YRSRMENLIW RMMHVKNQKL RQNTLNIRTS QDFGSLSFNP ILGQLSSEEE
MASPEEVKKE DFDYVAHIRR MSNTQSSRKR PAPMSPFLPA SNGSGLPEPG QVSNHIHSNL
SAALKDQRIP SPDSNENNHG YAFSLDSLAF EASSYPNNLE EQTKSFDFSS QSRGLDLDSV
QEPTPSLPSR PTFAHSLSAR PADFLTNNNH HHHHPQHLHG MQDSSSFHPT PDPELPHVDP
NLFNRNSNPS GLASSAPQSI SFPQSQSRFP PSKRQTPSSS YDPSSFMYQP GPTSGLPASL
SRQNNSFVSV ADHFAAPSRP FTPYDDDVGS IPDSMNLGNS FYDPSLTGML STRGSLADPP
LVNETDSKMT YFDTNVKGPS PSFLSSQFSQ QQPQNSQFSL NAGAAWANSY FEDVGSPLGS
VSTSNSGNTP NTRLAPSPKK KTKKAKPKKK SEPIIHIKNE VAPGGSKNGS QAPETKPAKS
NAAPKANAKG NAAQSSTTTS NIECANCFTK TTPLWRRNPQ GEPLCNACGL FLKLHGTVRP
LSLKTDVIKK RQRGQGPGTL SRRNSSQTPG IIQNPQIPTS KPDRDGDDLN PKPINNSGPT
QGLANSAKKK ETKQKKGQMG NPIPVKSEET LVPIHELDKD SDWFSIPQND LKLGHDEQDE
NKGQWDWLSM NM
//