ID A0A1B7NWZ6_9EURO Unreviewed; 999 AA.
AC A0A1B7NWZ6;
DT 02-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2016, sequence version 1.
DT 13-SEP-2023, entry version 26.
DE RecName: Full=White collar 1 protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=ACJ72_04360 {ECO:0000313|EMBL:OAX81300.1};
OS Emergomyces africanus.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Emergomyces.
OX NCBI_TaxID=1955775 {ECO:0000313|EMBL:OAX81300.1, ECO:0000313|Proteomes:UP000091918};
RN [1] {ECO:0000313|EMBL:OAX81300.1, ECO:0000313|Proteomes:UP000091918}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 136260 {ECO:0000313|EMBL:OAX81300.1,
RC ECO:0000313|Proteomes:UP000091918};
RA Cuomo C.A., Schwartz I.S., Kenyon C., de Hoog G.S., Govender N.P.,
RA Botha A., Moreno L., de Vries M., Munoz J.F., Stielow J.B.;
RT "Emmonsia species relationships and genome sequence.";
RL Submitted (JUL-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OAX81300.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LGUA01000503; OAX81300.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1B7NWZ6; -.
DR STRING; 1658172.A0A1B7NWZ6; -.
DR OrthoDB; 728091at2759; -.
DR Proteomes; UP000091918; Unassembled WGS sequence.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00130; PAS; 3.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 3.30.50.10; Erythroid Transcription Factor GATA-1, subunit A; 1.
DR Gene3D; 3.30.450.20; PAS domain; 3.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR000679; Znf_GATA.
DR InterPro; IPR013088; Znf_NHR/GATA.
DR NCBIfam; TIGR00229; sensory_box; 1.
DR PANTHER; PTHR47429:SF7; GATA-FACTOR; 1.
DR PANTHER; PTHR47429; PROTEIN TWIN LOV 1; 1.
DR Pfam; PF00320; GATA; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF13426; PAS_9; 2.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 3.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 3.
DR PROSITE; PS00344; GATA_ZN_FINGER_1; 1.
DR PROSITE; PS50114; GATA_ZN_FINGER_2; 1.
DR PROSITE; PS50112; PAS; 2.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00094};
KW Reference proteome {ECO:0000313|Proteomes:UP000091918};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00094};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00094}.
FT DOMAIN 398..419
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 580..635
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 958..991
FT /note="GATA-type"
FT /evidence="ECO:0000259|PROSITE:PS50114"
FT REGION 818..837
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 999 AA; 112110 MW; C0E6AFF47EFC1A41 CRC64;
MIPHILVGAM DQNEDFHSAH NHSEKLVNSP KLYTHYSISG LHSNVSDLAR NWNLEKVQPA
EDRDPTESNS QDQHRQIPIQ FFNYKMPQNT TLEIAASVPL ECGDIVPVQL DASQAPAGVS
TASLGTSQPS TNTPRGLNVN GAVEFYNASH TFYPPENAVS AAPGIVSSLL EDRWNYLGSP
DTVPHCFSDT SSLHLAELAL GGFPAIKPTP HCLSDPKSLA ESSQHRALRR CRDLPRRRPS
SGSHRLLLPD LRKPNRNLQS QSYYPYEHQG LPIQSSYGIR ASEPLGQPGI LPHEYPYWIS
RSNIESDFNN HSGDILPDLK ICPDVGIDIN DKDGSTLPAA FPDNAAYQFP AENRTGEPNA
VDVLNCISSR SNPEINIGAI DMSCSFIICR ITAGGHPVVY VSDEFHRLTG YSHEETVGRD
CRFLQAPTGT MEPGSKRRTS DERAIRHLKL KINAKSEVQE CLVNYRKGGQ PFLNLLSVIP
IRWLSNEYNF IVGFQLDLVD SPQAVTGKND DGSYTVNYRR QVLAPNIYDP GRTVSSQNKY
LSFKNQNHAN IRVDKVGPQS PVPFRTSWDK MFLGNSEFLF YIISTTGVFI YVTPSTSTIL
EYEPEELHGL TISSICHPCD IVTVIRELRD SEPGSVVNLL YRIQRKISGY TWFESLGSVY
IETMQKQKHI VLLGKLLVVY TMSRDELMKN GGINEGEVWV KISPSGIVLF VSSNASTFLD
RPPESLVGES VQNLLSGKSR EGFENALGIA RNGKRVTYKH ELQHRRGNWL QVQSTIYPVD
KVKEALKPIF LILQIRLLKL GRTVFGFRSK AASMRTQKRG GSNIIIRPPT ASDPCLRPRP
DFATHGFGNQ NIRPPSSYPP YTGQHITPCS DMKPDPNDMA IYHGHALNPS IHNLQQEQQE
SEKENKENCN IFEELNPTRA TNWQAEIDHL KKRNRLLAEE LQYLTTSKKK RKRKRDVEMP
EKDCSQCHTK TTPEWRRGPS GLRDLCNSCG LRWAKQVST
//