ID Q2H4F2_CHAGB Unreviewed; 539 AA.
AC Q2H4F2;
DT 21-MAR-2006, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2006, sequence version 1.
DT 27-MAR-2024, entry version 78.
DE RecName: Full=Myb-like domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CHGG_06463 {ECO:0000313|EMBL:EAQ89844.1};
OS Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 /
OS NRRL 1970) (Soil fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Chaetomium.
OX NCBI_TaxID=306901 {ECO:0000313|EMBL:EAQ89844.1, ECO:0000313|Proteomes:UP000001056};
RN [1] {ECO:0000313|EMBL:EAQ89844.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CBS 148.51 {ECO:0000313|EMBL:EAQ89844.1};
RA Giovannoni S.J., Cho J.-C., Ferriera S., Johnson J., Kravitz S.,
RA Halpern A., Remington K., Beeson K., Tran B., Rogers Y.-H., Friedman R.,
RA Venter J.C.;
RL Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EAQ89844.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CBS 148.51 {ECO:0000313|EMBL:EAQ89844.1};
RG The Broad Institute Genome Sequencing Platform;
RA Birren B., Lander E., Galagan J., Devon K., Nusbaum C., Ma L.-J., Jaffe D.,
RA Butler J., Alvarez P., Gnerre S., Grabherr M., Kleber M., Mauceli E.,
RA Brockman W., Rounsley S., Young S., LaButti K., Pushparaj V., DeCaprio D.,
RA Crawford M., Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C.,
RA Kodira C., Yandava C., Zeng Q., Alvarado L., Oleary S., Untereiner W.;
RT "Annotation of the Chaetomium globosum CBS 148.51 Genome.";
RL Submitted (FEB-2006) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH408031; EAQ89844.1; -; Genomic_DNA.
DR RefSeq; XP_001222558.1; XM_001222557.1.
DR AlphaFoldDB; Q2H4F2; -.
DR STRING; 306901.Q2H4F2; -.
DR GeneID; 4391513; -.
DR VEuPathDB; FungiDB:CHGG_06463; -.
DR eggNOG; ENOG502S1YG; Eukaryota.
DR HOGENOM; CLU_030814_0_0_1; -.
DR InParanoid; Q2H4F2; -.
DR OMA; AMPPIIN; -.
DR OrthoDB; 207546at2759; -.
DR Proteomes; UP000001056; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR CDD; cd11660; SANT_TRF; 2.
DR Gene3D; 1.10.246.220; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR PANTHER; PTHR46734:SF1; TELOMERIC REPEAT-BINDING FACTOR 1; 1.
DR PANTHER; PTHR46734; TELOMERIC REPEAT-BINDING FACTOR 1 TERF1; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51294; HTH_MYB; 1.
DR PROSITE; PS50090; MYB_LIKE; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001056}.
FT DOMAIN 220..279
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 220..273
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT REGION 13..79
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 205..231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 319..387
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 447..489
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 23..38
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 350..364
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 447..471
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 539 AA; 59000 MW; 0D8C51C14DE75AA0 CRC64;
MATIEPWLTN LLNKPKSPAL PPIQPVSLST TTASQPISLP PLGPELASSH RSERSAPTQI
TPLSAADDAH PVKPLGRPDD AFALHTASIR PLQLLLRESE ANVPASTLRS IVDDGPGSPD
DVLTKKRHRA LTTKEDLVQL PKLPKKQKSA QQVVPPIIAG LHEPPPDAAV FPPITSASFD
DNENFSLGPW KDTGSASEDR PALFSAADAE SSNNPKPKRR PMKPRRKWTE EETNNLLLGV
SRHGVGRWTT ILEDPGFQFN GRTAGDLKDR FRTCCPEELR LTATKEQAAN GEPGPAAVAD
GSAKPKLGIQ IEDILQPAGD EGVENGNTSP SAALDCDAAP KKRKPRAHRK KLEDLAELGI
HGPFEKSHRR KRRPFTKQDD DEILDGLSQY GPSWTRIQRD PKYNLSSRQP TDLRDRVRNK
YPDIYANIEK ANSPKDAWRG KTNILEPSVN TAKDNSRSTT ALPSFEPQLN RSGSKEDMLR
RTATPSAYES TESLPALADI FDMTEAHSSS FLGSASDLDL NHLLDDPYSG SERRLGLGR
//