ID Q2H8C6_CHAGB Unreviewed; 652 AA.
AC Q2H8C6;
DT 21-MAR-2006, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2006, sequence version 1.
DT 27-MAR-2024, entry version 88.
DE RecName: Full=Homeobox domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CHGG_03528 {ECO:0000313|EMBL:EAQ91593.1};
OS Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 /
OS NRRL 1970) (Soil fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Chaetomium.
OX NCBI_TaxID=306901 {ECO:0000313|EMBL:EAQ91593.1, ECO:0000313|Proteomes:UP000001056};
RN [1] {ECO:0000313|Proteomes:UP000001056}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970
RC {ECO:0000313|Proteomes:UP000001056};
RX PubMed=25720678; DOI=10.1128/genomeA.00021-15;
RA Cuomo C.A., Untereiner W.A., Ma L.-J., Grabherr M., Birren B.W.;
RT "Draft genome sequence of the cellulolytic fungus Chaetomium globosum.";
RL Genome Announc. 3:E0002115-E0002115(2015).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH408030; EAQ91593.1; -; Genomic_DNA.
DR RefSeq; XP_001230044.1; XM_001230043.1.
DR AlphaFoldDB; Q2H8C6; -.
DR GeneID; 4388533; -.
DR VEuPathDB; FungiDB:CHGG_03528; -.
DR eggNOG; KOG0773; Eukaryota.
DR HOGENOM; CLU_008497_3_0_1; -.
DR InParanoid; Q2H8C6; -.
DR OMA; RYDWTRH; -.
DR OrthoDB; 450547at2759; -.
DR Proteomes; UP000001056; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR11850:SF102; HOMEOBOX PROTEIN HOMOTHORAX; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00355; ZnF_C2H2; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 2.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000001056};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 32..95
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 238..266
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DNA_BIND 34..96
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 20..41
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 190..232
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 190..210
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 211..226
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 652 AA; 72227 MW; E5F4D3186D509068 CRC64;
MSLPYHPQVT AVDLAVPLRG GNLTDNETSK TAKARPKTAR HAKGSTRILN EWFVLHSTYP
YPTEEEKQIL AARTGLTTRQ VSFWFVNARR RKTTWGPQCA SSASDAPLSL PTFDRLPDSG
WEGMAPMDRW RHTPPDQEGA SLSAIAEAAE RNPHASSAPT LGYYGFDTGE MLSEAHWSIS
SFGSHHAAIP GSGSDSSVLS HSSGSATSVR STHSRRRRRR RQRSPVRASS GKSASRLYQC
TFCTDTFKTR YDWTRHESTL HLALEKWTCI PSGPTYYDRA LLRPRCALCD VLDPSDSHLR
SHGSLECASK PHPDRTFYRK DHFRQHLRVC HGVDDILPSM KNWKSHISQI KSRCGFCGES
FQLWSARNDH LAEHFRLGAR MKSWKGCRGL EPAVALLVEN AIPPYLIGIE SNDLQPFSAS
RMAMKGSSTG VASGQVSGTM FEILTARLGD YVRAERERGT VITDDLLRKE ARLILFGDDD
PWNQTPADNA EWLALFKEGY SLAPTLASCG DVTAGVSLAQ QQCWFGPPVC DTNPSPFTLE
KMRQATVSSS SFGPPEICGV LLYNQMIHQQ EDASLAVPWS WQTPECLAEF SQMCETNQAV
MAADPTNDLN FDMVFGEFND LEQDEQIPNI GLEAPGVNDC IGAAGKLMDT CT
//