ID G0SHD3_CHATD Unreviewed; 641 AA.
AC G0SHD3;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE SubName: Full=Putative sequence-specific DNA binding protein {ECO:0000313|EMBL:EGS17622.1};
GN ORFNames=CTHT_0069620 {ECO:0000313|EMBL:EGS17622.1};
OS Chaetomium thermophilum (strain DSM 1495 / CBS 144.50 / IMI 039719)
OS (Thermochaetoides thermophila).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Thermochaetoides.
OX NCBI_TaxID=759272 {ECO:0000313|Proteomes:UP000008066};
RN [1] {ECO:0000313|EMBL:EGS17622.1, ECO:0000313|Proteomes:UP000008066}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 1495 / CBS 144.50 / IMI 039719
RC {ECO:0000313|Proteomes:UP000008066};
RX PubMed=21784248; DOI=10.1016/j.cell.2011.06.039;
RA Amlacher S., Sarges P., Flemming D., van Noort V., Kunze R., Devos D.P.,
RA Arumugam M., Bork P., Hurt E.;
RT "Insight into structure and assembly of the nuclear pore complex by
RT utilizing the genome of a eukaryotic thermophile.";
RL Cell 146:277-289(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL988047; EGS17622.1; -; Genomic_DNA.
DR RefSeq; XP_006697240.1; XM_006697177.1.
DR AlphaFoldDB; G0SHD3; -.
DR STRING; 759272.G0SHD3; -.
DR GeneID; 18261000; -.
DR KEGG; cthr:CTHT_0069620; -.
DR eggNOG; KOG0849; Eukaryota.
DR HOGENOM; CLU_023524_1_0_1; -.
DR OMA; SHADPMI; -.
DR OrthoDB; 5401018at2759; -.
DR Proteomes; UP000008066; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR24324:SF12; HOMEOBOX DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24324; HOMEOBOX PROTEIN HHEX; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000008066}.
FT DOMAIN 91..151
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 93..152
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..120
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 621..641
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..29
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 40..120
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 641 AA; 70272 MW; 374616CCF4105B68 CRC64;
MATDAVKAEV KLETIHEETK QEVKQEFKPE DALTPGAPSL AATPSPSKSP SNKSPEPATK
PISPNKVTKP STNGTTTNGR QKTRKSTLTQ QQKNNKRQRA TPDQLATLES EFNKNPTPTA
QVRERIAEEI NMTERSVQIW FQNRRAKIKM LAKKSLENGE DMDSIPESMR QYLAMQAMEH
GKSIPGFLGR PGYLGYGHGE GSGQGKVLIH HLNCRSLTIG TWVRVGQNAM DLIVFYSPDK
CTMTYYINND QAGYKIEYPF SFIKNIYLNQ AEGEHGGITI ELLQPPLFFM DSSTTSTFVQ
VTDFTENFQA SRVLTHQLGG NAKVLSTQLA KLVSLDAFIN RHNIPVPPPP PPPPAPLFDQ
MQPPLSMSAP VSPALRPASQ PNFAQPHIGM FQEQQWGIAP QHTMTMRGPG HKRQRSRSVP
GPIDLQTMQL LQNPPSFHIT QPDNQSAVQS PHIFSPIPQQ PNLLGPLNPN LRIDTRAGFG
LDMRNYPLSA TTAPTPSDFP SPGFFATQPQ DTPAIPAASF TPYSATAPTF SPMVSPSNIG
VPPPSISPLS FNHSDPAIVG ESPPITMPTI CEGSAISDDG STLNDLYPAG KQQTMTLPMH
PHSPFMEPNQ ADMELNQFMD LKRFENDPAS LSPPESVPAQ S
//