ID A0A2C5XND8_9PEZI Unreviewed; 1202 AA.
AC A0A2C5XND8;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Homeobox protein 4 {ECO:0000313|EMBL:PHH56284.1};
GN Name=hbx4_0 {ECO:0000313|EMBL:PHH56284.1};
GN ORFNames=CFIMG_000411RA {ECO:0000313|EMBL:PHH56284.1};
OS Ceratocystis fimbriata CBS 114723.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Microascales; Ceratocystidaceae; Ceratocystis.
OX NCBI_TaxID=1035309 {ECO:0000313|EMBL:PHH56284.1, ECO:0000313|Proteomes:UP000222788};
RN [1] {ECO:0000313|EMBL:PHH56284.1, ECO:0000313|Proteomes:UP000222788}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 114723 {ECO:0000313|EMBL:PHH56284.1,
RC ECO:0000313|Proteomes:UP000222788};
RX PubMed=23931120; DOI=10.1016/j.funbio.2013.06.004;
RA Simpson M.C., Wilken P.M., Coetzee M.P., Wingfield M.J., Wingfield B.D.;
RT "Analysis of microsatellite markers in the genome of the plant pathogen
RT Ceratocystis fimbriata.";
RL Fungal Biol. 117:545-555(2013).
RN [2] {ECO:0000313|EMBL:PHH56284.1, ECO:0000313|Proteomes:UP000222788}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 114723 {ECO:0000313|EMBL:PHH56284.1,
RC ECO:0000313|Proteomes:UP000222788};
RX PubMed=24563841; DOI=10.5598/imafungus.2013.04.02.14;
RA Wilken P.M., Steenkamp E.T., Wingfield M.J., de Beer Z.W., Wingfield B.D.;
RT "IMA Genome-F 1: Ceratocystis fimbriata: Draft nuclear genome sequence for
RT the plant pathogen, Ceratocystis fimbriata.";
RL IMA Fungus 4:357-358(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PHH56284.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APWK03000001; PHH56284.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2C5XND8; -.
DR STRING; 1035309.A0A2C5XND8; -.
DR OrthoDB; 450547at2759; -.
DR Proteomes; UP000222788; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR006600; HTH_CenpB_DNA-bd_dom.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR11850:SF102; HOMEOBOX PROTEIN HOMOTHORAX; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF03221; HTH_Tnp_Tc5; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00355; ZnF_C2H2; 3.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51253; HTH_CENPB; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000313|EMBL:PHH56284.1};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000222788};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 205..268
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 411..439
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 787..861
FT /note="HTH CENPB-type"
FT /evidence="ECO:0000259|PROSITE:PS51253"
FT DNA_BIND 207..269
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 171..213
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 274..298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 346..395
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 855..917
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1147..1172
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 346..383
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 855..887
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 894..917
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1202 AA; 134334 MW; BF21BCAF226D7B49 CRC64;
MDEFLNWDGS VPPPDSDVPN IPSLSASSAE LMEITAQDLQ DDVSFALDLA FNAHEGPDSA
GPQTCPSDNI VTSVPQPLDP GAGLFSADWL INAPTPCLEC LDGGFLCKLI KEGQYKGFCT
SCVALGSMCS FGNEFRTASG LETSSYNSLP IIDSQPSAAM VQNPTESFFT PLQASSQPTY
DSPFIQPRPF SQLEQPQQDQ KADAGSSAKT NTRFSRASVR VLRQWLSSHS RHPYPSDEEK
DMLQRQTGLN KTQITNWLAN ARRRGKVQIA VRSTSPHPQT FSLDSPRRNT PGTENMNPLQ
RWANSPPEHE PASAIAIARA VTASSSASST GASSYHNYTD DGSGLSLHGS SASSVGTSRS
SGGSFASAMS HASRGSIGSF SSMTRGRRRR RRRAGGLVVE KKSLHNPLRT YQCTFCPETF
KTKHDWQRHE KSLHLSLERW VCSPNGTAAI NPANQTMSCV FCGEANPSEA HLETHNCSSC
LDKTLPERTF YRKDHLNQHL RLVHNVRFVD WSMKDWKISA PTIQSRCGFC GIVMTTWAIR
IEHLSEHFKS GASMSEWKGD WGFERHILDL VENSIPPYLI DTERNSPYPM AAEQAPPESP
RDAYELIKDE LVYYMANRFE NNHDFPTDDQ LRVEACRIIF ASEVLSLQGI SSMASWLRDV
LMDDEVIYRQ AKFGPLRTQA ENRLAILKIN GKDNLFEACP LEHALQDYVV TRQLIGMTPV
DSELQEEACR IVGQVEEVAT HPCEAIANFL LRLIKSSTKW LARFRQRAHL PRSEEMGDEW
YRSTDPATID STIHSYSRLE RELGDYLEHQ RREGIEPTNL DLQRRARIII YEFDDGWNQT
AADSTEWLTA FRERHPANSS PNTVCQGPTS GSLDQAASNS TESPRVPVPS SPIVHSPDNA
SINRSSEVET QFSSPGPVGI NSKQLRVRIG TFLNDSNCFR RLERELSRYV TMCMSNNNPN
KHVPTDEELQ HQARWVVYDD DDPWNQTAAD NLEWLRRFKR CSGLLNDGGP GLSRGNEWNV
SQGGSGFAPP FAYPKNGFTE HVSSQKVIDV NVHGKIFGTE GGVASKYLED LEGRYPRPAV
VFCSRELESG LIDFVSRTVA SGAAFPTDVE LKDQARRILS TQVTAADDEV LLEQFKKTVK
ERLASVNDEN GQQHQQNEKQ PQRLGQRPQE QQQVPEILVD QMMTERDFDD LLDGINFDLE
LP
//