ID A0A084QBJ5_STAC4 Unreviewed; 867 AA.
AC A0A084QBJ5;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=MI domain-containing protein {ECO:0000259|PROSITE:PS51366};
GN ORFNames=S40285_05863 {ECO:0000313|EMBL:KFA61330.1};
OS Stachybotrys chlorohalonata (strain IBT 40285).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Stachybotryaceae; Stachybotrys.
OX NCBI_TaxID=1283841 {ECO:0000313|EMBL:KFA61330.1, ECO:0000313|Proteomes:UP000028524};
RN [1] {ECO:0000313|EMBL:KFA61330.1, ECO:0000313|Proteomes:UP000028524}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IBT 40285 {ECO:0000313|EMBL:KFA61330.1,
RC ECO:0000313|Proteomes:UP000028524};
RX PubMed=25015739; DOI=10.1186/1471-2164-15-590;
RA Semeiks J., Borek D., Otwinowski Z., Grishin N.V.;
RT "Comparative genome sequencing reveals chemotype-specific gene clusters in
RT the toxigenic black mold Stachybotrys.";
RL BMC Genomics 15:590-590(2014).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL660858; KFA61330.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A084QBJ5; -.
DR STRING; 1283841.A0A084QBJ5; -.
DR HOGENOM; CLU_006786_2_1_1; -.
DR InParanoid; A0A084QBJ5; -.
DR OMA; FMVDILN; -.
DR OrthoDB; 5491616at2759; -.
DR Proteomes; UP000028524; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000028524}.
FT DOMAIN 643..776
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..306
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 68..87
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..130
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 146..190
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 201..219
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 242..262
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 263..280
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 867 AA; 96493 MW; 9F0B11A78368D265 CRC64;
MPASSVAELQ QKLYKRMGLD APEKAQQRKS HGARSYSRPT SRKDQRRAHR IQKKSHIYAK
PAASNRPASL HGRSQSKHTQ SSGKGVTHPR ANGKRAIPSE DTEDEDEDES GIDDEDLGGS
SDEDGGGDLE GESDAMALNG KKGKTDAMPR AVRERLSRDD AEIEELERKL GIKKGRKSLP
QSFKDDGLDE LIGDVESEAE SREEEPNKRK RDYDEWLSAK RAKTNSGASR SKARHVEEDE
DDDHALSGED QYEDEDSNGE EEGVGSSSDN YDQSSFDGFD SDSGPKAPIQ GSRQRENPYV
APTTGAVVAK YVPPSLRKTA DSEGEAKTRL RRQIRGQINR LTDANILSIV QAVEGFYQTN
ARGDVTEILT DAVLTQVCMP ETLPDQFFVL TGGFIAAMYK VIGSSFGSHF INRVVMDFGQ
EYERAISQGA QPSKETSNLI TLLSQLYTFE VVSCKIIFDY MERLLSELNE LNVELLLRIC
RMAGRLLRRD DSQTLKHVST VLNKAVAQVG YANISARTKF MVETIGDLKN SKPKAKGMDS
AVVSEHVMRM KKRLGELKSQ TRRLEGLAPM GMSLEDIERA GTQGKWWLVG ASVPVKPSGR
GATDGPGSAD IDGNADVSTD DEDMDFILPD YPQKARAQGF GTTTQIAIFT ALMSAIDDEH
GYQQYLQLGL KKDEQLEIAR VLVQCVGSEA DYNEYYALVG KQACQNGRVR FAFQNRLWRI
FRGLGESLFG DEVEDEETAD SERMKDEHRL RNVARFYASL IMEGALALSV LKPLELPELN
SWTAVFIEQL MISLLQACRS KSVEEEESRL NRVFGAIRDL PVLAAGVHWI LRKRVKKTKL
ISSKEAKKLE KVREKAQSIV QQVKTVE
//