ID A0A1Z5TU95_HORWE Unreviewed; 896 AA.
AC A0A1Z5TU95;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=MI domain-containing protein {ECO:0000259|PROSITE:PS51366};
GN ORFNames=BTJ68_00330 {ECO:0000313|EMBL:OTA39592.1};
OS Hortaea werneckii EXF-2000.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Dothideomycetidae; Mycosphaerellales; Teratosphaeriaceae; Hortaea.
OX NCBI_TaxID=1157616 {ECO:0000313|EMBL:OTA39592.1, ECO:0000313|Proteomes:UP000194280};
RN [1] {ECO:0000313|EMBL:OTA39592.1, ECO:0000313|Proteomes:UP000194280}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=EXF-2000 {ECO:0000313|EMBL:OTA39592.1,
RC ECO:0000313|Proteomes:UP000194280};
RA Sinha S., Flibotte S., Neira M., Lenassi M., Gostincar C., Stajich J.E.,
RA Nislow C.E.;
RT "The recent genome duplication of the halophilic yeast Hortaea werneckii:
RT insights from long-read sequencing.";
RL Submitted (JAN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OTA39592.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MUNK01000002; OTA39592.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Z5TU95; -.
DR STRING; 1157616.A0A1Z5TU95; -.
DR VEuPathDB; FungiDB:BTJ68_00330; -.
DR InParanoid; A0A1Z5TU95; -.
DR OrthoDB; 5473641at2759; -.
DR Proteomes; UP000194280; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000194280}.
FT DOMAIN 649..779
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 599..635
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 32..63
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..110
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 111..145
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 160..190
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 203..237
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..278
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 896 AA; 98270 MW; 53FA30207F18FF18 CRC64;
MSRNNGYGGP KLPKFLLDQV GGGHRINKQT SRKDRRKAER DEKKAARARP AQRLDRASQR
AQVEESDDDD DGLGDEDEAP SPPQRPAKDA KPAKSILKAS KPKKPEPESE PDSETGEELE
EQDDGEDLLE EVDDDEDDDE EDERAKSDDS FTISRQAAKA GLGHEEDEIA ALERKLGMKG
KKRSYDFGDD ELDWLVTGSD SEDGGRGTKR KRPEDSKWLR DKRMKANERK EVDRRDPEDD
ENSDDDDSGE DDEQDEDVEN PFSEDEISGD DFEGFDSEAD AEAGTSAPPP PKRERENPYV
APVAKEAAPS AGKYVPPSLR KPASSEDEAL KQLRRQIQGQ LNRLSEANML SILTAIESVY
TNNARQHVTS TLVDLLVSLV SDPAVLNDTF IILHAGFSAA LYKVVGTDFG AQLLEKIVET
FDAHRSDGDS EGKQPLNLMA FLSSLYAFQV VGCGIIFDYI RLLLDTMSES NTELLLRIIR
SSGTQLRADD PTALKDIVLL LQRNVAAAGG EANVSVRTKF MIETINNLKN NRMKPGAAAT
SAVTAEHTTR MRKTLGSLNA RSTLRANEPL RITRADIKDS DKKGKWWLVG ASYHDPAKLA
SSTNNSSGSQ SAAKAATANS MEDDGYESET PGNNVNLTRL ARQQGMNTDI RRAVFIALLS
AADYKDAHMR LLKLKLKSKQ FLSEVPRVLV HCAGAEAVYN PYYTLIAKEV CLQGPRKGMA
KHFLFAAWDV LRRLGGDGEE EEDAGEEVGT KKIVNLGKMY GTLIAEGILG LGGVLKTVKG
EFAYLAPKTS LFVEVLLTTI FLQLRKKTRK DEEGFVSKVR DVFMQTGQVS AGLVQGLQFF
VRSSLSRAAL ANGKKEERTL KEGCDVAAQA LQEAAQGAVV LGEDDDDDGE GGYSSG
//