ID A0A2T3A811_9PEZI Unreviewed; 880 AA.
AC A0A2T3A811;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE RecName: Full=MI domain-containing protein {ECO:0000259|PROSITE:PS51366};
GN ORFNames=BD289DRAFT_260078 {ECO:0000313|EMBL:PSR85492.1};
OS Coniella lustricola.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Diaporthales; Schizoparmaceae; Coniella.
OX NCBI_TaxID=2025994 {ECO:0000313|EMBL:PSR85492.1, ECO:0000313|Proteomes:UP000241462};
RN [1] {ECO:0000313|EMBL:PSR85492.1, ECO:0000313|Proteomes:UP000241462}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=B22-T-1 {ECO:0000313|EMBL:PSR85492.1,
RC ECO:0000313|Proteomes:UP000241462};
RA Raudabaugh D.B., Iturriaga T., Carver A., Mondo S., Pangilinan J.,
RA Lipzen A., He G., Amirebrahimi M., Grigoriev I.V., Miller A.N.;
RT "Coniella lustricola, a new species from submerged detritus.";
RL Mycol. Prog. 17:191-203(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KZ678442; PSR85492.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2T3A811; -.
DR STRING; 2025994.A0A2T3A811; -.
DR InParanoid; A0A2T3A811; -.
DR Proteomes; UP000241462; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000241462}.
FT DOMAIN 650..779
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 20..159
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 179..307
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 606..633
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 22..38
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 61..88
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 107..148
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 179..198
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 208..234
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 254..292
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 606..623
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 880 AA; 98311 MW; EA72B3C91C06232A CRC64;
MAPRGREPLR PSLPSALLEQ LGVDGAQSSS LSSPRRGQFQ TSRKERRKQE RIHKRQRTAT
VQKRTPAGRP TARTASSTKP TDVSKKTMVA SFSHKHKQKP AARDVLDDEI DDDFGGFSAD
DVDEEKEGDD GIDSFESDNE DGDNDDGYED TATTTVSNTS KVVKEKLAKE DAEIAALERK
LGIKNRKSLP KSFEEDGLGD LLGDLGAESD GDDDKSEKRK RKAEADEWLA QKRRKALHKA
GVQPKHDTGE DSDGLAGLGE DDEDLSSEDL DLYEDDEDQD DNSDDYGSDD MNDEEEVKPV
KKRENPYVAP VAATAQAQKY VPPARRQATG SDAELLLRIR RQTQGLINRL TESNLLAILG
DIEKLYRDYP RQHVTSILVD ILLTQICEPT SLPDTLLILT AGFSTAVYQV IGPDFGAQLV
QQTVDRFQFH FAQVSGGDAQ NVPKHTSNLI TFLSEIYNFQ LIRCNLIFDY IRLLLSNLSE
HNAELLLRIV RMAGQSLRRD DPLALKDIVS LIGPAVKKIG EGNLSVRTKF MIESINDLKN
NKVKTGASAT AVVQDHMTRM KKLLGSMQTR KLKATEPLRI GLKDIQESDK RGKWWLVGAS
WAGRDEAPKK QVEAAKDGRD DTDNDDLSSD DEDVMPDLAA IGKENGMNTD VRLSIFIAIM
SAYDYEDAYN RLSKLRLNKD RQKEIAYVLI QCTGMEQQYN PYYALVARKV CGDGRIKFAL
QDSLWKFLRR LGEPLFGEDA DEDDVDTIDS RRIFGVAKMF GFLLAHGALS LSILKCLDLA
YVRPKTRDLL EVMLMVMILE LRKAKTEVET VFASVADMPE LSKGLQYFLR KRVAKSDLFA
NKKDSKRVKK ACRTAIKALD GGVEAATETL DMVDLDEDED
//