ID A0A0C7NGQ2_9SACH Unreviewed; 1049 AA.
AC A0A0C7NGQ2;
DT 29-APR-2015, integrated into UniProtKB/TrEMBL.
DT 29-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=LALA0S15e00144g1_1 {ECO:0000313|EMBL:CEP64910.1};
GN ORFNames=LALA0_S15e00144g {ECO:0000313|EMBL:CEP64910.1};
OS Lachancea lanzarotensis.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetaceae; Lachancea.
OX NCBI_TaxID=1245769 {ECO:0000313|EMBL:CEP64910.1, ECO:0000313|Proteomes:UP000054304};
RN [1] {ECO:0000313|EMBL:CEP64910.1, ECO:0000313|Proteomes:UP000054304}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 12615 {ECO:0000313|EMBL:CEP64910.1,
RC ECO:0000313|Proteomes:UP000054304};
RA Neuveglise Cecile;
RL Submitted (DEC-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LN736374; CEP64910.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0C7NGQ2; -.
DR STRING; 1245769.A0A0C7NGQ2; -.
DR HOGENOM; CLU_006786_2_0_1; -.
DR OrthoDB; 5491616at2759; -.
DR Proteomes; UP000054304; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000054304}.
FT DOMAIN 797..933
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 21..318
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 362..469
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 45..93
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..142
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..157
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 228..297
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 362..378
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 402..425
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 426..443
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1049 AA; 118335 MW; DA19D26E757728B9 CRC64;
MRNEKHGIRI PGTILDELKR QDYGEDRRFN VGKRKRATQL GRKERRKQQR REEKSRGTAG
GRAREDLKKA GNKVDKPAKN VKKPLQKVAQ KEVEEEAEEE EEVVGNGREI SDSGSLPFSS
DDELSSSDFD DFGDDVEEQN GQESEGSGES GTDMTAEETM QALKRAKQAK KANKGTYSDD
DDEPATDMTA EETMRALKRA KDAKKAKKAT HSDADEELST DMTVEETMQA LKRAKEVKKA
RYSDSDDEPD TEMTAEETMR ALKRAKDAKK TGAANLKTKT DKPVSENKQK RSNHPPIAEE
PTVKPSLTPS QLAQAERDKW DYDYYAQKLG LKGKKKTLKA NSEFDAVGGL LEGLDFFENY
NSDISDESES EVESEASNGL SDEPENISKR HSAGEADENG PENPFSSDDE VSEGDFEEFD
ENDLDEQEWK ELRELEGETS DQETTVRENP YVAPTSEPAA YVPPSLRQKN LSSEPSQLVV
ELRKKVKSSL NKLSVSNITV IVSALNELYN SYPRQHVSDA LANQILEIVC QRDKLLDSLM
INYAAVVFSL WKLRGTEVGA SFLQTWVEKL LESFRQGQAK LEQLEKENGE ESTFVMSKEC
TNLVSFLSYC YNFGMVSSRI IYNVIELLIK TPNEFTSDIL LRIISISGQL IRGDDPKALK
EIISELLTNV KNVKQSTRMK FLLEVMSDLK NNRLKPSLLA ASHSGVKKSI SKLINDTASS
PSDPLQVTLD DIESIDTKGK WWLVGASWKG NQESAFEHKK SLAKKEESSS TIAIEDNIFD
DIPDWSEIAR AQRMNTEVRR ALFVSVMSAQ DFMDAFARID KLNLKNKQSL EIPRVLLHCL
AMEGSKGSYN PFYALLGIKL SQQSHHVAKA FQFIFWDMVK KFEQDDSDIE SAGEEVEFSE
DQKLKKISCQ GRFFGSLLAQ EILKLDIFKH VPLMGGLNGD GTLFIEVMVF QFLLSIAKKC
EVKKKSGGVR AISFDPAPFQ RLAEHSNGFD NSKSVFAALR MFLQKRFKYQ AFITEQEGTK
EFERQKRRLE WALPEFLDLL KVGLKSQEL
//