ID A0A2P6VNC7_9CHLO Unreviewed; 1290 AA.
AC A0A2P6VNC7;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Nucleolar MIF4G domain-containing 1 {ECO:0000313|EMBL:PSC75596.1};
GN ORFNames=C2E20_1375 {ECO:0000313|EMBL:PSC75596.1};
OS Micractinium conductrix.
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Micractinium.
OX NCBI_TaxID=554055 {ECO:0000313|EMBL:PSC75596.1, ECO:0000313|Proteomes:UP000239649};
RN [1] {ECO:0000313|EMBL:PSC75596.1, ECO:0000313|Proteomes:UP000239649}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SAG 241.80 {ECO:0000313|EMBL:PSC75596.1,
RC ECO:0000313|Proteomes:UP000239649};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC -!- SIMILARITY: Belongs to the SCO1/2 family.
CC {ECO:0000256|ARBA:ARBA00010996}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PSC75596.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPF02000002; PSC75596.1; -; Genomic_DNA.
DR STRING; 554055.A0A2P6VNC7; -.
DR Proteomes; UP000239649; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR Gene3D; 3.40.30.10; Glutaredoxin; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR029159; CA109-like.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR InterPro; IPR003782; SCO1/SenC.
DR InterPro; IPR036249; Thioredoxin-like_sf.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF15011; CA109-like; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR Pfam; PF02630; SCO1-SenC; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF52833; Thioredoxin-like; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000239649}.
FT DOMAIN 702..818
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..131
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 187..373
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 30..64
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..272
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..307
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 336..364
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1290 AA; 135942 MW; 224FD4CB0CD65E7E CRC64;
MFVGRGGRGR GRSRGSGAGR SGPRLPFKLN KELFGEDDRG GGRGRGRGQQ PRKERRRTER
EERCGGGGGG GGGGRDDGSG GGGRGRGGGR SGGSRQPPER RPIAKGGRGI DGPPAKRQRT
GAAPPAAAAG SKFEELLPVH LQAGAPLDPE KELQRQLARK LGLKKGKAQL GGEDGLDEFL
EELGGIDDLF GSEAEEDASL AQQQQRRQGT KRRAGAAAPA AYGSGSGSGS EGEGEGGEDF
TSDEEDGLEG LLDSGSDGEG GSDEEEEEGG SGSEEGGGSL GSSGDSLLDG LGSSEEESDE
EMGGSSGSEG EEEGAAAAVW QRRQKRQRAG SEGEESGSEE GDGEEEEGSD SLASGSEEEE
EEQQAAAATA PVGKYVPPAA RRAAAAAAAA AGGSGSEAAA RVERRVRGLL NRLAEANIQS
IVSDLAELYQ QEGRRLVSDT VCEELVAAAA EGPRASDRFA AVTAGAVAGL AGSVHAAEIV
ANFLDRLAGR LDAAAASGDS LCCHNLSMVA AHLYTTGALK PDLVYSLLDS WRQRFTETDV
SMIVTLLHAC GLQLRAADPI MMKEFVVATH ARAAEAGAAG GSGLTKRAQM LLELVVDVKN
NRRRDDAKRR VVELAPAVGK WLKACGASDA AVGGITWQKV QQRNKKGIWW MPEATDAQPR
QLGGGSAALQ AASAAAAVGA GEGGLAGPEL LKLAAAMRMN TDVRRAAFCV IMGSEDCVDA
VEKLLRLGLK GEQERELVRV TVECCLQEKA WNAYYAHLLL RLCGVGKGHR MTLQFCMWDH
LKDLEGLEVR RLTNLARLLA SVLASGALPS TMLKVVDFSA ALTAREVMFW RICFQHLLAT
CKTADDCTLL FRRIAAVKEL KSLRLALAAF FRRSVGPWLA AKEPGAGGLT AEQLALAVGF
GAYTYETFVL AKQDMRPLTE EITRHPSISS LKMIDQNGSR VTAQNLLGKW ALIDFGSLGS
DADVKGINAI CRAAEAAQQR TGVAVTPVFL SLTPRHDKVE DLKRVAGAAG HPGLVLLTGD
ADGVLECAHK YKALKKSQVL SAVKGDEQYK KGDTAVDESY VSSFVYLVDP AGQFAELWPK
DRPLGCLSGM ASGKTLERLR QMYAKTDAEV NNWSSLQEEG MSLLGTLANI AARVPALEDP
SAYGLLPAAA AAPGLPARLL AKQLEALQGL ILQLQECLSG MQAAVDGMAR QSQQAQRMVT
QDRALTPAVC AAAAPPVPPV DACLQGLREI WRMHAEELRL KRALAEEVRL DSSEAELREV
AALWAAQLHL DAARVSDLVY VVAATQERRP
//