ID A0A2B4RAG3_STYPI Unreviewed; 835 AA.
AC A0A2B4RAG3;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=Nucleolar MIF4G domain-containing protein 1 {ECO:0000313|EMBL:PFX13799.1};
GN Name=Nom1 {ECO:0000313|EMBL:PFX13799.1};
GN ORFNames=AWC38_SpisGene22089 {ECO:0000313|EMBL:PFX13799.1};
OS Stylophora pistillata (Smooth cauliflower coral).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Scleractinia;
OC Astrocoeniina; Pocilloporidae; Stylophora.
OX NCBI_TaxID=50429 {ECO:0000313|EMBL:PFX13799.1, ECO:0000313|Proteomes:UP000225706};
RN [1] {ECO:0000313|Proteomes:UP000225706}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Voolstra C.R., Li Y., Liew Y.J., Baumgarten S., Zoccola D., Flot J.-F.,
RA Tambutte S., Allemand D., Aranda M.;
RT "Comparative analysis of the genomes of Stylophora pistillata and Acropora
RT digitifera provides evidence for extensive differences between species of
RT corals.";
RL bioRxiv 0:0-0(2017).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PFX13799.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSMT01000894; PFX13799.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2B4RAG3; -.
DR STRING; 50429.A0A2B4RAG3; -.
DR Proteomes; UP000225706; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000225706}.
FT DOMAIN 637..753
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..71
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 95..126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 214..297
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..21
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..71
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 217..233
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 235..251
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 835 AA; 96338 MW; 3CD364D55C2A07A4 CRC64;
MADPRGRKRK KEKRDEFFHI QAKGQYLKIP KNQEWDGEDS STVGGEERVR KLPSRRDKRQ
QARVQKKAKK AAFFSNKKGK TTKMLSSGFI RNKLKEGSDL TEKPTGNKRG MVSTKKRKRR
RTKEVAEGQE FLKQRLLAEN RKEDKEIKRL EKLLNIGSKE KVTSKKFKVD GLDYLLEVCE
HQLKEHDKDS GDELKSSDGL FSKELEHYDV MGIDGNSKLN MSESAQRDSN DSYGDSQISD
DDDDNDHDVN DDEYSTSDPS YGECIENKHY NESFDDSGHD NNDDGEGSEE EDYGTDKRDL
METFENNKEI DFELKNSGSK ANSSNQTDEF KRYVPPHLRI KNLSQNQNEH LQRLSCKMKG
LLNRLNERNM SIISNEIEHI YMQNSRNDVN KILSNHILAS CVSVSMMPDK LLMEHIMLLA
ILSSHVGIDV AAFFIERLAE LFDELHHHNK GSSGLGKECF NVVALFAHLC NFKDIVQQLV
NSFSEKDIEL LLLLLKLVGA EIRRGDPSAL KEIILQIQAK ASSTPMLIDD SRVCFMLEII
TKLRNNNLRK IPGYDCSQLE HLRKTLHSLT RESGCLVSNQ LKVSLEDLLN VKLKGRWWTV
GYALPQTPQI NTMEVFSTKL QNTKLLELAR KQRMNTDVRK NVFLIMMTSE DYIDAFEKLM
RLNLKDVQTR EVIHVLIDCC IQEKMYNPYY AYLGQKFCEN SRSYQVTFQY SFWDKFKLLS
SLAPHSLDNL LRLICHLFAT RALSLSMLKV VNFMALEKSS VQFFTDLFHH LLSTYSKDAI
RYVFERISVK KELASLCQNL RIFLKHFIGG QKSIILEKQL NLVDGILAVS HKTKL
//