ID A0A087YCV1_POEFO Unreviewed; 840 AA.
AC A0A087YCV1;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 27-MAR-2024, entry version 51.
DE SubName: Full=Nucleolar protein with MIF4G domain 1 {ECO:0000313|Ensembl:ENSPFOP00000015854.2};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000015854.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000015854.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01006240; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_016528691.1; XM_016673205.1.
DR AlphaFoldDB; A0A087YCV1; -.
DR STRING; 48698.ENSPFOP00000015854; -.
DR Ensembl; ENSPFOT00000015876.2; ENSPFOP00000015854.2; ENSPFOG00000015771.2.
DR GeneID; 103140460; -.
DR KEGG; pfor:103140460; -.
DR CTD; 64434; -.
DR eggNOG; KOG2141; Eukaryota.
DR GeneTree; ENSGT00940000153458; -.
DR OMA; FMVDILN; -.
DR OrthoDB; 5491616at2759; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0042254; P:ribosome biogenesis; IEA:Ensembl.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000028760}.
FT DOMAIN 633..749
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 32..147
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 196..315
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 32..46
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..68
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 69..108
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 126..147
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 228..281
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 840 AA; 95677 MW; 4F3AB352AD157CF5 CRC64;
MKGGKRKQNK KGNAVLQKYM CAVDEFVKNI SVHEEADRDQ RLRSVKTKSR KELRKEKRRL
KKAKMKSHYE GKKSVSFASD VGEKLEVRFE NKHQTQKKKT DQIKREVTKP LSNKQPGVPK
EKTESSGPKS SSKKGKKINK LQESRKMALL EANEQEDREI KKLERCLGLN KRKNKKSLPQ
SFVTDGLDYI LGVLDSSSSG AGIYDGEFDE DEDMDTAREN FEKLDQDDSL LSDEDGEAED
DIASEESDDA EEEEMGSEED EEEEMGSEED EEEEMGDEND MNESGADAAS GAESEEESDE
EASHPDQRSE IATSIIGKYV PPQLRNITDD KRKAELEKLK RKVKGLLNRL SEANMASICG
QLEELYMSCS RKDMNDTLTD VLLAACVTPT LMPDRLLMEH VLLVSILHHA VGLEVGAHFL
ETVVRRFDEV YKNSSEDKEC DNLVAIVAHL YNFQVVHSVL IFDILKLLVG TFSEKDIELV
LFVLRNVGFA LRKDDALALK ELISDAQRKA SDIGAKFKDQ TRVRFMLETM LALKNNDMRK
IPGYDPEPVE RLRKLQRTLI RSSAAGSDLK LRVSLENILK AEQVGRWWIV GSSWSGAPMI
SKQDDTTSKQ RAAEGQFSQK VLELARKQRM NTEVRRNIFC VLMTSEDYLD AFEKLLRMGL
KDKQEREIVH VLMDCCLQEK TFNAFYAVLG EKFCSQDRRF QMTFQFCLWD KFKELSNLPS
CAFTNLVQLV THFLRKKCLS LSILKVIEFG ELDKSKVRFL RQVLTKLLKE TEPEEIASIF
GRISGIAKLG LLRDGLKLFI SHFLLKNSQS QGPAEHAALL RERAQVATKA METKDTKLKL
//