ID A0A151PCZ9_ALLMI Unreviewed; 681 AA.
AC A0A151PCZ9;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE SubName: Full=Nucleolar MIF4G domain-containing protein 1 {ECO:0000313|EMBL:KYO46932.1};
GN Name=NOM1 {ECO:0000313|EMBL:KYO46932.1};
GN ORFNames=Y1Q_0014500 {ECO:0000313|EMBL:KYO46932.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO46932.1};
RN [1] {ECO:0000313|EMBL:KYO46932.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO46932.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO46932.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03000487; KYO46932.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151PCZ9; -.
DR STRING; 8496.A0A151PCZ9; -.
DR eggNOG; KOG2141; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR004875; DDE_SF_endonuclease_dom.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF03184; DDE_1; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT DOMAIN 474..590
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 115..173
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 121..157
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 681 AA; 77670 MW; CF14CC6B23104B54 CRC64;
MDQGVIKTFK SYYTRRLYRQ ALDTIDAGRV ENLTQFWKGY ILDCISIIQE AWEEVRPSTL
NACWHELWPE IVNDFRGFPT IDNQIPDIVR LAKWLGGEGF EDIEDVEVVE LFDSHQQEST
NEELEQLAAS SSDNGGTREP SPAGSLAEQT NLCPSPTKYV PPQARGAGER MDDKKREELL
RLKKTVNGLV NRLSEPNMAS ISGQLDELYM ANSRKDMNET LTDVLMNACV AATAMPARLM
MEHVLLVSIL HHTVGIEVGA HFLEAVVKKF DELYKSEAEG KECENLFILI AHLYNFHVVH
SLLIFDILKK LVSTFTEKDI ELILLLLKNV GFSLRKDDAS ALKELISEAQ NKAGAVGKKF
QDQSRVRFML ETMLALKNND MRKIPGYDPE PVEKLRKLQR PLVHNSGSGR ETQLRVSLES
LLNADQVGRW WIVGSSWSGA PMIDNANKTQ QQLPVGKVSS KILELARKQR MNTDIRKNIF
CVLMTGEDFL DAFEKLLKLG LKDQQEREIV HVLLDCCLQE KTYNPFYAYL AAKFCEYDRR
FQMTFQFSFW DKIRDLGNLS PTAFSNLVCL LVHVLKTKSL SISVLKVIEF SELDKPKVRF
LRQVLSMLLI KTDSEELSNI FGRLSDNPKL GMLREGLKLF LSHFLLKNVQ AHTTAEEASL
LKDRVEFVSR TLQAKESRLK L
//