ID A0A1S2ZNQ4_ERIEU Unreviewed; 893 AA.
AC A0A1S2ZNQ4;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=Pre-mRNA-splicing factor CWC22 homolog {ECO:0000256|ARBA:ARBA00040488};
DE AltName: Full=Nucampholin homolog {ECO:0000256|ARBA:ARBA00042174};
GN Name=CWC22 {ECO:0000313|RefSeq:XP_007522216.1};
OS Erinaceus europaeus (Western European hedgehog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Eulipotyphla; Erinaceidae; Erinaceinae;
OC Erinaceus.
OX NCBI_TaxID=9365 {ECO:0000313|Proteomes:UP000079721, ECO:0000313|RefSeq:XP_007522216.1};
RN [1] {ECO:0000313|RefSeq:XP_007522216.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_007522216.1; XM_007522154.2.
DR AlphaFoldDB; A0A1S2ZNQ4; -.
DR STRING; 9365.ENSEEUP00000004896; -.
DR GeneID; 103112681; -.
DR CTD; 57703; -.
DR eggNOG; KOG2140; Eukaryota.
DR InParanoid; A0A1S2ZNQ4; -.
DR OrthoDB; 1115942at2759; -.
DR Proteomes; UP000079721; Unplaced.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF3; PRE-MRNA-SPLICING FACTOR CWC22 HOMOLOG; 1.
DR Pfam; PF02847; MA3; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000079721}.
FT DOMAIN 446..562
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..120
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 396..435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 646..893
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..116
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 402..429
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 654..703
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 725..772
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 785..835
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 845..893
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 893 AA; 103765 MW; 83AC701EAE1F914E CRC64;
MKSSVGQIKH SSGHDRKENY NSHQKVSSPE DRSAEQERSP RDRDYYDYSR SDYERSRRGC
SYDSSMESRS RDREKHRERD VNRKRSRRTP SPVRRSPEHA QDEPSTKKKK DELDPLLTRT
GGAYIPPAKL RMMQEQITDK NSLAYQRMSW EALKKSINGL INKVNISNIG IIIQELLQEN
IVRGRGLLSR SVLQAQSASP IFTHVYAALV AIINSKFPQT GELILKRLIL NFRKGYRRND
KQLCLTASKF VAHLINQNVA HEVLCLEMLT LLLERPTDDS VEVAIGFLKE CGLKLTQVSP
RGINAIFERL RNILHESEID KRVQYMIEVM FAVRKDGFKD HPVILGGLDL VEEDDQFTHM
LPLEDEYNPE DVLNVFKMDP NFMENEEKYK EIKKEILDEG DSDSNTDQDA GSSEDEDEEE
EEEEGEEDEE GQKVTIHDKT EINLVSFRRT IYLAIQSSLD FEECAHKLLK MEFPESQTKE
LCNMILDCCA QQRTYEKFFG LLAGRFCMLK KEYMESFESI FKEQYDTIHR LETNKLRNVA
KMFAHLLYTD SLPWSVLECI RLSEETTTSS SRIFVKIFFQ ELCEYMGLPK LNARLKDETL
QPFFEGLLPR DNPRNTRFAI NFFTSIGLGG LTDELREHLK NTPKVIVAQK PDIEPDKSSS
SSSSSASSSS ESDSSASDSD SSDSSSESSS EESDSSSSSS QSSSSDKDRR KKRQGKTKNK
VDKLTRKQHK NDKKQEERRP EQRYQETRTE RERRSEKHRD KNSRDTKWRD SITKYSSEQN
NYSGVVKDRD QEMHPDLENK HGGPKKKRSE RRNSFSENEH RHRNKDSENY KRRERSKSRE
KTRRHSGSRS GEDSYQNGAE RRCDKSSRCF ELSRESRKSH DRGREKSPTK QKK
//