ID H0ZGE8_TAEGU Unreviewed; 918 AA.
AC H0ZGE8;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 61.
DE RecName: Full=Pre-mRNA-splicing factor CWC22 homolog {ECO:0000256|ARBA:ARBA00040488};
DE AltName: Full=Nucampholin homolog {ECO:0000256|ARBA:ARBA00042174};
GN Name=CWC22 {ECO:0000313|Ensembl:ENSTGUP00000009660.2};
OS Taeniopygia guttata (Zebra finch) (Poephila guttata).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; Estrildidae;
OC Estrildinae; Taeniopygia.
OX NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000009660.2, ECO:0000313|Proteomes:UP000007754};
RN [1] {ECO:0000313|Ensembl:ENSTGUP00000009660.2, ECO:0000313|Proteomes:UP000007754}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20360741; DOI=10.1038/nature08819;
RA Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W.,
RA Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A.,
RA Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P.,
RA Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., London S.E.,
RA Li Y., Lin Y.C., George J., Sweedler J., Southey B., Gunaratne P.,
RA Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., Itoh Y., Whitney O.,
RA Pfenning A.R., Howard J., Volker M., Skinner B.M., Griffin D.K., Ye L.,
RA McLaren W.M., Flicek P., Quesada V., Velasco G., Lopez-Otin C.,
RA Puente X.S., Olender T., Lancet D., Smit A.F., Hubley R., Konkel M.K.,
RA Walker J.A., Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z.,
RA Eichler E.E., Stapley J., Slate J., Ekblom R., Birkhead T., Burke T.,
RA Burt D., Scharff C., Adam I., Richard H., Sultan M., Soldatov A.,
RA Lehrach H., Edwards S.V., Yang S.P., Li X., Graves T., Fulton L.,
RA Nelson J., Chinwalla A., Hou S., Mardis E.R., Wilson R.K.;
RT "The genome of a songbird.";
RL Nature 464:757-762(2010).
RN [2] {ECO:0000313|Ensembl:ENSTGUP00000009660.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; H0ZGE8; -.
DR STRING; 59729.ENSTGUP00000009660; -.
DR Ensembl; ENSTGUT00000009764.2; ENSTGUP00000009660.2; ENSTGUG00000009340.2.
DR GeneTree; ENSGT00940000153458; -.
DR HOGENOM; CLU_006308_1_0_1; -.
DR InParanoid; H0ZGE8; -.
DR OMA; ILTEDMR; -.
DR OrthoDB; 1115942at2759; -.
DR TreeFam; TF300510; -.
DR Proteomes; UP000007754; Chromosome 7.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0071006; C:U2-type catalytic step 1 spliceosome; IEA:Ensembl.
DR GO; GO:0071007; C:U2-type catalytic step 2 spliceosome; IEA:Ensembl.
DR GO; GO:0071005; C:U2-type precatalytic spliceosome; IEA:Ensembl.
DR GO; GO:0003723; F:RNA binding; IEA:Ensembl.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:Ensembl.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF3; PRE-MRNA-SPLICING FACTOR CWC22 HOMOLOG; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000007754}.
FT DOMAIN 460..576
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..133
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 416..449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 661..918
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..106
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..445
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 661..710
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 746..918
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 918 AA; 107662 MW; E0D806D41C6161B7 CRC64;
MKSRVTQVNH GSSHDRKENY SSRHESLSPE DRRDMERDRS RSPSPRKRRY SDDSRYDEEY
SRREYYDDRT SDGRRMERGR DRHYERWEDR EYDRRKQRRY SSPDRRSPER STGQSSLAND
ETTSKKKKEE VDPILTRTGG AYIPPAKLRM MQEQITDKNS LAYQRMSWEA LKKSINGLVN
KVNVSNIENI IHELLQENIV RGRGLLSRSI LQAQSASPIF THVYAALVAI INSKFPNIGE
LILKRLILNF RKGYRRNDKQ LCLTSSKFVA HLMNQNVAHE VLCLEMLTLL LERPTDDSIE
VAIGFIKESG LKLTEVSPRG INAIFDRLRH ILHESKIDMR VQYMIEVMFA VRKDGFKDHP
IIPEGLDLVE EEDQFTHMLP LEDDYNPEDV LNVFKMDPNF MENEEKYKML KKEILDEGDT
ESEGNQEAGS SEEDEEDDEE EDEDGQKVTV HDKTEINLVS FRRTIYLAIQ SSLDFEECAH
KLLKMDFPES QTKELCNMIL DCCAQQRTYE KFFGLLAGRF CMLKKEYMES FEAIFKEQYD
TIHRLETNKL RNVAKMFAHL LYTDSIPWSV LECIILSEET TTSSSRIFVK IFFQELSEYM
GLPNLNARLK DVTLQPFFEG LMPRDNPRNT RFAINFFTSI GLGGLTDELR EHLKNAPKLI
MTQKQNVESS DSSSSSDTDS SSDSDSDSSS SSSESSSSSD SSSSSDSSSD SDVSKAKRKR
TQKKNRESDK VSRKKQERRR KSLEKKIGRR QQEERSDTES KSERNHRHIR DSHRRDDVSK
YHHRDESNGR DGYHSGKDRN HERSKDPENK HSNSKLKKAE RRASVSDDEN YRHRSKDDGH
RSRKRERSKS RERERGRGSP REEEQEERSR NGSERQRDKH GRHPEQHGDL QQHREPQQHR
ESRRSDDRRR ENSPHRRK
//