ID G1NAJ5_MELGA Unreviewed; 925 AA.
AC G1NAJ5;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 3.
DT 27-MAR-2024, entry version 68.
DE RecName: Full=Pre-mRNA-splicing factor CWC22 homolog {ECO:0000256|ARBA:ARBA00040488};
DE AltName: Full=Nucampholin homolog {ECO:0000256|ARBA:ARBA00042174};
GN Name=CWC22 {ECO:0000313|Ensembl:ENSMGAP00000009548.3};
OS Meleagris gallopavo (Wild turkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Phasianidae;
OC Meleagridinae; Meleagris.
OX NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000009548.3, ECO:0000313|Proteomes:UP000001645};
RN [1] {ECO:0000313|Ensembl:ENSMGAP00000009548.3, ECO:0000313|Proteomes:UP000001645}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20838655; DOI=10.1371/journal.pbio.1000475;
RA Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A.,
RA Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K.,
RA Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C.,
RA Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A.,
RA Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., de Jong P.,
RA Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., Lee M.K., Lee T.,
RA Mane S., Marcais G., Marz M., McElroy A.P., Modise T., Nefedov M.,
RA Notredame C., Paton I.R., Payne W.S., Pertea G., Prickett D., Puiu D.,
RA Qioa D., Raineri E., Ruffier M., Salzberg S.L., Schatz M.C., Scheuring C.,
RA Schmidt C.J., Schroeder S., Searle S.M., Smith E.J., Smith J.,
RA Sonstegard T.S., Stadler P.F., Tafer H., Tu Z.J., Van Tassell C.P.,
RA Vilella A.J., Williams K.P., Yorke J.A., Zhang L., Zhang H.B., Zhang X.,
RA Zhang Y., Reed K.M.;
RT "Multi-platform next-generation sequencing of the domestic turkey
RT (Meleagris gallopavo): genome assembly and analysis.";
RL PLoS Biol. 8:E1000475-E1000475(2010).
RN [2] {ECO:0000313|Ensembl:ENSMGAP00000009548.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_010711839.1; XM_010713537.2.
DR AlphaFoldDB; G1NAJ5; -.
DR Ensembl; ENSMGAT00000010375.3; ENSMGAP00000009548.3; ENSMGAG00000009220.3.
DR GeneID; 100545638; -.
DR KEGG; mgp:100545638; -.
DR CTD; 57703; -.
DR GeneTree; ENSGT00940000153458; -.
DR HOGENOM; CLU_006308_1_0_1; -.
DR InParanoid; G1NAJ5; -.
DR TreeFam; TF300510; -.
DR Proteomes; UP000001645; Chromosome 7.
DR Bgee; ENSMGAG00000009220; Expressed in bursa of Fabricius and 17 other cell types or tissues.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0071006; C:U2-type catalytic step 1 spliceosome; IEA:Ensembl.
DR GO; GO:0071007; C:U2-type catalytic step 2 spliceosome; IEA:Ensembl.
DR GO; GO:0071005; C:U2-type precatalytic spliceosome; IEA:Ensembl.
DR GO; GO:0003723; F:RNA binding; IEA:Ensembl.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:Ensembl.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF3; PRE-MRNA-SPLICING FACTOR CWC22 HOMOLOG; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000001645}.
FT DOMAIN 456..572
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..74
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 88..134
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 412..445
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 657..925
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..74
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 88..102
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 116..130
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..441
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 658..706
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 742..855
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 863..925
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 925 AA; 108652 MW; E4FBADC78A80A08D CRC64;
MKSRVTQINH GSSHEKKEKY SPRHRSLTPE DRDVEHDRSP SPRRRRYSDD SRYDQEYSRR
EYYDDRSSER RMERGRDRYY EKWEERDYDR RRKRRLSSPD HRSPERSTAQ GSNTHEEAAS
KKKKEEVDPI LTRTGGAYIP PAKLRMMQEQ ITDKNSLAYQ RMSWEALKKS INGLVNKVNV
SNIDNIIHEL LQENIVRGRG LLSRSILQAQ GASPIFTHVY AALVAIINSK FPNIGELILK
RLILNFRKGY RRNDKQLCLT SSKFVAHLMN QNVAHEVLCL EMLTLLLERP TDDSIEVAIG
FLKESGLKLT EVSPRGINAI FDRLRHILHE SKIDMRVQYM IEVMFAVRKD GFKDHPIIPE
GLDLVEEEDQ FTHMLPLEDE YNPEDVLNVF KMDPNFMENE EKYKALKKEI LDEGDSESEP
DQEAGSSDEE EDEDEEEDED GQKVTVHDKT EINLVSFRRT IYLAIQSSLD FEECAHKLLK
MDFPESQTKE LCNMILDCCA QQRTYEKFFG LLAGRFCMLK KEYMESFEAI FKEQYDTIHR
LETNKLRNVA KMFAHLLYTD SIPWSVLECI ILSEETTTSS SRIFVKIFFQ ELSEYMGLPN
LNARLKDITL QPFFEGLLPR DNPRNTRFAI NFFTSIGLGG LTDELREHLK NAPKMIMTQK
QDVESSDSSS SSETDSSSDS DSDSSSSSSE SSSSSDSSSS SSDSSDSDAS KAKRRRTQKK
NRESDKASRK KQERKRKSLE KKVGRRWQED KSDSESKSER NHRNMREAHR RDDTSKYHHR
DESDGRDNYE AHRRDDVSKY HHRDESNGRD DYHSGRDRNN ERSKDLENKH SNSKLKKAER
RASFSDDESY RHGSRDNGHR SRKRERSRSG ERSYNKSSPR EEEDDHRYRN GSERLREKYS
HYTDQYRESR KYEDRRRENS PHRRK
//