ID A0A384ATJ4_BALAS Unreviewed; 901 AA.
AC A0A384ATJ4;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=Pre-mRNA-splicing factor CWC22 homolog {ECO:0000256|ARBA:ARBA00040488};
DE AltName: Full=Nucampholin homolog {ECO:0000256|ARBA:ARBA00042174};
GN Name=CWC22 {ECO:0000313|RefSeq:XP_007190449.1};
OS Balaenoptera acutorostrata scammoni (North Pacific minke whale)
OS (Balaenoptera davidsoni).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Mysticeti;
OC Balaenopteridae; Balaenoptera.
OX NCBI_TaxID=310752 {ECO:0000313|Proteomes:UP000261681, ECO:0000313|RefSeq:XP_007190449.1};
RN [1] {ECO:0000313|RefSeq:XP_007190449.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_007190449.1};
RG RefSeq;
RL Submitted (JAN-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_007190449.1; XM_007190387.1.
DR AlphaFoldDB; A0A384ATJ4; -.
DR STRING; 310752.A0A384ATJ4; -.
DR KEGG; bacu:103011404; -.
DR CTD; 57703; -.
DR InParanoid; A0A384ATJ4; -.
DR OrthoDB; 1115942at2759; -.
DR Proteomes; UP000261681; Unplaced.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF3; PRE-MRNA-SPLICING FACTOR CWC22 HOMOLOG; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000261681}.
FT DOMAIN 452..568
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 402..442
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 652..901
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 30..87
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 409..437
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 660..710
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 732..776
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 790..901
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 901 AA; 104782 MW; 8945AD3152F45D96 CRC64;
MKSSVAQIKH SSSHDRRENY NSYQRTSSPE DRYMEQERSP RDRDYFDYSR SDYEHSRRGR
SYDDSMESRS RDREKRRERD VDRKRSRKSP SPGRRSPEPL VTQSSSAQDE PTAKKKKEEL
DPLLTRTGGA YIPPAKLRMM QEQITDKNSL AYQRMSWEAL KKSINGLINK VNISNIGIII
QELLQENIVR GRGLLSRSVL QAQSASPIFT HVYAALVAII NSKFPQIGEL ILKRLILNFR
KGYRRNDKPL CLTASKFVAH LINQNVAHEV LCLEMLTLLL ERPTDDSVEV AIGFLKECGL
KLTQVSPRGI NAIFERLRNI LHESEIDKRV QYMIEVMFAV RKDGFKDHPV VLEGLDLVEE
DDQFTHMLPL EDDYNPEDVL NVFKMDPNFM ENEEKYKTIK KEILDEGDSD SNTDQDAGSS
EEEEEEEEEE GEEDEEGQTV TIHDKTEINL VSFRRTIYLA IQSSLDFEEC AHKLLKMEFP
EGQTKELCNM ILDCCAQQRT YEKFFGLLAG RFCMLKKEYM ESFESIFKEQ YDTIHRLETN
KLRNVAKMFA HLLYTDSLPW SVLECIKLSE ETTTSSSRIF VKIFFQELCE YMGLPKLNGR
LKDETLQPFF EGLLPRDNPR NTRFAINFFT SIGLGGLTDE LREHLKNTPK VIVAQKPDVE
PNKSSPSSSS SASSSSESDS SASGSDSSDS SSESSSEESD SSSTGSQSSA SDKDVRKKGQ
GNHRRKEVNK LIRKQHTNDR KQEQRRPEQR HQETRTERER RSEKPRDRDS RDPVTKYASD
RGLPSERNSY SRAMKDREQE MYMDLEHNLG DPKKKRGERR NSFSENERKY RNKDSENFRR
KDRSKSREKD KKHSASRSDE DRYQNGAERR WEKSSQYSEQ SRESKKNQDR RREKSPTKQK
K
//