ID A0A401GKR4_9APHY Unreviewed; 840 AA.
AC A0A401GKR4;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 28-JUN-2023, entry version 11.
DE SubName: Full=Pre-mRNA-splicing factor CWC22 {ECO:0000313|EMBL:GBE82742.1};
GN ORFNames=SCP_0411270 {ECO:0000313|EMBL:GBE82742.1};
OS Sparassis crispa.
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Agaricomycetes;
OC Polyporales; Sparassidaceae; Sparassis.
OX NCBI_TaxID=139825 {ECO:0000313|EMBL:GBE82742.1, ECO:0000313|Proteomes:UP000287166};
RN [1] {ECO:0000313|EMBL:GBE82742.1, ECO:0000313|Proteomes:UP000287166}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=30375506; DOI=10.1038/s41598-018-34415-6;
RA Kiyama R., Furutani Y., Kawaguchi K., Nakanishi T.;
RT "Genome sequence of the cauliflower mushroom Sparassis crispa
RT (Hanabiratake) and its association with beneficial usage.";
RL Sci. Rep. 8:16053-16053(2018).
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBE82742.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFAD01000004; GBE82742.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A401GKR4; -.
DR STRING; 139825.A0A401GKR4; -.
DR InParanoid; A0A401GKR4; -.
DR OrthoDB; 1115942at2759; -.
DR Proteomes; UP000287166; Unassembled WGS sequence.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF3; PRE-MRNA-SPLICING FACTOR CWC22 HOMOLOG; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000287166}.
FT DOMAIN 425..541
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..98
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 389..415
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 632..840
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 53..98
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..413
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 640..663
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 664..686
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 698..776
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 795..840
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 840 AA; 97517 MW; 4CFC38E695A74774 CRC64;
MATAATLTPT TPSEGPYQRR KRSRSRSPEN AEQPPTSRRR SLSPPPKARL VELPKVNDAD
PVRRAERERE VAARMAASEL ERMEKDKSKD KDKESDARAE FAKLTGTRSG GVYMPPARLR
ALQEAASKDK TSAEYQRLSW DALRKSITGI VNRVNIANIK NVVPELFAEN LIRGRGLFAR
SVMKAQSASL PFTPVFAALV AIINTKLPQV GELVLTRLIS QFRRSFKRND KIVCHSTTTF
IGHLVNQGVA HEIIALQILV LLLERPTDDS IEIAVGFTRE VGAFLAENSP KANATVFERF
RAVLNEGSIS QRVQYMIEVL MQVRKDKYKD NPIVPEGLDL VEEDDQITHQ IQLEEELQVQ
EGLNIFKFDP NYLENEEKYK AIKVEILGGS SEEESGSEES SDEDDEEAVE EQAGIEDRTE
TNLLNLRRVI YLTIMNALNY EEAVHKLLKV QIKEGQELEM CNMIIECCSQ ERSYSTFYGL
IGERFCKLNR VWNECFEQAF ENYYTTIHRY ETNRLRNIAR FFGHLFATDS ISWVAMSCIL
LTEDDTTSSS RIFIKIMLTE MTESMGVKTL VERFKDDEVK RACQGMFPME NPKNTRFAIN
YFTSIALGAV TEEMREHLKN APRLIMEQRR AMLEAESSSS DSSSDEDSDP DDDSDSDTEN
DSESDSEDSR RGERRRRSET PPKKQVRGRG DSYSPPPRGS RRFRDDSLSP PRRDRRSPTP
PVRYRNGDRE QERDRDERDD RYRSRRETPP HRRDYSPDRR DRDRDRDRDR RDHARYPPPS
PSPPRRSRYD RRSRSRSLPR RDYDKRDDYR RRRNDSRDDS RERERRRSIS RERYGRDSRR
//