ID A0A3Q2EAQ6_CYPVA Unreviewed; 501 AA.
AC A0A3Q2EAQ6;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=G-patch domain and KOW motifs-containing protein {ECO:0000256|RuleBase:RU369096};
OS Cyprinodon variegatus (Sheepshead minnow).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Cyprinodontidae;
OC Cyprinodon.
OX NCBI_TaxID=28743 {ECO:0000313|Ensembl:ENSCVAP00000029366.1, ECO:0000313|Proteomes:UP000265020};
RN [1] {ECO:0000313|Ensembl:ENSCVAP00000029366.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: RNA-binding protein involved in pre-mRNA splicing.
CC {ECO:0000256|RuleBase:RU369096}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU369096}.
CC -!- SIMILARITY: Belongs to the MOS2 family. {ECO:0000256|ARBA:ARBA00010966,
CC ECO:0000256|RuleBase:RU369096}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_015247840.1; XM_015392354.1.
DR AlphaFoldDB; A0A3Q2EAQ6; -.
DR STRING; 28743.ENSCVAP00000029366; -.
DR Ensembl; ENSCVAT00000022267.1; ENSCVAP00000029366.1; ENSCVAG00000017066.1.
DR GeneID; 107095957; -.
DR KEGG; cvg:107095957; -.
DR CTD; 27238; -.
DR GeneTree; ENSGT00390000015154; -.
DR OMA; AHKDKEK; -.
DR OrthoDB; 55448at2759; -.
DR Proteomes; UP000265020; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:UniProtKB-UniRule.
DR CDD; cd13152; KOW_GPKOW_A; 1.
DR CDD; cd13153; KOW_GPKOW_B; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 2.30.30.30; -; 1.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR041993; GPKOW_KOW1.
DR InterPro; IPR041994; GPKOW_KOW2.
DR InterPro; IPR005824; KOW.
DR InterPro; IPR014722; Rib_uL2_dom2.
DR InterPro; IPR045166; Spp2-like.
DR InterPro; IPR026822; Spp2/MOS2_G-patch.
DR InterPro; IPR008991; Translation_prot_SH3-like_sf.
DR PANTHER; PTHR15818; G PATCH AND KOW-CONTAINING; 1.
DR PANTHER; PTHR15818:SF2; G-PATCH DOMAIN AND KOW MOTIFS-CONTAINING PROTEIN; 1.
DR Pfam; PF12656; G-patch_2; 1.
DR SMART; SM00443; G_patch; 1.
DR SMART; SM00739; KOW; 2.
DR SUPFAM; SSF50104; Translation proteins SH3-like domain; 1.
DR PROSITE; PS50174; G_PATCH; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|RuleBase:RU369096};
KW mRNA splicing {ECO:0000256|RuleBase:RU369096};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU369096};
KW Reference proteome {ECO:0000313|Proteomes:UP000265020}.
FT DOMAIN 178..224
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 1..111
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 212..249
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 314..381
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..18
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 53..69
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 86..110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 226..249
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 501 AA; 56408 MW; 05BA1EB6C9C84F4D CRC64;
MASNDEDANA SKSAFADSQG ERKSAAVSFG FTRTVNKFKP VAGGTPTTKV EKDYLTGIDR
NELQSTKPTE KPKELIIPLI QKNRWHPVGQ TRQGNEKSEA PVSDQDSVES QAVKELIEDS
RRQLELWQNG PQPDSNTNLS IPLLMQNKVP DGFEDGDHVK VDLRPESSTE ADYESVPVEA
YGLAMLKGMG WRKEEGIGRT FKQDVKPIEH QLRPKGLGLG ADRSAVKDLE PNKRQRPPKP
GEEQAKEEEL AMAPGGCVLV ESGAHKDLYG KIEGVDADNA RVVVKLAIGG KAVTISQHGV
KLVGKKEYEK YSKDLSRLSK AHKDKEKEKE RQRQRVEEEG KSSSRDKGRD EGKEERKRKH
RESSRDRDKP PVKEAKQRTA PPSWLRRDLK VRFIDKAFKA GKYYNSKMRV EDVLTPLTCQ
CRTEEGGLLD DVKQDMLETI IPKGEYDTVM VVLGEHRGQV GRILQRDKNK CKAMVQLDRY
EEKLFSLDYD SICQYVGASD H
//