ID A0A2K6SGL1_SAIBB Unreviewed; 696 AA.
AC A0A2K6SGL1;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=SURP and G-patch domain containing 1 {ECO:0000313|Ensembl:ENSSBOP00000006513.1};
GN Name=SUGP1 {ECO:0000313|Ensembl:ENSSBOP00000006513.1};
OS Saimiri boliviensis boliviensis (Bolivian squirrel monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Platyrrhini; Cebidae;
OC Saimiriinae; Saimiri.
OX NCBI_TaxID=39432 {ECO:0000313|Ensembl:ENSSBOP00000006513.1, ECO:0000313|Proteomes:UP000233220};
RN [1] {ECO:0000313|Ensembl:ENSSBOP00000006513.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A2K6SGL1; -.
DR STRING; 39432.ENSSBOP00000006513; -.
DR Ensembl; ENSSBOT00000023257.1; ENSSBOP00000006513.1; ENSSBOG00000020160.1.
DR GeneTree; ENSGT00410000025695; -.
DR Proteomes; UP000233220; Unplaced.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006396; P:RNA processing; IEA:InterPro.
DR Gene3D; 1.10.10.790; Surp module; 2.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR040169; SUGP1/2.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23340; ARGININE/SERINE RICH SPLICING FACTOR SF4/14; 1.
DR PANTHER; PTHR23340:SF3; SURP AND G-PATCH DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF01805; Surp; 2.
DR SMART; SM00443; G_patch; 1.
DR SMART; SM00648; SWAP; 2.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 2.
DR PROSITE; PS50174; G_PATCH; 1.
DR PROSITE; PS50128; SURP; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000233220}.
FT DOMAIN 191..233
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 266..309
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 562..609
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 46..129
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 148..173
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 317..393
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 582..608
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 648..696
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..106
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 344..359
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 668..684
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 696 AA; 77463 MW; 984A35FC838A985B CRC64;
MSLKMDNRDV AGKANRWFGV APPKSGKMNM NILHQEELIA QKKREIEAKM EQKAKQNQVA
SPQPPHPGEM TNAHNSSCVS NKFANDGSFL QQFLKLQKSQ TSTDAPPSTP SVPPSTPTPS
AGKRSLLISR RTGLGLASLP GPVKSYSHAK QLPVAHRPSV FQSPDEDEEE DYEQWLEIKV
SPPEGAETRK VIEKLARFVA EGGPELEKVA MEDYKDNPAF AFLHDKNSRE FLYYRKKVAE
IRKEAQKSQA ASQKVSPPED EEVKNLAEKL ARFIADGGPE VETIALQNNR ENQAFSFLYE
PNSQGYKYYR QKLEEFRKTK AGSTGTLTAP DPSLKRKSPP EALSGSLSPA TTCPASSTPA
PTVTPAPAAP GKPASAATVK RKRKSRWGPE EDKVELPPAE LVQRDVDASP SPLSVQDLKG
LGYEKGKPVG LVGVTELSDA QKKQLKEQQE MQQMYDMIMQ HKRAMQDMQL LWEKAVQQHQ
HGYDSDEEVD SELGTWEHQL RRMEMDKTRE WAEQLTKMGR GKHFIGDFLP PDELEKFMET
FKALKEGREP DYSEYKEFKL TVENIGYQML MKMGWKEGEG LGSEGQGIKN PVNKGTTTVD
GAGFGIDRPA ELSKEDDEYE AFRKRMMLAY RFRPNPLVRI PSWAAFSQSA GEGGGGNRRL
PSFHSPAGPG SPVRPLPPSC IQTPSIPQTC PATSRV
//