ID I3MVI6_ICTTR Unreviewed; 1099 AA.
AC I3MVI6;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 53.
DE SubName: Full=SURP and G-patch domain containing 2 {ECO:0000313|Ensembl:ENSSTOP00000016131.2};
GN Name=SUGP2 {ECO:0000313|Ensembl:ENSSTOP00000016131.2};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000016131.2, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000016131.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01071245; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01071246; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01071247; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 43179.ENSSTOP00000016131; -.
DR Ensembl; ENSSTOT00000028830.2; ENSSTOP00000016131.2; ENSSTOG00000026887.2.
DR Ensembl; ENSSTOT00000039606.1; ENSSTOP00000025326.1; ENSSTOG00000026887.2.
DR Ensembl; ENSSTOT00000039898.1; ENSSTOP00000030412.1; ENSSTOG00000026887.2.
DR eggNOG; KOG0965; Eukaryota.
DR GeneTree; ENSGT00410000025695; -.
DR HOGENOM; CLU_529544_0_0_1; -.
DR TreeFam; TF326321; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0016604; C:nuclear body; IEA:Ensembl.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006396; P:RNA processing; IEA:InterPro.
DR Gene3D; 1.10.10.790; Surp module; 2.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR040169; SUGP1/2.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23340; ARGININE/SERINE RICH SPLICING FACTOR SF4/14; 1.
DR PANTHER; PTHR23340:SF0; SURP AND G-PATCH DOMAIN-CONTAINING PROTEIN 1 ISOFORM X1; 1.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF01805; Surp; 1.
DR SMART; SM00443; G_patch; 1.
DR SMART; SM00648; SWAP; 2.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 2.
DR PROSITE; PS50174; G_PATCH; 1.
DR PROSITE; PS50128; SURP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000005215}.
FT DOMAIN 786..829
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 1028..1074
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 61..91
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 187..207
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 698..780
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 850..976
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1048..1078
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 765..779
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 862..884
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 885..911
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 912..928
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 954..969
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1099 AA; 122770 MW; 241931022588C1B9 CRC64;
MAARRIAQET FDAVLQEKAK RYHVDPTAET VGETLQFKAQ DLLRTIPRSR ADMYDDVHSD
SRYSLSGSVA HSRDAGREGL RSDVFPGPSF RSSNPSIGDD NYFRKECGRD LEFAHADTRD
QVFGHRKLGH FHSQDWKFAL RGSWEQDFGR PVSQESSWSQ EYSFGPSAVL GDLASSRLIE
KECLEKESQD YDVDHPGEAD SVSRSTGQVQ ARGRALNIID QEGTLLGKGD TQGLLTAKGG
VGKLVTLRGV TTKKVPVINR ITSKTQGTNQ IQKATPSPDV TLGTNLRTED IQFPTQKIPL
GLDLKNIRFP RRKMSFDVID KSDVFSRFGI EIIKWAGFHT IKDDVKFSQL FQTLFELETE
TCAKMLASFK CSLKPEHRDF CFFTIKFLKH SALKTPRVDN EFLNMLLDKG AVKTKNCFFE
IIKPFDKYIM RLQDRLLKSV TPLLMACNAY ELSVKMKTLT NPLDLAIALE TTNSLCRKSL
ALLGQTFSLA SSFRQEKILE AVGLQDIAPS PASFPNFEDS TLFGREYIDH LKAWLITSGC
PLQVKKTEAE PAREEEKMIA PMKPEIQARA PSGLNDVPQR ADHKVVDTID QLVTRIVEGT
LSPKERSVLK EDPAYWFLSD ENSLEYKYYK LKLAETQRMS QTLQGTDRKP TSAECAVRAM
LYAQAVRNLK KRLLPWQRRR ILRAQGLRGW KARRATTGTQ TLLSSGTRLK HHGRQAAGSS
QAKSSLPDGN NAAKDCPPDP ARHSPQDPSS EASGPSPRPA EMDVSETPHT SSPSPSADVD
MKTMETAEKL ARFVAQVGPE IEQFSIENST DNPDLWFLHD QNSSAFKFYR KKVFELCPSI
CFTSSPLSLQ ASMGGESADS QESPLDHAER EGELEDEHPQ QEAELESPEV MPEEEEEEED
EEEEDEEDEG GEEALAHRAS RPGIRADKSE GSEGSPPADG LPGEGAEDDP AGTPGLSQAT
SSTCFPRKRI SSKSLKVGMI PAPKRVCLIQ EPQVHEPVRI AYDRPRGRPM SKKKKPKDLD
FAQQKLTDKN LGFQMLQKMG WKEGHGLGSC GKGIREPVSM GTASEGEGLG AEGQEHKEDT
FDVFRQRMMQ MYRHKRANK
//