ID A0A091DGD6_FUKDA Unreviewed; 1495 AA.
AC A0A091DGD6;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 35.
DE SubName: Full=U2 snRNP-associated SURP motif-containing protein {ECO:0000313|EMBL:KFO21856.1};
GN ORFNames=H920_16740 {ECO:0000313|EMBL:KFO21856.1};
OS Fukomys damarensis (Damaraland mole rat) (Cryptomys damarensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Fukomys.
OX NCBI_TaxID=885580 {ECO:0000313|EMBL:KFO21856.1, ECO:0000313|Proteomes:UP000028990};
RN [1] {ECO:0000313|EMBL:KFO21856.1, ECO:0000313|Proteomes:UP000028990}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Liver {ECO:0000313|EMBL:KFO21856.1};
RA Gladyshev V.N., Fang X.;
RT "The Damaraland mole rat (Fukomys damarensis) genome and evolution of
RT African mole rats.";
RL Submitted (NOV-2013) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN124268; KFO21856.1; -; Genomic_DNA.
DR STRING; 885580.ENSFDAP00000004041; -.
DR eggNOG; KOG0151; Eukaryota.
DR Proteomes; UP000028990; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008146; F:sulfotransferase activity; IEA:InterPro.
DR GO; GO:0006396; P:RNA processing; IEA:InterPro.
DR CDD; cd21370; cwf21_SR140; 1.
DR CDD; cd12223; RRM_SR140; 1.
DR Gene3D; 1.25.40.90; -; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 6.10.140.420; -; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 1.10.10.790; Surp module; 1.
DR InterPro; IPR006569; CID_dom.
DR InterPro; IPR008942; ENTH_VHS.
DR InterPro; IPR013170; mRNA_splic_Cwf21_dom.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR047488; SR140_cwf21.
DR InterPro; IPR035009; SR140_RRM.
DR InterPro; IPR000863; Sulfotransferase_dom.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23140; RNA PROCESSING PROTEIN LD23810P; 1.
DR PANTHER; PTHR23140:SF0; U2 SNRNP-ASSOCIATED SURP MOTIF-CONTAINING PROTEIN; 1.
DR Pfam; PF04818; CID; 1.
DR Pfam; PF08312; cwf21; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF00685; Sulfotransfer_1; 1.
DR Pfam; PF01805; Surp; 1.
DR SMART; SM01115; cwf21; 1.
DR SMART; SM00582; RPR; 1.
DR SMART; SM00360; RRM; 1.
DR SMART; SM00648; SWAP; 1.
DR SUPFAM; SSF48464; ENTH/VHS domain; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 1.
DR PROSITE; PS51391; CID; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50128; SURP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000028990};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}.
FT DOMAIN 244..325
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 400..443
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 504..649
FT /note="CID"
FT /evidence="ECO:0000259|PROSITE:PS51391"
FT REGION 1..44
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 57..81
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 111..171
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 185..245
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 754..811
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 825..951
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1097..1124
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 14..33
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..81
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..131
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 135..151
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 185..219
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 755..776
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 777..811
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 825..894
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 916..951
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1495 AA; 170022 MW; 78EE310C3E2BAB4B CRC64;
MDASGPSDSD MPSRTRPKSP RKHNYRNENT RENLCDSPHQ NLSRPLLENK LKAFSIGKMS
TAKRTLSKKE QEELKKKEDE KAAAEIYEEF LAAFEGSDGN KVKTFVRGGV VNAAKEEHET
DEKRGKIYKP SSRFADQKNP PNQSSNERPP SLLVIETKKP PLKKGEKEKK KSNLELFKEE
LKQIQEERDE RHKTKGRLSR FEPPQSDSDG QRRSMDAPSR RNRSSGVLDD YAPGSHDVGD
PSTTNLYLGN INPQMNEEML CQEFGRFGPL ASVKIMWPRT DEERARERNC GFVAFMNRRD
AERALKNLNG KMIMSFEMKL GWGKAVPIPP HPIYIPPSMM EHTLPPPPSG LPFNAQPRER
LKNPNAPMLP PPKNKEDFEK TLSQAIVKVV IPTERNLLAL IHRMIEFVVR EGPMFEAMIM
NREINNPMFR FLFENQTPAH VYYRWKLYSI LQGDSPTKWR TEDFRMFKNG SFWRPPPLNP
YLHGMSEEQE TEAFVEEPSK KGALKEEQRD KLEEILRGLT PRKNDIGDAM VFCLNNAEAA
EEIVDCITES LSILKTPLPK KIARLYLVSD VLYNSSAKVA NASYYRKFFE TKLCQIFSDL
NATYRTIQGH LQSENFKQRV MTCFRAWEDW AIYPEPFLIK LQNIFLGLVN IIEEKETEDV
PDDLDGAPIE EELDGAPLED VDGIPIDATP IDDLDGVPIK SLDDDLDGVP LDATEDSKKN
EPIFKVAPSK WEAVDESELE AQAVTTSKWE LFDQHEESEE EENQNQEEES EDEEDTQSSK
SEEHHLYSNP IKEEMTESKF SKYSEMSEEK RAKLREIELK VMKFQDELES GKRPKKPGQS
FQEQVEHYRD KLLQREKEKE LERERERDKK DKEKLESRSK DKKEKDECTP TRKERKRRHS
TSPSPSRSSS GRRVKSPSPK SERSERSERS HKESSRSRSS HKDSPRDVSK KAKRCFITGE
YSRFVPSSVV SSNAGRCGGT IVQPAILRNL VAKGIDENLG ISGLSLGELA TATSTLDGDM
TRFLVFAAFS PSGIILHDMI NYVPSLSLRC YEWKLQLSRT WDQLRQSRRI SDICSPGRWS
AANPASATGC CAHGDLSPPT RHGRARRRRG AAAPLGNGTR GTRGGRDKRQ LVYVFTTWRS
GSSFFGELFN QNPEVFFLYE PVWHVWQKLY PGDAVSLQGA ARDMLSALYR CDLSVFQLYS
PAGSGGRNLT TLGIFGAATN KVVCSSPLCP AYRKEVVGLV DDRVCKKCPP QRLARFEEEC
RKYRTLVIKG VRVFDVAVLA PLLRDPALDL KVIHLVRDPR AVASSRIRSR HGLIRESLQV
VRSRDPRAHR MPFLEAAGHK LSAKKEGMGG PADYHALGAM EVICNSMAKT LQIALQPPDW
LQGHYLVVRY EDLVGDPVKT LKRVYDFVGL LVSPEMEQFA LNMTSGSGSS SKPFVVSARN
ATQAANAWRT ALTFQQIKQV EEFCYQPMAV LGYERVNSPE EVKDLSKTLL RKPRL
//