ID A0A0B2QEZ4_GLYSO Unreviewed; 969 AA.
AC A0A0B2QEZ4;
DT 04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT 04-MAR-2015, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Protein RRC1 {ECO:0000313|EMBL:RZB41413.1};
DE SubName: Full=U2 snRNP-associated SURP motif-containing protein {ECO:0000313|EMBL:KHN19845.1};
GN ORFNames=D0Y65_055245 {ECO:0000313|EMBL:RZB41413.1}, glysoja_047735
GN {ECO:0000313|EMBL:KHN19845.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:KHN19845.1};
RN [1] {ECO:0000313|EMBL:KHN19845.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Root {ECO:0000313|EMBL:KHN19845.1};
RA Lam H.-M., Qi X., Li M.-W., Liu X., Xie M., Ni M., Xu X.;
RT "Identification of a novel salt tolerance gene in wild soybean by whole-
RT genome sequencing.";
RL Submitted (JUL-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:RZB41413.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB41413.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN658648; KHN19845.1; -; Genomic_DNA.
DR EMBL; QZWG01001011; RZB41413.1; -; Genomic_DNA.
DR OrthoDB; 9814at2759; -.
DR Proteomes; UP000053555; Unassembled WGS sequence.
DR Proteomes; UP000289340; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006396; P:RNA processing; IEA:InterPro.
DR CDD; cd21371; cwf21_RRC1-like; 1.
DR CDD; cd12223; RRM_SR140; 1.
DR Gene3D; 1.25.40.90; -; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 6.10.140.420; -; 1.
DR Gene3D; 1.10.10.790; Surp module; 1.
DR InterPro; IPR006569; CID_dom.
DR InterPro; IPR008942; ENTH_VHS.
DR InterPro; IPR013170; mRNA_splic_Cwf21_dom.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR047491; RRC1-like_cwf21.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR003034; SAP_dom.
DR InterPro; IPR035009; SR140_RRM.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23140; RNA PROCESSING PROTEIN LD23810P; 1.
DR PANTHER; PTHR23140:SF0; U2 SNRNP-ASSOCIATED SURP MOTIF-CONTAINING PROTEIN; 1.
DR Pfam; PF04818; CID; 1.
DR Pfam; PF08312; cwf21; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF01805; Surp; 1.
DR SMART; SM01115; cwf21; 1.
DR SMART; SM00582; RPR; 1.
DR SMART; SM00360; RRM; 1.
DR SMART; SM00648; SWAP; 1.
DR SUPFAM; SSF48464; ENTH/VHS domain; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 1.
DR PROSITE; PS51391; CID; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50800; SAP; 1.
DR PROSITE; PS50128; SURP; 1.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Reference proteome {ECO:0000313|Proteomes:UP000289340};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}.
FT DOMAIN 186..267
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 335..378
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 443..588
FT /note="CID"
FT /evidence="ECO:0000259|PROSITE:PS51391"
FT DOMAIN 637..671
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT REGION 1..181
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 411..443
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 745..823
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 863..969
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 11..35
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 43..57
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 61..80
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 99..169
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 417..443
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 746..776
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 777..792
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 794..808
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 908..962
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 969 AA; 109872 MW; B7166F6569B75443 CRC64;
MSSFSITRKK TPFQKHREEE EAKKKRAEDE TARLYAEFVE SFQGDTTPGS KTFVRGGTIN
PNDKFKDDSE GEKSKDGVSG PKKGSRYVPS FIPPPMATKG KESERKKEEE KPKEKEKGKS
RNIDHFMEEL KHEQEMRERR NQEREHWRDG RHTEHSISSR FDELPDDFDP SGKLPGSFDD
GDPQTTNLYV GNLSPKVDEN FLLRTFGRFG PIASVKIMWP RTEEERRRQR NCGFVAFMNR
ADGQAAKDEM QGVVVYEYEL KIGWGKSVAL PSQALPAPPP GHMAIRSKEG STVILSGPSG
PPVTTVPNQN SELVLTPNVP DIMVTPPEDD HLRHVIDTMA LYVLDGGCAF EQAIMERGRG
NPLFNFLFVL GSKEHTYYVW RLYSFAQGDT LQRWRTEPFI MITGSGRWIP PPLPMSKSPE
HEKEPGPTHA GGRSRRVEPE RTLTDAQRDE FEDMLRALTL ERSQIKEAMG FSLDNADAAG
EVVEVLTESL TLKETPIPTK IARLMLVSDI LHNSSAPVRN ASAYRTKFEA TLPDIMESFN
DLYRSIMGRI TAEALKERVL KVLQVWADWF LFSDAYVNGL RATFLRPGNS GVIPFHSICG
DAPEIEQKTA SEDMVVGGKT NQDAALAMGR GAAMKELMSL PLAELERRCR HNGLSLVGGR
EMMVARLLSL EEAEKQKGFE LDDELKYAHN QVSSGKYSSN QRETSAELDP VGLSAWNHYG
DEDIQSQGRS SVPLAPTLPI PQPKLKAFTK KEKNDPVLPA SKWAREDDES DDEQRSGKNL
GLSYSSSGSE NLDDGLVKAD ESESAADRSF SAHADSGMNE EQRQKLRRLE VALIEYGESL
EERGIKNLEE IEKKVQLHRK RLQVEYGLSD SGEDGQGNRR TSERRDRHDV SRKRHRSRSP
SSSPQPKLSG KDRDREHDLE RDRDRQRDRS HDFDSERGRD RHREKSGSRE RDDHDKDRSR
DRDRRRRAK
//