ID A0A835B6S2_9POAL Unreviewed; 580 AA.
AC A0A835B6S2;
DT 29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 1.
DT 05-FEB-2025, entry version 9.
DE RecName: Full=PSP proline-rich domain-containing protein {ECO:0000259|SMART:SM00581};
GN ORFNames=HU200_040206 {ECO:0000313|EMBL:KAF8691804.1};
OS Digitaria exilis.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; PACMAD clade;
OC Panicoideae; Panicodae; Paniceae; Anthephorinae; Digitaria.
OX NCBI_TaxID=1010633 {ECO:0000313|EMBL:KAF8691804.1, ECO:0000313|Proteomes:UP000636709};
RN [1] {ECO:0000313|EMBL:KAF8691804.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF8691804.1};
RA Bennetzen J.L., Chen S., Ma X., Wang X., Yssel A.E.J., Chaluvadi S.R.,
RA Johnson M., Gangashetty P., Hamidou F., Sanogo M.D., Zwaenepoel A.,
RA Wallace J., Van De Peer Y., Van Deynze A.;
RT "Genome sequence and genetic diversity analysis of an under-domesticated
RT orphan crop, white fonio (Digitaria exilis).";
RL Submitted (JUL-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAF8691804.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JACEFO010001963; KAF8691804.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A835B6S2; -.
DR OrthoDB; 10260794at2759; -.
DR Proteomes; UP000636709; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR InterPro; IPR007180; DUF382.
DR InterPro; IPR006568; PSP_pro-rich.
DR InterPro; IPR052584; U2_snRNP_Complex_Component.
DR PANTHER; PTHR12785; SPLICING FACTOR 3B; 1.
DR PANTHER; PTHR12785:SF6; SPLICING FACTOR 3B SUBUNIT 2; 1.
DR Pfam; PF04037; DUF382; 1.
DR Pfam; PF04046; PSP; 1.
DR SMART; SM00581; PSP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000636709}.
FT DOMAIN 299..352
FT /note="PSP proline-rich"
FT /evidence="ECO:0000259|SMART:SM00581"
FT REGION 1..80
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 104..154
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 385..466
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 538..580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 12..22
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 23..33
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 34..44
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..129
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 404..429
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 433..443
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 450..465
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 538..551
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 561..580
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 580 AA; 66166 MW; 449002FB5C637CDA CRC64;
MAAAEVDPAP NPTTDLPNGS SAQDRKKSRE SDRRRRRRKQ KKNKAASNGA GAEPDEEAAP
DSANENADPK PQVEVEVEYV PEKAELDDAL LADFKDIFDK FTFKDSPAAT EDGEKKDEAA
TDAVKKGDGS DSDDDAQEDQ QKKEGGVSNK QKKLQRRMKI AELKQICARP DVVEVWDATA
SDPKLLVYLK SYRNTVPVPR HWCQKRKFLQ GKRGIEKQPF QLPDFIAATG IEKIRQAYIE
KEDSKKLKQK QRERMQPKMG KMDIDYQVLH DAFFKYQTKP KLTSHGDLYY EGKEFEVKLR
ETKPGVLSRE LKEALGMPDG APPPWLINMQ RYGPPPSYPQ LKIPGLNAPI PPGASFGYRP
GEWGKPPVDE HGRPLYGDVF GVLQQDEPNY DDEPVDRSKH WGDLEEEEEE EEDEEEEEEE
PMEDEDMEEG MQSVDTISST PTGVETPDVI DLRKLQRKEP EKQTEKQLYQ VLEQKEERIA
PGTLYGSSHT YVLGAQDKVA PKRVDLLKNQ KADKVDVTIQ PEELEVMDDV LAAKYEEARE
EEKLRNQKED FSDMVAENAS KRKRKQQEKD GKSKKKEFKF
//