ID A0A0P7WRF0_SCLFO Unreviewed; 845 AA.
AC A0A0P7WRF0;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 22-FEB-2023, entry version 14.
DE RecName: Full=PSP proline-rich domain-containing protein {ECO:0000259|SMART:SM00581};
GN ORFNames=Z043_115375 {ECO:0000313|EMBL:KPP66156.1};
OS Scleropages formosus (Asian bonytongue) (Osteoglossum formosum).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Osteoglossocephala;
OC Osteoglossomorpha; Osteoglossiformes; Osteoglossidae; Scleropages.
OX NCBI_TaxID=113540 {ECO:0000313|EMBL:KPP66156.1, ECO:0000313|Proteomes:UP000034805};
RN [1] {ECO:0000313|EMBL:KPP66156.1, ECO:0000313|Proteomes:UP000034805}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Aro1 {ECO:0000313|EMBL:KPP66156.1};
RA Tan M.H., Gan H.M., Croft L.J., Austin C.M.;
RT "The genome of the Asian arowana (Scleropages formosus).";
RL Submitted (AUG-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KPP66156.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARO02005830; KPP66156.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0P7WRF0; -.
DR STRING; 113540.ENSSFOP00015033088; -.
DR Proteomes; UP000034805; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR InterPro; IPR007180; DUF382.
DR InterPro; IPR006568; PSP_pro-rich.
DR PANTHER; PTHR12785; SPLICING FACTOR 3B; 1.
DR PANTHER; PTHR12785:SF6; SPLICING FACTOR 3B SUBUNIT 2; 1.
DR Pfam; PF04037; DUF382; 1.
DR Pfam; PF04046; PSP; 1.
DR SMART; SM00581; PSP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000034805}.
FT DOMAIN 549..607
FT /note="PSP proline-rich"
FT /evidence="ECO:0000259|SMART:SM00581"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 216..248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 276..317
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 354..396
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 643..705
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 794..845
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 218..240
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 299..317
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 643..660
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 661..680
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 845 AA; 95242 MW; 8F0D9D6C7689B9FA CRC64;
MATEGPPGIE PGPTDLGNSV ASLNAWTNGE LQAKLAEIGA PNMGPREELL DTLKTYVLQT
GMIPTKPNLA GNDDTTPLPT QMPGMTLPPP PMPMPLPPGM GMMQAMSVMG APPPPGMHVP
MEEEQLKMAQ QRAAMVLQHE ERDVQQQGES RSMQEQLKEQ ELLEQQKRAA VLLEQERQQE
LVKMQQGGPS ARVPEPVTRP PGVTLPAVPQ LNPMRGMSQT GPPPPGISLV PPPSHRQRVP
PPPGEDAREV WQGEEVGIGP KIPQALEKIL QLKEMRQEQL SDKNRKRRNR KKKNKKKKAQ
TNGEMNKEKA EMEKEKEAAV EIEYITEEPE IYDPNFIFFK RIFEAFKLTD DVKKEKEKEP
EKVEKQETTT VKKKGFEEEK KDSDDSDEEA KRDIPKMSKK KLRRMNRLTV AELKQLVARP
DVVEMHDVTA QEPKLLVHLK ATRNTVPVPR HWCFKRKYLQ GKRGIEKPPF ELPEFIKRTG
IQEMREALQE KEDAKTMKTK MREKVRPKMG KIDIDYQKLH DAFFKWQIKP KLTIHGDLYY
EGKEFETRLK EKKPGDLSDE LRIALGMPVG PNAHKVPPPW LIAMQRYGPP PSYPNLKIPG
LNSPIPESCS FGYHAGGWGK PPVDEMGKPL YGDVFGTNAG DFQAKTEEEE VDRTPWGELE
PSDEESSEEE EEEESDEDKP DETGFFTPAD SGLITPGGFS SVPAGMETPE LIELRKKKIE
EAMDGNETPQ LYTVLPERRT TSVGAAMMAS THIYDVSGAM AGRKAGGGQE SQGVEVALAP
EELELDPMAM TQKYEEHVRE QQAQVEKEDF SDMVAEHAAK QKQKKRKAQP QDNRGSTKKY
KEFKF
//