ID A0A921ZGM4_MANSE Unreviewed; 854 AA.
AC A0A921ZGM4;
DT 22-FEB-2023, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2023, sequence version 1.
DT 28-JAN-2026, entry version 9.
DE RecName: Full=PSP proline-rich domain-containing protein {ECO:0000259|SMART:SM00581};
GN ORFNames=O3G_MSEX010439 {ECO:0000313|EMBL:KAG6457678.1};
OS Manduca sexta (Tobacco hawkmoth) (Tobacco hornworm).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea;
OC Sphingidae; Sphinginae; Sphingini; Manduca.
OX NCBI_TaxID=7130 {ECO:0000313|EMBL:KAG6457678.1, ECO:0000313|Proteomes:UP000791440};
RN [1] {ECO:0000313|EMBL:KAG6457678.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=27522922;
RA Kanost M.R., Arrese E.L., Cao X., Chen Y.R., Chellapilla S.,
RA Goldsmith M.R., Grosse-Wilde E., Heckel D.G., Herndon N., Jiang H.,
RA Papanicolaou A., Qu J., Soulages J.L., Vogel H., Walters J.,
RA Waterhouse R.M., Ahn S.J., Almeida F.C., An C., Aqrawi P.,
RA Bretschneider A., Bryant W.B., Bucks S., Chao H., Chevignon G.,
RA Christen J.M., Clarke D.F., Dittmer N.T., Ferguson L.C.F., Garavelou S.,
RA Gordon K.H.J., Gunaratna R.T., Han Y., Hauser F., He Y., Heidel-Fischer H.,
RA Hirsh A., Hu Y., Jiang H., Kalra D., Klinner C., Konig C., Kovar C.,
RA Kroll A.R., Kuwar S.S., Lee S.L., Lehman R., Li K., Li Z., Liang H.,
RA Lovelace S., Lu Z., Mansfield J.H., McCulloch K.J., Mathew T., Morton B.,
RA Muzny D.M., Neunemann D., Ongeri F., Pauchet Y., Pu L.L., Pyrousis I.,
RA Rao X.J., Redding A., Roesel C., Sanchez-Gracia A., Schaack S., Shukla A.,
RA Tetreau G., Wang Y., Xiong G.H., Traut W., Walsh T.K., Worley K.C., Wu D.,
RA Wu W., Wu Y.Q., Zhang X., Zou Z., Zucker H., Briscoe A.D., Burmester T.,
RA Clem R.J., Feyereisen R., Grimmelikhuijzen C.J.P., Hamodrakas S.J.,
RA Hansson B.S., Huguet E., Jermiin L.S., Lan Q., Lehman H.K., Lorenzen M.,
RA Merzendorfer H., Michalopoulos I., Morton D.B., Muthukrishnan S.,
RA Oakeshott J.G., Palmer W., Park Y., Passarelli A.L., Rozas J.,
RA Schwartz L.M., Smith W., Southgate A., Vilcinskas A., Vogt R., Wang P.,
RA Werren J., Yu X.Q., Zhou J.J., Brown S.J., Scherer S.E., Richards S.,
RA Blissard G.W.;
RT "Multifaceted biological insights from a draft genome sequence of the
RT tobacco hornworm moth, Manduca sexta.";
RL Insect Biochem. Mol. Biol. 76:118-147(2016).
RN [2] {ECO:0000313|EMBL:KAG6457678.1}
RP NUCLEOTIDE SEQUENCE.
RA Kanost M.;
RL Submitted (DEC-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAG6457678.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH668554; KAG6457678.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A921ZGM4; -.
DR Proteomes; UP000791440; Unassembled WGS sequence.
DR GO; GO:0005689; C:U12-type spliceosomal complex; IEA:TreeGrafter.
DR InterPro; IPR007180; DUF382.
DR InterPro; IPR006568; PSP_pro-rich.
DR InterPro; IPR052584; U2_snRNP_Complex_Component.
DR PANTHER; PTHR12785; SPLICING FACTOR 3B; 1.
DR PANTHER; PTHR12785:SF6; SPLICING FACTOR 3B SUBUNIT 2; 1.
DR Pfam; PF04037; DUF382; 1.
DR Pfam; PF04046; PSP; 1.
DR SMART; SM00581; PSP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000791440}.
FT DOMAIN 550..608
FT /note="PSP proline-rich"
FT /evidence="ECO:0000259|SMART:SM00581"
FT REGION 1..180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 209..320
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 360..397
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 658..711
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 809..854
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..18
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 19..44
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..65
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 89..101
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 150..163
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 164..174
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 219..231
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 243..255
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..291
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..320
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 360..377
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 387..397
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 659..674
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 809..829
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 844..854
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 854 AA; 93757 MW; 779C35F9347EA194 CRC64;
MDGPPGTGPP SNNGSIGPPG MPSFPPMPPP PGSMGPPMGP PGTMPPVSTA TGPPNIPPGM
GPPPGMMGMG PPGMGPPGMG PPGMGMGPPM GPPGMGPPRM PPNMMRGKSS FNNQNMDMGP
PGMGPPLGPW EGQGPPGWGR PTPDGPPGWE ENDDDEEEAE GEESSSIQGQ SLPSLLTMKI
DTPEEFKNKA APVGGVVLPK ALEEALAYKD QRQAALGDNE EKEKEADKEP EVPPAPVISR
EYDVEEDEGD SDDENVPTAP KPPVISKQET QMSKSHKNKR KKKKKREAKQ KRKDEAKKKE
DQTSSQDESG KSSDKENKKE VDIEYVQESI QFHELEPMYR QFHRILEAFK ITEKKDDAIK
EEGKDAPKPS KPQEKVQDQF AADEEAAEKN AADEKERLSK RKLKKLSRLS VAELKQLVSR
PDVVEMYDVT ARDPRLLVQL KAHRNTVQVP RHWCYKRKYL QGKRGIEKPP FDLPDFIKKT
GIMEMRASLQ DKEETKTLKA KMRERTRPKL GKIDIDYQKL HDAFFKWQTK PRMTIHGDLY
YEGKEFETRL REKKPGDLSE ELRTALGMPV GPGSHKVPPP WLIAQQRYGP PPSYPNLKIP
GLNAPIPEGC AFGYHAGGWG KPPVDEMGKP LYGDVFGHQS SGQDDIEDQD IDRTMWGELE
SESEEESEEE ESGDEGEKGT DQEMAAGVAT PGEGLVTPLG VSSVPPGVET PDTIELRKKK
LDNDLEGGDT PALYQVLPER RVGLTSGMMA STHVYDISAA NPGKRAVGGG AGIAPSEGSA
APAGVDVALD PSELELEPEA VAARYERHLR DARPRDHEDL SDMLADHVAR QKNKRKRQQT
TDSKQAKKYK EFKF
//