ID A0A445ISQ5_GLYSO Unreviewed; 770 AA.
AC A0A445ISQ5;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE SubName: Full=Pre-mRNA-processing protein 40B isoform G {ECO:0000313|EMBL:RZB89116.1};
GN ORFNames=D0Y65_028128 {ECO:0000313|EMBL:RZB89116.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:RZB89116.1, ECO:0000313|Proteomes:UP000289340};
RN [1] {ECO:0000313|EMBL:RZB89116.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB89116.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RZB89116.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QZWG01000010; RZB89116.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A445ISQ5; -.
DR Proteomes; UP000289340; Chromosome 10.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00201; WW; 1.
DR Gene3D; 2.20.70.10; -; 1.
DR Gene3D; 1.10.10.440; FF domain; 5.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR039726; Prp40-like.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR11864:SF33; PRE-MRNA-PROCESSING PROTEIN 40B; 1.
DR PANTHER; PTHR11864; PRE-MRNA-PROCESSING PROTEIN PRP40; 1.
DR Pfam; PF01846; FF; 5.
DR Pfam; PF00397; WW; 1.
DR SMART; SM00441; FF; 5.
DR SMART; SM00456; WW; 1.
DR SUPFAM; SSF81698; FF domain; 5.
DR SUPFAM; SSF51045; WW domain; 1.
DR PROSITE; PS51676; FF; 4.
DR PROSITE; PS50020; WW_DOMAIN_2; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000289340}.
FT DOMAIN 12..45
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 224..278
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 291..346
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 359..413
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 431..494
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 62..138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 167..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 625..770
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 272..304
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 407..446
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 485..512
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 69..122
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 167..183
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 187..213
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 625..720
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 731..770
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 770 AA; 88899 MW; CEFDEFA64EEC4C5C CRC64;
MDFDSIAGEM RVDATTNWKE YTSPDGRKYY YNKITNESKW SVPEELKLAR ELVEKAIVSG
ARPEALLNSH PQPSPTPSAI EATPNADNSS LPSQGEPSSP VSVSPVVTTS ISNLQSEMPS
GPSPSPADAI TGTKVDELEA PLNTVTPSDT SVGSDKAIVT DINTAVTPMN DVDNDSAQAT
LGSADGVSAE DKEDGKNDSI GEKSNDEAAE TKAVEPEPPV YANKMEAKDA FKALLESVNV
GSDWTWDRSM RLIINDKRYG ALKTLGERKQ AFNEYLNQRK KQEAEEKRMK QKKAREDFKK
MLEESTDLTS SARWSKAVSI FENDERFKAV ERDRDRRDMF ESFLEELLNK ERAKVQEERK
RNIMEYKKFL ESCDFIKAST QWRKVQDRLE ADERCSRLEK IDRLEIFQDY LHDLEKEEEE
QKKIQKEELR KTERKNREEF RKLMEEHIAS GILTAKTHWR DYYTKVKDLH AYVAVASNTS
GSTPKDLFED VAEELEKQYH EEKSRIKDTV KLAKITLSST WAFEDFKSAL SKAISTPPIS
DFNLKLVFDE LLERAKEKEE KEAKKRKRLS DDFFHLLHST KDITVSLKWE DCRPHVEDSQ
EFRSIGDESL CKEVFEEYIA QLKEEAKESE RKRKEERAKK EKDREERERR KGKQRKEKEG
GRERGKDEAH KKDKADSDSM ELTEIQTSKE NKRSEDDNRK QRKKLQSPEH EMDKGKTKKS
HGHGSDRKKS RRHSSGHESD EGRHKRHKRD HCREGDLEDG EFGDDHVDRW
//