ID G8Y2S9_PICSO Unreviewed; 652 AA.
AC G8Y2S9;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2012, sequence version 1.
DT 27-MAR-2024, entry version 60.
DE SubName: Full=Piso0_005739 protein {ECO:0000313|EMBL:CCE86090.1};
GN Name=Piso0_005739 {ECO:0000313|EMBL:CCE86090.1};
GN ORFNames=GNLVRS01_PISO0M21286g {ECO:0000313|EMBL:CCE86090.1};
OS Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC
OS 10061 / NRRL Y-12695) (Hybrid yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Debaryomycetaceae; Millerozyma.
OX NCBI_TaxID=559304 {ECO:0000313|EMBL:CCE86090.1, ECO:0000313|Proteomes:UP000005222};
RN [1] {ECO:0000313|Proteomes:UP000005222}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL
RC Y-12695 {ECO:0000313|Proteomes:UP000005222};
RX DOI=10.1534/g3.111.000745;
RA Leh Louis V., Despons L., Friedrich A., Martin T., Durrens P.,
RA Casaregola S., Neuveglise C., Fairhead C., Marck C., Cruz J.A.,
RA Straub M.L., Kugler V., Sacerdot C., Uzunov Z., Thierry A., Weiss S.,
RA Bleykasten C., De Montigny J., Jacques N., Jung P., Lemaire M., Mallet S.,
RA Morel G., Richard G.F., Sarkar A., Savel G., Schacherer J., Seret M.L.,
RA Talla E., Samson G., Jubin C., Poulain J., Vacherie B., Barbe V.,
RA Pelletier E., Sherman D.J., Westhof E., Weissenbach J., Baret P.V.,
RA Wincker P., Gaillardin C., Dujon B., Souciet J.L.;
RT "Pichia sorbitophila, an interspecies yeast hybrid reveals early steps of
RT genome resolution following polyploidization.";
RL G3 (Bethesda) 2:299-311(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FO082047; CCE86090.1; -; Genomic_DNA.
DR AlphaFoldDB; G8Y2S9; -.
DR STRING; 559304.G8Y2S9; -.
DR eggNOG; KOG0152; Eukaryota.
DR HOGENOM; CLU_005825_1_2_1; -.
DR InParanoid; G8Y2S9; -.
DR OMA; NEPIYKH; -.
DR Proteomes; UP000005222; Chromosome M.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00201; WW; 2.
DR Gene3D; 2.20.70.10; -; 2.
DR Gene3D; 1.10.10.440; FF domain; 1.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR039726; Prp40-like.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR11864; PRE-MRNA-PROCESSING PROTEIN PRP40; 1.
DR PANTHER; PTHR11864:SF0; PRP40 PRE-MRNA PROCESSING FACTOR 40 HOMOLOG A (YEAST); 1.
DR Pfam; PF01846; FF; 2.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00441; FF; 2.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF81698; FF domain; 2.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS51676; FF; 2.
DR PROSITE; PS01159; WW_DOMAIN_1; 2.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000005222}.
FT DOMAIN 6..33
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 38..65
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 127..181
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 349..408
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 64..110
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 610..652
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 78..102
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 610..639
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 652 AA; 76172 MW; D736DCD1C1FA667F CRC64;
MSSNVWKEAV DDHGRTYFYN PITNKTSWTR PADSTGKWKT YYTDDGKPYY HNVETGETTW
DIPTDLENTA PSEQQAADDI DNGDEYPAEV SGEDIEAEED EPEKEKELAK QDIKNRELIE
PAHFESFKDA ENAFVEMLRK NGVDSTWSFQ KVMSTFIKEP LYWAIPDTLD RQKLYEEYLV
QKLKEDMSNK SAIINNFEKN FIDVLQRYEK EGNLNFHTRW VTVKQLLIKE ENPIFKNSVL
SDDQVSEIFY KFTSELKQEH DSRVQKEKEQ ALKELKAYLT QINPELVEKC SDWTQLYETL
MVDPRFKANK HFIILNKLDI LELYRNEIHP LLLSNLKSEI AAVQKRNYRS DRKARQNFKD
FLLNKVTINA NTLFKDVFPI MENEDSFIDL CGRNGSNALD LFWDVVDEKY QLMKLKNNMI
DDLLMTLHKQ DSNEFDYNKS LATKKDFINT LLRAKMEKML TFDFNDFNLS EDDPELSAIY
DNLKRRQDLN RERAKTRIVK GIKTLTESLA HWISDNYGNS EKIKIFDDDT DLPDIVCIKK
SANLANFSLT SKNKDYNNIL HPFLIDSDIY KKLEESIMSP ELEKSVSLSS VVESSIEIFI
SLLNGDRKTD SNLTGRKRER SESSNIDNKR QKHDSTSGRQ TDTGKNPVLL NY
//