ID E3JWZ8_PUCGT Unreviewed; 877 AA.
AC E3JWZ8;
DT 11-JAN-2011, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 2.
DT 24-JAN-2024, entry version 65.
DE RecName: Full=Pre-mRNA-processing protein prp40 {ECO:0008006|Google:ProtNLM};
GN ORFNames=PGTG_02034 {ECO:0000313|EMBL:EFP76573.2};
OS Puccinia graminis f. sp. tritici (strain CRL 75-36-700-3 / race SCCL)
OS (Black stem rust fungus).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina;
OC Pucciniomycetes; Pucciniales; Pucciniaceae; Puccinia.
OX NCBI_TaxID=418459 {ECO:0000313|EMBL:EFP76573.2, ECO:0000313|Proteomes:UP000008783};
RN [1]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CRL 75-36-700-3;
RG The Broad Institute Genome Sequencing Platform;
RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Cuomo C., Jaffe D.,
RA Butler J., Alvarez P., Gnerre S., Grabherr M., Mauceli E., Brockman W.,
RA Young S., LaButti K., Sykes S., DeCaprio D., Crawford M., Koehrsen M.,
RA Engels R., Montgomery P., Pearson M., Howarth C., Larson L., White J.,
RA Zeng Q., Kodira C., Yandava C., Alvarado L., O'Leary S., Szabo L., Dean R.,
RA Schein J.;
RT "The Genome Sequence of Puccinia graminis f. sp. tritici Strain CRL 75-36-
RT 700-3.";
RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000008783}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CRL 75-36-700-3 / race SCCL
RC {ECO:0000313|Proteomes:UP000008783};
RX PubMed=21536894; DOI=10.1073/pnas.1019315108;
RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E.,
RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., Cantarel B.L.,
RA Chiu R., Coutinho P.M., Feau N., Field M., Frey P., Gelhaye E.,
RA Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., Kuees U.,
RA Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., Murat C.,
RA Pangilinan J.L., Park R., Pearson M., Quesneville H., Rouhier N.,
RA Sakthikumar S., Salamov A.A., Schmutz J., Selles B., Shapiro H.,
RA Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., Rouze P.,
RA Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C.,
RA Grigoriev I.V., Szabo L.J., Martin F.;
RT "Obligate biotrophy features unraveled by the genomic analysis of rust
RT fungi.";
RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS178266; EFP76573.2; -; Genomic_DNA.
DR RefSeq; XP_003320992.2; XM_003320944.2.
DR AlphaFoldDB; E3JWZ8; -.
DR STRING; 418459.E3JWZ8; -.
DR EnsemblFungi; EFP76573; EFP76573; PGTG_02034.
DR GeneID; 10528430; -.
DR KEGG; pgr:PGTG_02034; -.
DR VEuPathDB; FungiDB:PGTG_02034; -.
DR eggNOG; KOG0152; Eukaryota.
DR HOGENOM; CLU_005825_1_1_1; -.
DR InParanoid; E3JWZ8; -.
DR OrthoDB; 25674at2759; -.
DR Proteomes; UP000008783; Unassembled WGS sequence.
DR GO; GO:0005685; C:U1 snRNP; IBA:GO_Central.
DR GO; GO:0071004; C:U2-type prespliceosome; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IBA:GO_Central.
DR CDD; cd00201; WW; 1.
DR Gene3D; 2.20.70.10; -; 2.
DR Gene3D; 1.10.10.440; FF domain; 3.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR039726; Prp40-like.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR11864; PRE-MRNA-PROCESSING PROTEIN PRP40; 1.
DR PANTHER; PTHR11864:SF0; PRP40 PRE-MRNA PROCESSING FACTOR 40 HOMOLOG A (YEAST); 1.
DR Pfam; PF01846; FF; 2.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00441; FF; 3.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF81698; FF domain; 3.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS51676; FF; 1.
DR PROSITE; PS01159; WW_DOMAIN_1; 2.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000008783}.
FT DOMAIN 1..32
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 40..73
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 409..469
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 23..52
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 80..148
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 605..877
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 380..407
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 84..114
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 130..148
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 605..632
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 642..713
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 726..783
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 790..870
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 877 AA; 101948 MW; 9F86D4AAFD90CD22 CRC64;
MASAWTEHRS PTGRLYWFNA QTGTSSWERP EALKTPAERA LASTPWKEYQ TAEGRKYWHH
TTTKETTWTL PDAVREAIEK AAASAPPPAP PASSQPVSSA PGPPPAPMHP TFVPASTPNP
AATPATPAAA PAGTTPTTNL PPRPVTSILH SAPVTNHAPV PMPDFKSPED AERAFIGLLR
LKGVTPSWTW EQTMRDIITE PLYKALDTLA ARKAAWEKFI DSERKREKEN REKNIARVRA
SWDAGLDGLS EEKTIVDDQG SEVKLPGAPP RLWWTWDRLK LEVERRAPEV WKLCRDDEER
RVLWEDYLTE LRQRDTAAAN QLRGRQQEKL TSLLRAHQEK LNLPGEFEMI QWRVAQEAIL
QSEDFQNDED LRKMDDLDIL IVFEEEIKRA EKETMELKAK QKDEKRRSYR KTRAAYIKLL
HELKLSGQIH ADTMWKEIYP VLKDDERYQN MLGITGSSPL ELFWDVIDDL QLELEDKQKV
VEDLLEERNK KVGETTEFDE FLTWLPNDME PHKLDRPMLK QIFHMLVDYA VRTAKEEKRR
AEKRLRNQIE DLRYALKKLS PPVKLDTPYE EALERFSHLS EFKSLEGQDE GRKEAFNRYM
ERLKEKASVE DKRSRRKEEE SYRSDSRKKG SLAQLSDNES VNSASKRRRK DPLEDGKYHR
HQSPRTNRLH EDISSPRGNG EHREDKELDK DHERSLGRDR DPENDRERDD NPNADQATTG
QAPEGDNEKS RSRDRERRRD SEREKTEERS KDRADNRDRP SSRLSADDRD RRRKEYDLDR
HGSRRHRSSR RGDRDHDDDS RHRGSRHDED DGQSREKRSE NGLSDKRARS QDRPESRKKS
DTVPSDQERD SKRPKTVQPQ PEKPAEKSDG EEGELEG
//