ID W1Q7M6_OGAPD Unreviewed; 2395 AA.
AC W1Q7M6;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 24-JAN-2024, entry version 46.
DE SubName: Full=Pre-mRNA-processing-splicing factor 8 {ECO:0000313|EMBL:ESW95916.1};
GN ORFNames=HPODL_02561 {ECO:0000313|EMBL:ESW95916.1};
OS Ogataea parapolymorpha (strain ATCC 26012 / BCRC 20466 / JCM 22074 / NRRL
OS Y-7560 / DL-1) (Yeast) (Hansenula polymorpha).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Pichiaceae; Ogataea.
OX NCBI_TaxID=871575 {ECO:0000313|EMBL:ESW95916.1, ECO:0000313|Proteomes:UP000008673};
RN [1] {ECO:0000313|EMBL:ESW95916.1, ECO:0000313|Proteomes:UP000008673}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 26012 / BCRC 20466 / JCM 22074 / NRRL Y-7560 / DL-1
RC {ECO:0000313|Proteomes:UP000008673};
RX PubMed=24279325; DOI=10.1186/1471-2164-14-837;
RA Ravin N.V., Eldarov M.A., Kadnikov V.V., Beletsky A.V., Schneider J.,
RA Mardanova E.S., Smekalova E.M., Zvereva M.I., Dontsova O.A., Mardanov A.V.,
RA Skryabin K.G.;
RT "Genome sequence and analysis of methylotrophic yeast Hansenula polymorpha
RT DL1.";
RL BMC Genomics 14:837-837(2013).
RN [2] {ECO:0000313|Proteomes:UP000008673}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 26012 / BCRC 20466 / JCM 22074 / NRRL Y-7560 / DL-1
RC {ECO:0000313|Proteomes:UP000008673};
RA Ravin N.V., Mardanov A.V., Eldarov M.A., Kadnikov V.V., Beletsky A.V.,
RA Zvereva M.I., Smekalova E.M., Dontsova O.A., Skryabin K.G.;
RT "Genome sequence of the methylotrophic yeast Hansenula polymorpha DL1.";
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ESW95916.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AEOI02000010; ESW95916.1; -; Genomic_DNA.
DR RefSeq; XP_013932346.1; XM_014076871.1.
DR STRING; 871575.W1Q7M6; -.
DR GeneID; 25772013; -.
DR KEGG; opa:HPODL_02561; -.
DR eggNOG; KOG1795; Eukaryota.
DR HOGENOM; CLU_000380_3_0_1; -.
DR OMA; ANKWNTS; -.
DR OrthoDB; 246127at2759; -.
DR Proteomes; UP000008673; Chromosome VII.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0008237; F:metallopeptidase activity; IEA:InterPro.
DR GO; GO:0030623; F:U5 snRNA binding; IEA:InterPro.
DR GO; GO:0017070; F:U6 snRNA binding; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd08056; MPN_PRP8; 1.
DR CDD; cd13838; RNase_H_like_Prp8_IV; 1.
DR Gene3D; 1.20.80.40; -; 1.
DR Gene3D; 3.30.420.230; -; 1.
DR Gene3D; 3.90.1570.40; -; 1.
DR Gene3D; 3.40.140.10; Cytidine Deaminase, domain 2; 1.
DR Gene3D; 3.30.43.40; Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding domain; 1.
DR InterPro; IPR000555; JAMM/MPN+_dom.
DR InterPro; IPR012591; PRO8NT.
DR InterPro; IPR012592; PROCN.
DR InterPro; IPR012984; PROCT.
DR InterPro; IPR027652; PRP8.
DR InterPro; IPR021983; PRP8_domainIV.
DR InterPro; IPR043173; Prp8_domainIV_fingers.
DR InterPro; IPR043172; Prp8_domainIV_palm.
DR InterPro; IPR019581; Prp8_U5-snRNA-bd.
DR InterPro; IPR042516; Prp8_U5-snRNA-bd_sf.
DR InterPro; IPR019580; Prp8_U6-snRNA-bd.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR019582; RRM_spliceosomal_PrP8.
DR PANTHER; PTHR11140; PRE-MRNA SPLICING FACTOR PRP8; 1.
DR PANTHER; PTHR11140:SF0; PRE-MRNA-PROCESSING-SPLICING FACTOR 8; 1.
DR Pfam; PF08082; PRO8NT; 1.
DR Pfam; PF08083; PROCN; 1.
DR Pfam; PF08084; PROCT; 1.
DR Pfam; PF12134; PRP8_domainIV; 1.
DR Pfam; PF10598; RRM_4; 1.
DR Pfam; PF10597; U5_2-snRNA_bdg; 1.
DR Pfam; PF10596; U6-snRNA_bdg; 1.
DR SMART; SM00232; JAB_MPN; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008673};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 2158..2292
FT /note="JAB1/MPN/MOV34 metalloenzyme"
FT /evidence="ECO:0000259|SMART:SM00232"
FT REGION 1..94
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8..22
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 25..39
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..77
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2395 AA; 276488 MW; CBFD296602D99B34 CRC64;
MPPKPPPKPP KRTDEKSSEQ KKAAHPPPPT GPPPGLSRGA SGAPPAKKPK NGTQVPPPPP
GLKTTRTLPP PPPGAPGGIK ASKKESREKE LETRARSWLQ LQKRRYRESN LKSHGVVQTR
KIEMPKEHLR KILNDQGDLS SKKFSQEKRS ILGSLKYMPH AVLKLLENMP QPWEAVKEVR
VIYHQSGAIT FVDEIPRVIE PVYIAQWATM WLQMRREKKD RRHFKRIKFP VFDDEEPPIN
FSENIEDLEP LDAIQMELQD VEDLPVADWI YDEKPLVDDR SRMNGPSYRA WRLDLDIMAT
LYRLSTPLVD DIFDPNFHYL FDNESFFTAK ALNVALPGGP KFEPLQKDID PEHEDFNEFN
SLDRIIFRNP IKSEYRVAFP HLYNSSVRGV QLAWYHHNSV VFSRKEDPEL PAFQFQANYN
PVTPKKEVID DTDFEEDREF EIDVQPFMED QSLEPENTYE AIELLWAPYP FNKRSGRTVR
AEDVALVKSW YLQHAPRDLP VKVRVSYQRL LKTHVANELH KTVPSTQGKA KLLKELKNTK
FFHQTTIDWV EAGLQVCRQG YNMLNLMIHR RGLTYLHLDY NFNLKPTKTL TTKERKKSRF
GNAFHLIREI LRVVKLIVDA HVQYRLGNVD AFQLADGIYY ALNHLGQLTG IYRYKYKVMH
QIRACKDLKH VVYYRFNAVI GKGPGCGFWQ PAWRVWIFFM RGIVPLLERW LGNLLARQFE
GRQSKEVAKT ITKQRVDSYY DLELRAAVLH DILDMIPEGI KQNKSKAILQ HLSEAWRCWK
ANIPWNVPGM PEPIKKIIER YVKAKADGWI SVAHYNRQRI KAGAAVDKAV AKKNLGRLTR
LWVKNEQERQ QNFQKEGPYV TPQEGVSIYM TMVHWLESRK FIPIPFPPVN YKHDTKILVL
ALENLKETYN AKGRLNSQER EELALIEQAY DNPHEFLANI KKTILTQRNF KEVTLEMMDY
YSHIVPVYDV EPLEKIVDAY LDQYLWYEAD KRGLFPNWVK PSDNEIPPLL VYKWCQGIAN
LQNVWDVSEG QCNVLLQTNL NKLAEKVDFT LLNQLLRLIV DSSIADYLTA KNNVGITFKD
MNHVNQYGLI RGLQFSSFIF QYYGLVVDLL LLGLERASEI AGPPQRPNDF LEFTDIQTQC
KSPIRIYSRY VDKVHIFFRF SQSEADELIQ EFLAENPDPN FEHIVGYNNR RCWPRDSRMK
LMRHDVHLGR AVFWEMQGRV PDSIVSIDWE DTLASVYSKD NANILFTMCG FDVRIIPKER
MLEDTSSKEG VWDLYDENSK ESVAKAYLQV SQESVEEFNN RVRQILMTSG SATFTKVASK
WNTTLLSMFA YFREAVISTE PLLNAIVKNE TRIQTRIKLG LNSKMPSRFP PAVFYTPKEL
GGLGMLSASH ILIPASDLRW SKQTDTGITH FRSGLTHDDD RLIPTIYRYI TTWENEFLDS
QRVWSEFAIK RAEAEQQRRR LTYEDLEENW DRGIPRISTL FQKDRQTLAI DKGFRVRKEF
KQFSVSRNNP FWWISDRHDG KLWNLNAYRS DVIQALGGIE TILEHTLFKG TGFESWEGLF
WEKASGFEDT LKFKKLTNAQ RSGLSQIPNR RFTLWWSPTI NRANVYVGFL VQLDLTGIFL
HGKIPTLKIS LIQIFRAHLW QKIHESIVVD LCQVLDSQLD ELQIDSVEKM AIHPRKSYKM
NSSTADILLT SSFQWPCSRP SLLFDTNDQM NAVKSDKFWL DVQLRYGDYD SHDISRYARA
KFLDYTSDAT STYPSPTGLL IAVDLAYNMY DAYGNWFPEL KPLIQNAMKT IMKMNPALYV
LRERIRKGLQ LYQAQPQEAF LSSSNYAELF NNENKLFVDD VNVYRVVTHS TFEGNTAVKC
LNGALFILNP RTGQLFLKII HSSAFQGQKR RTQLSKWKSA EEVAALVRSL PREEQPKQLI
ITRKGIQDPL EVHMLDFPNI QIRPSELHLP FGAALKIDKL LDIVNMAKEP QMVLFNIYDD
WMKTCSSFTA FNRLVVIMRG LEINKERTKL ILRPDSSIET KPHHLWPSLS DEQWRNVETQ
LADLILSDYA SKYNVDINSL TDTEIRDIIL GQDIRAPSAK QQSVADIEGT AEPASQLTAV
KTETTNVHGE KITTVTTTNH EQAKFESRTD WRLRAISSGS LHLRAKKVFV SSGDFADTDS
YAYVMPKNIL SKFIKMGDVR TQVAAYMFGR SPADNSQVKE IISLVVVPQV GDNHRVELPS
SLPSSPYLDG LEPLGWIHTV PAGKSSEGDD CMELLTHCKL SSQFNWNAMS SVINVAFTPG
SVTLSSVSLT PEGYKWGQTH LDALGSMLVA PGYSEEFRKK TPLILTDKLK GYFLVPDIDT
WNYSFIANAW TEDFEFDLKL DNPIPFYHEL HRPLHFTLFD RTETALEAGQ ENVFA
//