ID K7G4W3_PELSI Unreviewed; 2335 AA.
AC K7G4W3;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=Pre-mRNA processing factor 8 {ECO:0000313|Ensembl:ENSPSIP00000015324.1};
GN Name=PRPF8 {ECO:0000313|Ensembl:ENSPSIP00000015324.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000015324.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000015324.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01112957; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01112958; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01112959; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 13735.ENSPSIP00000015324; -.
DR Ensembl; ENSPSIT00000015396.1; ENSPSIP00000015324.1; ENSPSIG00000013393.1.
DR eggNOG; KOG1795; Eukaryota.
DR GeneTree; ENSGT00390000015210; -.
DR HOGENOM; CLU_000380_3_0_1; -.
DR OMA; ANKWNTS; -.
DR TreeFam; TF105613; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0071006; C:U2-type catalytic step 1 spliceosome; IEA:Ensembl.
DR GO; GO:0071007; C:U2-type catalytic step 2 spliceosome; IEA:Ensembl.
DR GO; GO:0071005; C:U2-type precatalytic spliceosome; IEA:Ensembl.
DR GO; GO:0046540; C:U4/U6 x U5 tri-snRNP complex; IEA:Ensembl.
DR GO; GO:0070530; F:K63-linked polyubiquitin modification-dependent protein binding; IEA:Ensembl.
DR GO; GO:0008237; F:metallopeptidase activity; IEA:InterPro.
DR GO; GO:0030623; F:U5 snRNA binding; IEA:InterPro.
DR GO; GO:0017070; F:U6 snRNA binding; IEA:InterPro.
DR GO; GO:0000244; P:spliceosomal tri-snRNP complex assembly; IEA:Ensembl.
DR CDD; cd08056; MPN_PRP8; 1.
DR CDD; cd13838; RNase_H_like_Prp8_IV; 1.
DR Gene3D; 1.20.80.40; -; 1.
DR Gene3D; 3.30.420.230; -; 1.
DR Gene3D; 3.90.1570.40; -; 1.
DR Gene3D; 3.40.140.10; Cytidine Deaminase, domain 2; 1.
DR Gene3D; 3.30.43.40; Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding domain; 1.
DR InterPro; IPR000555; JAMM/MPN+_dom.
DR InterPro; IPR037518; MPN.
DR InterPro; IPR012591; PRO8NT.
DR InterPro; IPR012592; PROCN.
DR InterPro; IPR012984; PROCT.
DR InterPro; IPR027652; PRP8.
DR InterPro; IPR021983; PRP8_domainIV.
DR InterPro; IPR043173; Prp8_domainIV_fingers.
DR InterPro; IPR043172; Prp8_domainIV_palm.
DR InterPro; IPR019581; Prp8_U5-snRNA-bd.
DR InterPro; IPR042516; Prp8_U5-snRNA-bd_sf.
DR InterPro; IPR019580; Prp8_U6-snRNA-bd.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR019582; RRM_spliceosomal_PrP8.
DR PANTHER; PTHR11140; PRE-MRNA SPLICING FACTOR PRP8; 1.
DR PANTHER; PTHR11140:SF0; PRE-MRNA-PROCESSING-SPLICING FACTOR 8; 1.
DR Pfam; PF01398; JAB; 1.
DR Pfam; PF08082; PRO8NT; 1.
DR Pfam; PF08083; PROCN; 1.
DR Pfam; PF08084; PROCT; 1.
DR Pfam; PF12134; PRP8_domainIV; 1.
DR Pfam; PF10598; RRM_4; 1.
DR Pfam; PF10597; U5_2-snRNA_bdg; 1.
DR Pfam; PF10596; U6-snRNA_bdg; 1.
DR SMART; SM00232; JAB_MPN; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50249; MPN; 1.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 2103..2234
FT /note="MPN"
FT /evidence="ECO:0000259|PROSITE:PS50249"
SQ SEQUENCE 2335 AA; 273603 MW; DE8A84BCFC409A8D CRC64;
MAGVFPYRGG CAPVPSPLAP LPDYMSEEKL QEKARKWQQL QAKRYAEKRK FGFVDAQKED
MPPEHVRKII RDHGDMTNRK FRHDKRVYLG ALKYMPHAVL KLLENMPMPW EQIRDVPVLY
HITGAISFVN EIPWVIEPVY ISQWGSMWIM MRREKRDRRH FKRMRFPPFD DEEPPLDYAD
NILDVEPLEA IQLELDPEED APVLDWFYDH QPLKDNRKYV NGSTYQRWQF TLPMMSTLYR
LANQLLTDLV DDNYFYLFDL KAFFTSKALN MAIPGGPKFE PLVRDINLQD EDWNEFNDIN
KIIIRQPIRT EYKIAFPYLY NNLPHHVHLT WYHTPNVVFI KTEDPDLPAF YFDPLINPIS
HRHSVKSQEP LPDDDEEFEL PEFVEPFLKD TPLYTDNTAN GIALLWAPRP FNLRSGRTRR
ALDIPLVKNW YREHCPAGQP VKVRVSYQKL LKYYVLNALK HRPPKAQKKR YLFRSFKATK
FFQSTKLDWV EVGLQVCRQG YNMLNLLIHR KNLNYLHLDY NFNLKPVKTL TTKERKKSRF
GNAFHLCREV LRLTKLVVDS HVQYRLGNVD AFQLADGLQY IFAHVGQLTG MYRYKYKLMR
QIRMCKDLKH LIYYRFNTGP VGKGPGCGFW APGWRVWLFF MRGITPLLER WLGNLLARQF
EGRHSKGVAK TVTKQRVESH FDLELRAAVM HDILDMMPEG IKQNKARTIL QHLSEAWRCW
KANIPWKVPG LPTPIENMIL RYVKAKADWW TNTAHYNRER IRRGATVDKT VCKKNLGRLT
RLYLKAEQER QHNYLKDGPY ITAEEAVAVY TTTVHWLESR RFSPIPFPPL SYKHDTKLLI
LALERLKEAF SVKSRLNQSQ REELGLIEQA YDNPHEALSR IKRHLLTQRA FKEVGIEFMD
LYSHLVPVYD VEPLEKITDA YLDQYLWYEA DKRRLFPPWI KPADTEPPPL LVYKWCQGIN
NLQDVWETCE GECNVMLESR FEKMYEKIDL TLLNRLLRLI VDHNIADYMT AKNNVVINYK
DMNHTNSYGI IRGLQFASFI VQYYGLVMDL LVLGLHRASE MAGPPQMPND FLSFQDIATE
VAHPIRLFCR YIDRIHIFFR FTADEARDLI QRYLTEHPDP NNENIVGYNN KKCWPRDARM
RLMKHDVNLG RAVFWDIKNR LPRSVTTVQW ENSFVSVYSK DNPNLLFNMC GFECRILPKC
RTSYEEFTHK DGVWNLQNEV TKERTAQCFL RVDDESMQRF HNRVRQILMA SGSTTFTKIV
NKWNTALIGL MTYFREAVVN TQELLDLLVK CENKIQTRIK IGLNSKMPSR FPPVVFYTPK
ELGGLGMLSM GHVLIPQSDL RWSKQTDVGI THFRSGMSHE EDQLIPNLYR YIQPWESEFI
DSQRVWAEYA LKRQEAIAQN RRLTLEDLED SWDRGIPRIN TLFQKDRHTL AYDKGWRVRT
DFKQYQVLKQ NPFWWTHQRH DGKLWNLNNY RTDMIQALGG VEGILEHTLF KGTYFPTWEG
LFWEKASGFE ESMKWKKLTN AQRSGLNQIP NRRFTLWWSP TINRANVYVG FQVQLDLTGI
FMHGKIPTLK ISLIQIFRAH LWQKIHESIV MDLCQVFDQE LDALEIETVQ KETIHPRKSY
KMNSSCADIL LFASYKWNVS RPSLLADSKD VMDSTTTQKY WIDIQLRWGD YDSHDIERYA
RAKFLDYTTD NMSIYPSPTG VLIAIDLAYN LHSAYGNWFP GSKPLIQQAM AKIMKANPAL
YVLRERIRKG LQLYSSEPTE PYLSSQNYGE LFSNQIIWFV DDTNVYRVTI HKTFEGNLTT
KPINGAIFIF NPRTGQLFLK IIHTSVWAGQ KRLGQLAKWK TAEEVAALIR SLPVEEQPKQ
IIVTRKGMLD PLEVHLLDFP NIVIKGSELQ LPFQACLKVE KFGDLILKAT EPQMVLFNLY
DDWLKTISSY TAFSRLILIL RALHVNNDRA KVILKPDKTT ITEPHHIWPT LTDEEWIKVE
VQLKDLILAD YGKKNNVNVA SLTQSEIRDI ILGMEISAPS QQRQQIAEIE KQTKEQSQLT
ATQTRTVNKH GDEIITSTTS NYETQTFSSK TEWRVRAISA ANLHLRTNHI YVSSDDIKET
GYTYILPKNV LKKFICISDL RAQIAGYLYG VSPPDNPQVK EIRCIVMVPQ WGTHQTVHLP
GQLPQHEYLK EMEPLGWIHT QPNESPQLSP QDVTTHAKVM ADNPSWDGEK TIIITCSFTP
GSCTLTAYKL TPSGYEWGRQ NTDKGNNPKG YLPSHYERVQ MLLSDRFLGF FMVPAQGSWN
YNFMGVRHDP NMKYELQLAN PKEFYHEVHR PSHFLNFALL QEGEVYSADR EDLYA
//