ID A0A0L7LWB8_PLAF4 Unreviewed; 3135 AA.
AC A0A0L7LWB8;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 13-SEP-2023, entry version 31.
DE RecName: Full=Pre-mRNA-processing-splicing factor 8 {ECO:0008006|Google:ProtNLM};
GN ORFNames=PFDG_00265 {ECO:0000313|EMBL:KOB84899.1};
OS Plasmodium falciparum (isolate Dd2).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Laverania).
OX NCBI_TaxID=57267 {ECO:0000313|EMBL:KOB84899.1, ECO:0000313|Proteomes:UP000054282};
RN [1] {ECO:0000313|Proteomes:UP000054282}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Sequencing Platform;
RA Volkman S.K., Neafsey D.E., Dash A.P., Chitnis C.E., Hartl D.L.,
RA Young S.K., Zeng Q., Koehrsen M., Alvarado L., Berlin A., Borenstein D.,
RA Chapman S.B., Chen Z., Engels R., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E.R., Heiman D.I., Howarth C., Jen D.,
RA Larson L., Mehta T., Neiman D., Park D., Pearson M., Roberts A., Saif S.,
RA Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Walk T., White J.,
RA Yandava C., Haas B., Henn M.R., Nusbaum C., Birren B.;
RT "Annotation of Plasmodium falciparum Dd2.";
RL Submitted (SEP-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000054282}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Sequencing Platform;
RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Henn M., Jaffe D.,
RA Butler J., Alvarez P., Gnerre S., Grabherr M., Kleber M., Mauceli E.,
RA Brockman W., MacCallum I.A., Rounsley S., Young S., LaButti K.,
RA Pushparaj V., DeCaprio D., Crawford M., Koehrsen M., Engels R.,
RA Montgomery P., Pearson M., Howarth C., Larson L., Luoma S., White J.,
RA Kodira C., Zeng Q., O'Leary S., Yandava C., Alvarado L., Wirth D.,
RA Volkman S., Hartl D.;
RT "The genome sequence of Plasmodium falciparum Dd2.";
RL Submitted (SEP-2006) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS016073; KOB84899.1; -; Genomic_DNA.
DR EnsemblProtists; KOB84899; KOB84899; PFDG_00265.
DR VEuPathDB; PlasmoDB:PfDd2_040010800; -.
DR OMA; ANKWNTS; -.
DR Proteomes; UP000054282; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:InterPro.
DR GO; GO:0030623; F:U5 snRNA binding; IEA:InterPro.
DR GO; GO:0017070; F:U6 snRNA binding; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd08056; MPN_PRP8; 1.
DR CDD; cd13838; RNase_H_like_Prp8_IV; 1.
DR Gene3D; 1.20.80.40; -; 1.
DR Gene3D; 3.30.420.230; -; 1.
DR Gene3D; 3.90.1570.40; -; 1.
DR Gene3D; 3.40.140.10; Cytidine Deaminase, domain 2; 1.
DR Gene3D; 3.30.43.40; Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding domain; 1.
DR InterPro; IPR012591; PRO8NT.
DR InterPro; IPR012592; PROCN.
DR InterPro; IPR012984; PROCT.
DR InterPro; IPR027652; PRP8.
DR InterPro; IPR021983; PRP8_domainIV.
DR InterPro; IPR043173; Prp8_domainIV_fingers.
DR InterPro; IPR043172; Prp8_domainIV_palm.
DR InterPro; IPR019581; Prp8_U5-snRNA-bd.
DR InterPro; IPR042516; Prp8_U5-snRNA-bd_sf.
DR InterPro; IPR019580; Prp8_U6-snRNA-bd.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR019582; RRM_spliceosomal_PrP8.
DR PANTHER; PTHR11140; PRE-MRNA SPLICING FACTOR PRP8; 1.
DR PANTHER; PTHR11140:SF0; PRE-MRNA-PROCESSING-SPLICING FACTOR 8; 1.
DR Pfam; PF08082; PRO8NT; 1.
DR Pfam; PF08083; PROCN; 1.
DR Pfam; PF08084; PROCT; 1.
DR Pfam; PF12134; PRP8_domainIV; 1.
DR Pfam; PF10598; RRM_4; 1.
DR Pfam; PF10597; U5_2-snRNA_bdg; 1.
DR Pfam; PF10596; U6-snRNA_bdg; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000054282};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884}.
FT DOMAIN 403..554
FT /note="PRO8NT"
FT /evidence="ECO:0000259|Pfam:PF08082"
FT DOMAIN 940..1347
FT /note="PROCN"
FT /evidence="ECO:0000259|Pfam:PF08083"
FT DOMAIN 1532..1622
FT /note="RNA recognition motif spliceosomal PrP8"
FT /evidence="ECO:0000259|Pfam:PF10598"
FT DOMAIN 1864..1996
FT /note="Pre-mRNA-processing-splicing factor 8 U5-snRNA-
FT binding"
FT /evidence="ECO:0000259|Pfam:PF10597"
FT DOMAIN 2097..2253
FT /note="Pre-mRNA-processing-splicing factor 8 U6-snRNA-
FT binding"
FT /evidence="ECO:0000259|Pfam:PF10596"
FT DOMAIN 2456..2686
FT /note="PRP8"
FT /evidence="ECO:0000259|Pfam:PF12134"
FT DOMAIN 3011..3131
FT /note="PROCT"
FT /evidence="ECO:0000259|Pfam:PF08084"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 324..343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 744..910
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1773..1838
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2952..2999
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 329..343
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 758..807
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 809..839
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 840..862
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 870..903
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1773..1829
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2961..2983
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3135 AA; 366324 MW; BF418BD0D83589B0 CRC64;
MSHNGSFEQS SEDNKNEGSD VLTNSTQHLE NNVINNYDDA NKSDELNSSH NVMNDKASVE
NKQDNMCNNI NDIFFDKPDN INNNNNNEKN NMNDINNIPQ NVHNGFINNI GNVPYNNMNA
FPPNMPKLPT NMPFLPPNMP ILPPHLQHMP NVLPHLQNMP NVPPHLASFP NMINLPNLPP
HMHNLPPNMH SLPPHMHNLP PNMHSLPPNM NYIPPGINNY MPNMMNMPPP YMMKMPNMKM
KSNKIINNVS NNVADNVRNS NLYNEEGIQP NNIHNNIHNN NNDHGGQDIN SSPYYLSQGS
YLPNNIKMNN NEIDQLEVNG LSLSSPFNEQ QKKKKDMNNK NKKAKKYHDF EGDEENYNTS
ERDENSMYDS NAFSIIKEKA RKWKMLNSKK YSKKKKFGVV EEKEEMPCEH LRKIVKEHGD
MSNKKYRYDK RVYLGALKYI PHAVFKLLEN IPMPWEQIKN TKVIYHITGA ITFVNETFVV
IDPLYIAQWG TMWIMMRREK RDRKHFKRMR FPPFDDEEPP LDYADNILDI EPLECIQMKL
DKDEDKSVID WFYDSKPLLY NRNHIPGTSY KKYKLSLEQM GVLYRLGNQL FSDFQDDNYF
YLFNLKSFYT AKALNMAIPG GPKFEPLYRD IYEDDEDWNE FNDINKIIIR QQIRTEYKIA
FPYLYNNRPR KIAISKYHSP MCVYIKLEDI DLPPFYFDLI INPIPSYKIR KFNKSSEKKD
SELFDDDFYL TYTRKEIYYY DHGDDDKKKK STSKSRKHSK HSDADDNRYD KGYRKYRKSS
SSYKSFKRDK RKSTNSSNDK DIDEEDYNSG VSSIDNNDNS DTYISSSKYN SNNMSSRTSK
NKDETYEIDS TVENDSHDGS LKKEKNKKKR KNPYNDDNYK GDDKNKSDDD DNYKGDDNND
NNNKYKSDNI SSCKKNKKMI IKHVEYGILP LLHNYPLYTE RTINGIQLYH APYPFNKKCG
YTRRGIDIPL VQSWFKEHIS TKYPVKVRVS YQKLLKCWVL NHLHSKRPKS MKKKYLFRIF
KSTKFFQCTE MDWVEVGLQV CRQGYNMLNL LIHRKNLNYL HLDYNFNLKP VKTLTTKERK
KSRFGNAFHL CREILRLTKS IVDSHVQYRL GNIDAYQLAD GIQYIFAHVG QLTGMYRYKY
RLMRQVRMCK DLKHLIYYRF NTGSVGKGPG CGLWAPLWRV WIFFLRGVIP LLERWLSNLL
ARQFEGRVSK GIAKTVTKQR VESHFDLELR AAVMHDIIDM IPEGLKNNKG KARLILQHLS
EAWRCWKANI PWKVVGLPLP VENIIIRYIK LKADWWVNAT YYNRERIKRG ATVDKTVCKK
NLGRLTRLWL KAEQERQHEY LKDGPYVSGE EAVALYTTAI HWFESRKFTH IPFPPLNYKH
DTKLLILALE KLKETFTVKN RLNQSQREEL GFIEQAYDNP YETLSRIKRH LLTQRAFKEI
SISFLDLYTH LVPVYEVDPL EKITDAYLDQ YLWYEGDLRN LFPNWVKPSD NEPQPLLVYK
MCQGINNLHN IWDTKNNECV VMLQTQFSKI YEKIDLTLLN RLLRLIVDHN IADYITAKNN
TNITFKDMNH INSFGIIRGL QFSSFVFQYY TIIIDLLILG LTRAYDIAGP YNDVNQFLTF
QNVQIETRHP IRLYCRYVDK IWILFKFTNE ESKDLIQKFL TENPDPNNEN IVGYNNKTCW
PRDCRMRKMK HDVNLGRATF WEIQNRIPRS LTSLDWDHYN TFVSVYSKDN PNLLFSIAGF
EVRILPKIRQ LSYGYNGIMY TSYMNEYPRG VGTKDETSKK NGLLHDDEKS KKVGSLKDEV
TKGKSHVDKN EENSDNNKND NKNDSTHANT HDMVGDNNYD GGVKNNFYNS SGGEKNVVVS
SSVKEGTWKL QNEMTKEITA EAYLKVSDNS MKRFENRVRQ ILMSSGSTTF TKIANKWNTT
LIGLMTYFRE AVLDTEELLD LLVKCENKIQ TRIKIGLNSK MPSRFPPVVF YTPKELGGLG
MLSMGHILIP ESDLRYMKQT DNGRITHFRS GLSHEEDQLI PNLYRYISTW ESEFLESQRV
WCEYALKRNE CHNQNKKITL EDLEDSWDKG IPRINTLFQK DRHTLAYDKG WRIRQLFKQY
QIIKSNPFWW TNQRHDGKLW NLNNYRTDMI QALGGVEGIL EHTLFKGTFF PTWEGLFWEK
ASGFEESMKY KKLTNAQRSG LNQIPNRRFT LWWSPTINRA NVYVGFQVQL DLTGIFMHGK
IPTLKISLIQ IFRAHLWQKI HESLVMDICQ VFDLNCDLLD IETVQKETIH PRKSYKMNSS
CADILLFANY KWGISKPSLL TDEDHIFTNN TLGSTSGTNN NIMLNSNMIN SGSNNSSSNN
MNSVSFGSFP YTSNQFWIDI QLRWGDFDSH DIERYSRAKF LDYTTDNLSI YPCLTGVLIG
VDLAYNLYSA YGNWFNNLKP LMQKALQKIV QSNPSLYVLR ERIRKGLQLY SSEPTEPYLN
TQNYNELFSS QTIWFVDDTN VYRVTIHKTF EGNLTTKPIN GAIFILNPKT GQLFLKIIHT
SVWIGQKRLS QLAKWKTAEE VASLIRSLPI EEQPKQIIVT RKGMLDPLEV HLLDFPNIII
KGTELNLPFQ ALLKLNKIGD LILKATQPQM LLFNLYDDWL NSISSFTAFS RLILILRSLH
INPQQTKILL QPNKNIVTTQ PHHIWPSFNN NQWIHLEVQL KDLILNDYSK RNNVHIASLT
QNEIRDILLG MEITPPSIQR QQIAELEKNN LDLMEQQMKV TTSKTTTKHG NEIIVSTLSP
HEQQTFTTKT DWKIRYLANN SLLFRTKNIY VNNNNMSNMS NINTISASAS SHNILNKNGT
NSDNQNSHYH TSINSINDYT YVIAKNLLEK FICISDLKIQ VGGFLFGSSP EDNSYVKEIK
CILIPPQIGN YQSVTLSSYM PSSKYLQNLE LLGWIHTQTT NCSNTNNHLT AYDMVAHFNF
LQECKRQMSK GKKVADASHN DDDVDDYDDD DYNNNEDDYN NNNEDDNINN NSEGGTKRDE
TYKMWDKNKT IILTCSFTPG SCTINAYKLT SDGYSFAKSK KNSSDLYVFP NVNNLYEPVQ
ILLSNVFVGY FLIPDDHIWN YNLMGIKFNN NQKYAPHLDI PQPFYADIHR PNHFLQFSLL
DQRDADEADV ETSFI
//