ID Q4UCE6_THEAN Unreviewed; 2786 AA.
AC Q4UCE6;
DT 05-JUL-2005, integrated into UniProtKB/TrEMBL.
DT 05-JUL-2005, sequence version 1.
DT 24-JAN-2024, entry version 91.
DE SubName: Full=Splicing factor (PRP8 homologue), putative {ECO:0000313|EMBL:CAI75505.1};
GN ORFNames=TA03780 {ECO:0000313|EMBL:CAI75505.1};
OS Theileria annulata.
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC Theileriidae; Theileria.
OX NCBI_TaxID=5874 {ECO:0000313|EMBL:CAI75505.1, ECO:0000313|Proteomes:UP000001950};
RN [1] {ECO:0000313|EMBL:CAI75505.1, ECO:0000313|Proteomes:UP000001950}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Ankara {ECO:0000313|Proteomes:UP000001950};
RX PubMed=15994557; DOI=10.1126/science.1110418;
RA Pain A., Renauld H., Berriman M., Murphy L., Yeats C.A., Weir W.,
RA Kerhornou A., Aslett M., Bishop R., Bouchier C., Cochet M., Coulson R.M.R.,
RA Cronin A., de Villiers E.P., Fraser A., Fosker N., Gardner M., Goble A.,
RA Griffiths-Jones S., Harris D.E., Katzer F., Larke N., Lord A., Maser P.,
RA McKellar S., Mooney P., Morton F., Nene V., O'Neil S., Price C.,
RA Quail M.A., Rabbinowitsch E., Rawlings N.D., Rutter S., Saunders D.,
RA Seeger K., Shah T., Squares R., Squares S., Tivey A., Walker A.R.,
RA Woodward J., Dobbelaere D.A.E., Langsley G., Rajandream M.A., McKeever D.,
RA Shiels B., Tait A., Barrell B.G., Hall N.;
RT "Genome of the host-cell transforming parasite Theileria annulata compared
RT with T. parva.";
RL Science 309:131-133(2005).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CR940352; CAI75505.1; -; Genomic_DNA.
DR RefSeq; XP_954981.1; XM_949888.1.
DR STRING; 5874.Q4UCE6; -.
DR GeneID; 3864613; -.
DR KEGG; tan:TA03780; -.
DR VEuPathDB; PiroplasmaDB:TA03780; -.
DR eggNOG; KOG1795; Eukaryota.
DR InParanoid; Q4UCE6; -.
DR OMA; ANKWNTS; -.
DR OrthoDB; 246127at2759; -.
DR Proteomes; UP000001950; Chromosome 3.
DR GO; GO:0005681; C:spliceosomal complex; IEA:InterPro.
DR GO; GO:0030623; F:U5 snRNA binding; IEA:InterPro.
DR GO; GO:0017070; F:U6 snRNA binding; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd08056; MPN_PRP8; 1.
DR CDD; cd13838; RNase_H_like_Prp8_IV; 1.
DR Gene3D; 1.20.80.40; -; 1.
DR Gene3D; 3.30.420.230; -; 1.
DR Gene3D; 3.90.1570.40; -; 1.
DR Gene3D; 3.40.140.10; Cytidine Deaminase, domain 2; 1.
DR Gene3D; 3.30.43.40; Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding domain; 1.
DR InterPro; IPR012591; PRO8NT.
DR InterPro; IPR012592; PROCN.
DR InterPro; IPR012984; PROCT.
DR InterPro; IPR027652; PRP8.
DR InterPro; IPR021983; PRP8_domainIV.
DR InterPro; IPR043173; Prp8_domainIV_fingers.
DR InterPro; IPR043172; Prp8_domainIV_palm.
DR InterPro; IPR019581; Prp8_U5-snRNA-bd.
DR InterPro; IPR042516; Prp8_U5-snRNA-bd_sf.
DR InterPro; IPR019580; Prp8_U6-snRNA-bd.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR019582; RRM_spliceosomal_PrP8.
DR PANTHER; PTHR11140; PRE-MRNA SPLICING FACTOR PRP8; 1.
DR PANTHER; PTHR11140:SF0; PRE-MRNA-PROCESSING-SPLICING FACTOR 8; 1.
DR Pfam; PF08082; PRO8NT; 1.
DR Pfam; PF08083; PROCN; 1.
DR Pfam; PF08084; PROCT; 1.
DR Pfam; PF12134; PRP8_domainIV; 1.
DR Pfam; PF10598; RRM_4; 1.
DR Pfam; PF10597; U5_2-snRNA_bdg; 1.
DR Pfam; PF10596; U6-snRNA_bdg; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001950};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884}.
FT DOMAIN 311..460
FT /note="PRO8NT"
FT /evidence="ECO:0000259|Pfam:PF08082"
FT DOMAIN 829..1234
FT /note="PROCN"
FT /evidence="ECO:0000259|Pfam:PF08083"
FT DOMAIN 1447..1537
FT /note="RNA recognition motif spliceosomal PrP8"
FT /evidence="ECO:0000259|Pfam:PF10598"
FT DOMAIN 1688..1821
FT /note="Pre-mRNA-processing-splicing factor 8 U5-snRNA-
FT binding"
FT /evidence="ECO:0000259|Pfam:PF10597"
FT DOMAIN 1920..2077
FT /note="Pre-mRNA-processing-splicing factor 8 U6-snRNA-
FT binding"
FT /evidence="ECO:0000259|Pfam:PF10596"
FT DOMAIN 2238..2466
FT /note="PRP8"
FT /evidence="ECO:0000259|Pfam:PF12134"
FT DOMAIN 2707..2778
FT /note="PROCT"
FT /evidence="ECO:0000259|Pfam:PF08084"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 181..265
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 623..745
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1403..1427
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 184..209
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 210..254
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 627..656
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 675..706
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 715..741
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2786 AA; 322257 MW; D0D51D3C138A64B2 CRC64;
MELDSKLESS PLEDSGQDSN INIENINQQN ANTQIPNINQ IANTQLPNQN TQVPNGNLPN
LPFPNVNIPI INTSNIPMMP IPNTVPIPIP NPIPVPNMPM PNTIPNTIPV PMPNMPIPNP
IPMIPIPNPM TNPMGNPMGN PMANPMGNNI PNPIPNPMPN PIMNPLMNML PTNIIPNMPI
MPPPGFNNIN QTTTGKDTKG ATGGKGSGTK STKGKKDTSG KNTKETKETT AKSKDSDTTD
TKDSADKSND TMDGEEGDAV GPSTVTVEND MEVRFHFGDF ERLQEKARKW QKLNTRKFSK
PVSSVSNLSM QMPPEHVRKV IRDHGDMSSR RYRYDKRVYL GALKYVPHAV YKLLENMPMP
WEQVRNVQAL YHVTGAITFV DEIPWVVDPI FLAQWGTMWI MMRREKRDRR HFKRMRFPPF
DDEEPPIDYG ENILDLEPLE AIQMQLDPEE DQSVIEWFYD HKPLQYNRKH INGTSYRRWF
LTLEQMAVLF RLASQLFSDI LDDNYFYLFN LKAFYTAKAL NTAIPGGPKF EPLYRDIDED
EDWNEFNDIS KLIIRQQIRT EYKIAFPYLY NSRPRKVAMT NYHTKLCSYI RHEDPDLPIF
HYDPIINPIP SYTIQYNYVS SMGIKGGKDS RDVRDVRDVR DSRDSRDSRD SRDGVDGKGV
VNGVELNGVN GVNGVMKEHK ERDRDRDRDR DRRRSNSRER HRDKDRHRDR DRSRHRDRRS
NSRDKSTRNK ITKDKDNKYD VTTNGKGANF AAMECTNGKG ANFAAMECTN GKGANFAAME
CTMGKGTNKV APFGASTEDT SLGDTDTVTE EYKLNGIKPL LTRIELETER TGNGISLYWA
PHPFNKRSGM CRRAIDLPIV NTWYREHVPK EYPVKVRVSY QKLLKGWVIS NLHAKKPKGM
KKRRLFKVFR GTKFFQSTEL DWVEVGLQVC RQGYNMLNLL IHRKNLNYLH LDYNFNLKPV
KTLTTKERKK SRFGNAFHLC REILRLTKLV VDSHVQYRLG NVDAYQLADG LQYIFSHVGQ
LTGMYRYKYR LMRQVRMCKD LKHLIYYRFN TGPVGKGPGC GFWICGWRVW CFFLRGILPL
LERWLGNLLA RQFEGRVTKG VAKTVTKQRV ESHFDLELRA AVMHDILDMM PEGIRASKSR
TILQHLSEAW RCWKANIPWK VPQLPSPIEN MILRYVKLKA DWWTNACYYN RERIKRGATV
DKTVCRKNLG RLTRLYLKAE HERQYNYLKD GPYLSGEEAV AIYTTAVHWL ESRKFVHIPF
PPLNYKHDTK ILILSLEQLK EPYASKGRLN QSQREELGLI EQAYDNPHEC LSRIKRHLLT
QRAFKEVTIE FFDMYSHLIP VYDIDPLEKI TDAYLNQYLW YEADNRKLFP NWVKPSDSEP
PPLLVYKICQ GINNFTSIWE TNGSNNGSNN GSNNGLNNGP NNELNNGLNN EPNYLVLLST
KYDKVYEKVD LTLLNRLLRL IVDHNIADYI TAKNNVSISF KDMSHINSFG FIRGLQFSPF
VFMYYSLVLD LLLLGLGRAT EIAGPYNSEN EFLTFPSTEI ELKHPIRMFM RFIDEIYILF
KFNSEEARQL VQRYLTENPD PNNENVVGYN NKNCWPKDCR MRLMKHDVNL GRAAFWEMQS
RLPRSITTLE WSDSFVSVYS KDNPNLLFSL CGFEIRIIKF RTGDFTHSST NSMGTMSGTM
SGGGMVLKES SWRLQNMKTK ELSAIAYLRV SNESMSIFEN RIRQILMSSG STTFTKIANK
WNTALISLMT YFREATIHTN ELLDLLVKCE NKIQTRIKIG LNSKMPSRFP PVVFYSPKEL
GGLGMLSMGH ILIPQSDLRF SKQTDVGITH FRSGMSHEDD QLIPNLYRYI QTWESEFIES
QRVWAEYALK RQEAQQQNRR LTLEDLEDSW DRGIPRINTL FQKDRHTLAY DKGWRVTLYF
RKYQVLRFNP FWWTHQRHDG KLWNLNNYRT DMIQALGGVE AILEHTLFKG TYFSTWEGLF
WEKASGFEES MKYKKLTNAQ RSGLTQIPNR RFTLWWSPTI NRANVYVGFQ VQLDLTGIFM
HGKLPTLKIS LIQIFRAHLW QKIHESLVMD LCQVLDMELD SLEIETVQKE TIHPRKSYKM
NSSCADILLT SSYKWKFGKP SLLTDTSPIE KLENGTKYWI DVQLRWGDYD SHDIERYSRS
KFLDYTGDSM SIYPCPTGCL IAVDLAYNLH SGYGYWFEGM RELMVRAMNK IMKANPALFV
LRERIRKSLQ LYSSEPTEPY LSSQNMGELF GSQTIWFVDD TNVYRVTIHK TFEGNLTTKP
VNGAIFIFNP KTGQLFLKVI HTSVWAGQKR LSQLSKWKTA EEVVALIRSL PVEEQPKQII
VTRRGMLDPL EVHLVDFPNI VIKGSELQLP YQSIMKLEKF GDLILRATQP EMVLFNLYDD
WLKSISSYTA FSRLILILRA IHVNTERAKC ILKPNKTTIT LPHHVWPNLT DNEWINVEIA
LKDLILADYA KRNSVTVTSL TQTEIRDIIL GMEIMPPDVQ RQQIEENIEV GPKSVTTKTV
NVHGEEIVVT TQSPHEQQVF ASKTDWRNRC LASGTLHLRA KHIYVVPVES EQVIVLPNNL
IKKLISIADL RTQVGAYLYS KVEKNEHTVY SIVCMVLVPQ VGTHKTIVLP KLKPEHEALS
ELVPIGWIFT RPNEGEIEQQ TLEMHQKMIN DFGWDPTAVM TTCTFTPGSC AISARRLVNA
LEKPQNLQLE KVQVLVSDTF KGFFLVPVDG AWNYNFMGAK HSPHMNFQLQ IEVPKPFYDP
IHRPLHFIQF AHTSQEKIFE DDDPLS
//