ID H3CCS4_TETNG Unreviewed; 2230 AA.
AC H3CCS4;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE SubName: Full=Pre-mRNA processing factor 8 {ECO:0000313|Ensembl:ENSTNIP00000006047.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000006047.1, ECO:0000313|Proteomes:UP000007303};
RN [1] {ECO:0000313|Proteomes:UP000007303}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|Ensembl:ENSTNIP00000006047.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 99883.ENSTNIP00000006047; -.
DR Ensembl; ENSTNIT00000006195.1; ENSTNIP00000006047.1; ENSTNIG00000003460.1.
DR GeneTree; ENSGT00390000015210; -.
DR HOGENOM; CLU_000380_3_0_1; -.
DR InParanoid; H3CCS4; -.
DR OMA; ANKWNTS; -.
DR TreeFam; TF105613; -.
DR Proteomes; UP000007303; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0008237; F:metallopeptidase activity; IEA:InterPro.
DR GO; GO:0030623; F:U5 snRNA binding; IEA:InterPro.
DR GO; GO:0017070; F:U6 snRNA binding; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd08056; MPN_PRP8; 1.
DR CDD; cd13838; RNase_H_like_Prp8_IV; 1.
DR Gene3D; 1.20.80.40; -; 1.
DR Gene3D; 3.30.420.230; -; 1.
DR Gene3D; 3.90.1570.40; -; 1.
DR Gene3D; 3.40.140.10; Cytidine Deaminase, domain 2; 1.
DR Gene3D; 3.30.43.40; Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding domain; 1.
DR InterPro; IPR000555; JAMM/MPN+_dom.
DR InterPro; IPR037518; MPN.
DR InterPro; IPR012591; PRO8NT.
DR InterPro; IPR012592; PROCN.
DR InterPro; IPR012984; PROCT.
DR InterPro; IPR027652; PRP8.
DR InterPro; IPR021983; PRP8_domainIV.
DR InterPro; IPR043173; Prp8_domainIV_fingers.
DR InterPro; IPR043172; Prp8_domainIV_palm.
DR InterPro; IPR019581; Prp8_U5-snRNA-bd.
DR InterPro; IPR042516; Prp8_U5-snRNA-bd_sf.
DR InterPro; IPR019580; Prp8_U6-snRNA-bd.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR019582; RRM_spliceosomal_PrP8.
DR PANTHER; PTHR11140; PRE-MRNA SPLICING FACTOR PRP8; 1.
DR PANTHER; PTHR11140:SF0; PRE-MRNA-PROCESSING-SPLICING FACTOR 8; 1.
DR Pfam; PF01398; JAB; 1.
DR Pfam; PF08082; PRO8NT; 1.
DR Pfam; PF08083; PROCN; 2.
DR Pfam; PF08084; PROCT; 1.
DR Pfam; PF12134; PRP8_domainIV; 1.
DR Pfam; PF10598; RRM_4; 1.
DR Pfam; PF10597; U5_2-snRNA_bdg; 1.
DR Pfam; PF10596; U6-snRNA_bdg; 1.
DR SMART; SM00232; JAB_MPN; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50249; MPN; 1.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007303};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 1998..2129
FT /note="MPN"
FT /evidence="ECO:0000259|PROSITE:PS50249"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8..26
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2230 AA; 260910 MW; 6A7F36E6D0A8889B CRC64;
MAATFPYRPV PPGMPPGVPP PAPVPDYMSE EKLQEKDYIA RKWQQLQAKR YSEKRKFGFV
DAQKEDMPPE HVRKIIRDHG DMTNRKFRHD KRVYLGALKY MPHAVLKLLE NMPMPWEQIR
DVPVLYHITG AISFVNEIPW VIEPVYISQW GTMWIMMRRE KRDRRHFKRM RFPPFDDEEP
PLDYADNILD VEPLEAIQME LDPEEDSSCA EWLYEHQPLK DTAKFVNGAT YRRWQFTLPM
MSTLYRLANQ LLTDLVDLNY FYLFDLKAFF TSKALNMAIP GGPKFEPLVR DINLQDEDWN
EFNDINKIII RQPIRTEYKI AFPYLYNNLP HHVHLTWYHT PNVVFIKTED PDLPAFYFDP
LINPISHRHS VKSQEPLPDD DEEFELPEYV EPFLKETPLY TDNTANGIAL LWAPRPFNLR
SGRTRRAIDI PLIKNWYREH CPAGQPVKVR VSYQKLLKYY VLNALKHRPP KAQKKRYLFR
SFKATKFFQS TKLDWVEVGL QVCRQGYNML NLLIHRKNLN YLHLDYNFNL KPVKTLTTKX
LCCCSNSYSL YSGSTPVGRH SKGVAKTVTK QRVESHFDLE LRAAVMHDIL DMMPEGIKQN
KARTILQHLS ESWRCWKANI PWKVPGLPTP IENMILRYVK AKADWWTNTA HYNRERIRRG
ATVDKTVCKK NLGRLTRLYL KAEQERQHNY LKDGPYITAE EAVAIYTTTV HWLESRRFSP
IPFPPLSYKH DTKLLILALE RLKEAYSVKS RLNQSQREEL GLIEQAYDNP HEALSRIKRH
LLTQRAFKEV GIEFMDLYSH LVPVYDVEPL EKITDAYLDQ YLWYEADKRR LFPPWIKPAD
TEPPPLLVYK WCQGINNLQD VWETGEGECN VMLESRFEKM YEKIDLTLLN RLLRLIVDHN
IADYMTAKNN VVINYKDMNH TNSYGIIRGL QFASFIVQYY GLVMDLLVLG LHRASEMAGP
PQMPNDFLSF QDTSTESAHP IRLYCRYIDR IHIFFRFSAD DARDLIQRYL TEHPDPNNEN
IVGYNNKKCW PRDARMRLMK HDVNLGRAVF WDIKNRLPRS VTTVQWENSF VSVYSKDNPN
LLFNMCGFEC RILPKCRTSY EEFTHKDGVW NLQNEVTKER TAQCFLRVDD ESMQRFHNRV
RQILMASGST TFTKIVNKWN TALIGLMTYF REAVVNTQEL LDLLVKCENK IQTRIKIGLN
SKMPSRFPPV VFYTPKELGG LGMLSMGHVL IPQSDLRWSK QTDVGITHFR SGMSHEEDQL
IPNLYRYIQP WESEFIDSQR VWAEYALKRQ EAIAQNRRLT LEDLEDSWDR GIPRINTLFQ
KDRHTLAYDK GWRVRTDFKQ YQVLKQNPFW WTHQRHDGKL WNLNNYRTDM IQLGGVEGIL
EHTLFKGTYF PTWEGLFWEK ASGFEESMKW KKLTNAQRSG LNQIPNRRFT LWWSPTINRA
NVYVGFQVQL DLTGIFMHGK IPTLKISLIQ IFRAHLWQKI HESIVMDLCQ VFDQELDALE
IETVQKETIH PRKSYKMNSS CADILLFASY KWNVSRPSLL ADSKDVMDST TTQKYWIDIQ
LRWGDYDSHD IERYARAKFL DYTTDNMSIY PSPTGVLIAI DLAYNLHSAY GNWFPGSKPL
IQQAMAKIMK ANPALYVLRE RIRKGLQLYS SEPTEPYLSS QNYGELFSNQ IIWFVDDTNV
YRVTIHKTFE GNLTTKPING AIFIFNPRTG QLFLKIIHTS VWAGQKRLGQ LAKWKTAEEV
AALIRSLPVE EQPKQIIVTR KGMLDPLEVH LLDFPNIVIK GSELQLPFQA CLKVEKFGDL
ILKATEPQMV LFNLYDDWLK TISSYTAFSR LILILRALHV NNDRAKVILK PDKTTITEPH
HIWPTLTDEE WIKVEVQLKD LILADYGKKN NVNVASLTQS EIRDIILGME ISAPSQQRQQ
IAEIEKQTKE QSQLTATQTR TVNKHGDEII TSTTSNYETQ TFSSKTEWRV RAISAANLHL
RTNHIYVSSD DIKETGYTYI LPKNVLKKFI CISDLRAQIA GYLYGTSPPD NPQVKEIRCI
VMVPQWGTHQ TVHLPNQLPG HEYLKEMEPL GWIHTQPNES PQLSPQDVTT HAKVMADNPS
WDGEKTIIIT CSFTPGSCTL TAYKLTPSGY EWGRQNTDKG NNPKGYLPSH YERVQMLLSD
RFLGFFMVPG QGSWNYNFMG VRHDPNMKYD LQLANPKEFY HEVHRPSHFL NFASLQEGEI
YNADREDMYA
//