ID A0EGN4_PARTE Unreviewed; 920 AA.
AC A0EGN4;
DT 28-NOV-2006, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2006, sequence version 1.
DT 27-MAR-2024, entry version 56.
DE SubName: Full=Chromosome undetermined scaffold_95, whole genome shotgun sequence {ECO:0000313|EMBL:CAK94475.1};
GN ORFNames=GSPATT00026799001 {ECO:0000313|EMBL:CAK94475.1};
OS Paramecium tetraurelia.
OC Eukaryota; Sar; Alveolata; Ciliophora; Intramacronucleata;
OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium.
OX NCBI_TaxID=5888 {ECO:0000313|EMBL:CAK94475.1, ECO:0000313|Proteomes:UP000000600};
RN [1] {ECO:0000313|EMBL:CAK94475.1, ECO:0000313|Proteomes:UP000000600}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Stock d4-2 {ECO:0000313|EMBL:CAK94475.1,
RC ECO:0000313|Proteomes:UP000000600};
RX PubMed=17086204; DOI=10.1038/nature05230;
RG Genoscope;
RA Aury J.-M., Jaillon O., Duret L., Noel B., Jubin C., Porcel B.M.,
RA Segurens B., Daubin V., Anthouard V., Aiach N., Arnaiz O., Billaut A.,
RA Beisson J., Blanc I., Bouhouche K., Camara F., Duharcourt S., Guigo R.,
RA Gogendeau D., Katinka M., Keller A.-M., Kissmehl R., Klotz C., Koll F.,
RA Le Moue A., Lepere C., Malinsky S., Nowacki M., Nowak J.K., Plattner H.,
RA Poulain J., Ruiz F., Serrano V., Zagulski M., Dessen P., Betermier M.,
RA Weissenbach J., Scarpelli C., Schachter V., Sperling L., Meyer E.,
RA Cohen J., Wincker P.;
RT "Global trends of whole-genome duplications revealed by the ciliate
RT Paramecium tetraurelia.";
RL Nature 444:171-178(2006).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CT868677; CAK94475.1; -; Genomic_DNA.
DR RefSeq; XP_001461848.1; XM_001461811.1.
DR AlphaFoldDB; A0EGN4; -.
DR STRING; 5888.A0EGN4; -.
DR EnsemblProtists; CAK94475; CAK94475; GSPATT00026799001.
DR GeneID; 5047633; -.
DR KEGG; ptm:GSPATT00026799001; -.
DR eggNOG; KOG2044; Eukaryota.
DR HOGENOM; CLU_007812_0_0_1; -.
DR InParanoid; A0EGN4; -.
DR Proteomes; UP000000600; Partially assembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0004534; F:5'-3' RNA exonuclease activity; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR GO; GO:0000956; P:nuclear-transcribed mRNA catabolic process; IBA:GO_Central.
DR Gene3D; 1.25.40.1050; -; 1.
DR Gene3D; 2.30.30.750; -; 1.
DR InterPro; IPR027073; 5_3_exoribonuclease.
DR InterPro; IPR041385; SH3_12.
DR InterPro; IPR041412; Xrn1_helical.
DR InterPro; IPR047008; XRN1_SH3_sf.
DR PANTHER; PTHR12341:SF41; 5'-3' EXORIBONUCLEASE 1; 1.
DR PANTHER; PTHR12341; 5'->3' EXORIBONUCLEASE; 1.
DR Pfam; PF18129; SH3_12; 1.
DR Pfam; PF17846; XRN_M; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000000600}.
FT DOMAIN 25..222
FT /note="Xrn1 helical"
FT /evidence="ECO:0000259|Pfam:PF17846"
FT DOMAIN 670..734
FT /note="5'-3' exoribonuclease 1 SH3-like"
FT /evidence="ECO:0000259|Pfam:PF18129"
FT REGION 859..920
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 272..299
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 868..885
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 886..907
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 920 AA; 107788 MW; 4BBA94D807D6EBFF CRC64;
MITKEKELID NILKYFSLQE QTEGRKLYYK AKCQITQGEE LDLQKLEDMC INYMEGIYFV
LQYYFKGVPS WEWYYHYYYA PLCGDIVGVV SQIVQQLSDQ EQPIIFTKDK PYPPFKQLLP
SILTPENAML LPEPYGKLLT DKENSILRTP IDYYPESFET DSYGTMYEHQ HITKIPFLDC
NLVEQAYNSI PDTHDSRNEQ ALSIFYQYDS TQQQIYESTL PKLFSNFKSC CKKQFIDIED
KEQFQIAYGQ QQILEQQEDS FQIPSLKVVS IRSSKLINIQ QHEDDIKKFE NQFLLIELQQ
TKFDFDQFIQ DVIKNKNFAH CGFPIQQQCE VMAILRPDHL HLLDGVKLQS IQMLKDYSKK
DHNKIYKEIR NKTLNQYEKC MKLDCSTFLI VKQYKHYLRD NNNQLSYPQE QYRELVYPFE
FVFPDQKVKF NVPNFEDFQI DRSVVIFHEK KNGATGIIKQ IDKKLTVQIT AYPTLLGYDY
IHSDIYYSLQ IVSEKLQCSV KTLLNLLGSV VVNMEDKESK IADQLDIGLN IINRTNNQLV
PELVRLPQAD QYATGSSNSL KNASLITMQI QKSMLKSLNQ RNCPINAKDL FPNSTDPNID
LLKIYIWILQ LPESQYLLQG SSSKVAQIKI HKKPILNNNI NTTKKVDPGF AVQQTTQTFL
PPFFIKHPTI HKIGDRVVNL NYPFGIYGTV VGLLEQKEIM VQVLWDQKHI GFTNLGGRYD
LLSCSTCKFT EIFNLSNEDW RMNLAKRGCH QGEYWDLWTK VYKPDFKSRA IEFNDQLKEQ
NPFKQLVELP VEVKPKVMVK EKEAPQGFEL IQKLAQEQPN LITNQPLQGI KEELIEQNED
EAQKEIKKLF QLYQQKDLQS QGEQQKQSQK QNDEDGSKVQ PQQVEENDDK QQPQSQIKQQ
VQPQQIKKVL TKKKDQQLQQ
//