ID A0D7R2_PARTE Unreviewed; 984 AA.
AC A0D7R2;
DT 28-NOV-2006, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2006, sequence version 1.
DT 24-JAN-2024, entry version 76.
DE SubName: Full=Chromosome undetermined scaffold_40, whole genome shotgun sequence {ECO:0000313|EMBL:CAK79079.1};
GN ORFNames=GSPATT00014046001 {ECO:0000313|EMBL:CAK79079.1};
OS Paramecium tetraurelia.
OC Eukaryota; Sar; Alveolata; Ciliophora; Intramacronucleata;
OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium.
OX NCBI_TaxID=5888 {ECO:0000313|EMBL:CAK79079.1, ECO:0000313|Proteomes:UP000000600};
RN [1] {ECO:0000313|EMBL:CAK79079.1, ECO:0000313|Proteomes:UP000000600}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Stock d4-2 {ECO:0000313|EMBL:CAK79079.1,
RC ECO:0000313|Proteomes:UP000000600};
RX PubMed=17086204; DOI=10.1038/nature05230;
RG Genoscope;
RA Aury J.-M., Jaillon O., Duret L., Noel B., Jubin C., Porcel B.M.,
RA Segurens B., Daubin V., Anthouard V., Aiach N., Arnaiz O., Billaut A.,
RA Beisson J., Blanc I., Bouhouche K., Camara F., Duharcourt S., Guigo R.,
RA Gogendeau D., Katinka M., Keller A.-M., Kissmehl R., Klotz C., Koll F.,
RA Le Moue A., Lepere C., Malinsky S., Nowacki M., Nowak J.K., Plattner H.,
RA Poulain J., Ruiz F., Serrano V., Zagulski M., Dessen P., Betermier M.,
RA Weissenbach J., Scarpelli C., Schachter V., Sperling L., Meyer E.,
RA Cohen J., Wincker P.;
RT "Global trends of whole-genome duplications revealed by the ciliate
RT Paramecium tetraurelia.";
RL Nature 444:171-178(2006).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CT868319; CAK79079.1; -; Genomic_DNA.
DR RefSeq; XP_001446476.1; XM_001446439.1.
DR AlphaFoldDB; A0D7R2; -.
DR STRING; 5888.A0D7R2; -.
DR EnsemblProtists; CAK79079; CAK79079; GSPATT00014046001.
DR GeneID; 5032261; -.
DR KEGG; ptm:GSPATT00014046001; -.
DR eggNOG; KOG0151; Eukaryota.
DR eggNOG; KOG1614; Eukaryota.
DR HOGENOM; CLU_302945_0_0_1; -.
DR InParanoid; A0D7R2; -.
DR Proteomes; UP000000600; Partially assembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR GO; GO:0006396; P:RNA processing; IEA:InterPro.
DR CDD; cd00590; RRM_SF; 1.
DR Gene3D; 1.25.40.90; -; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 3.30.230.70; GHMP Kinase, N-terminal domain; 1.
DR Gene3D; 1.10.10.790; Surp module; 1.
DR InterPro; IPR006569; CID_dom.
DR InterPro; IPR008942; ENTH_VHS.
DR InterPro; IPR001247; ExoRNase_PH_dom1.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR027408; PNPase/RNase_PH_dom_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR020568; Ribosomal_Su5_D2-typ_SF.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23140; RNA PROCESSING PROTEIN LD23810P; 1.
DR PANTHER; PTHR23140:SF0; U2 SNRNP-ASSOCIATED SURP MOTIF-CONTAINING PROTEIN; 1.
DR Pfam; PF04818; CID; 1.
DR Pfam; PF01138; RNase_PH; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF01805; Surp; 1.
DR SMART; SM00582; RPR; 1.
DR SMART; SM00648; SWAP; 1.
DR SUPFAM; SSF48464; ENTH/VHS domain; 1.
DR SUPFAM; SSF54211; Ribosomal protein S5 domain 2-like; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 1.
DR PROSITE; PS51391; CID; 1.
DR PROSITE; PS50128; SURP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000000600};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884}.
FT DOMAIN 156..198
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 272..400
FT /note="CID"
FT /evidence="ECO:0000259|PROSITE:PS51391"
FT REGION 651..748
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 651..699
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 713..748
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 984 AA; 116350 MW; 83531FBEE2E2CEF7 CRC64;
MQANCILKPL QLQPQVRSLI VIQKPLKPLA GMQQNLNEHL NNIKNIIPYQ FVDKSSPHIT
ICGLTPKITE QSLRLVCAEY GNVVSISLRA YYKDVQAQIV ANVTYENASS AQHAYMELQK
KVENGFHFQL YYGGPCYQSK SKIVKIKLPN PQIRGIIDKL ARQVVKEGAQ FEQMIKQREI
NNSKYAFLYL QSEENEYYKW RVYSFQNGDD EKQWKQEPYY FNLNERIYIP PAIEVEEAPS
FAKKELEKAQ SKCSSIIIIV TTKNKKAQYY VLEDQDRLTL SQMIRELNTQ KHTIGKAMVF
CIDHQNCPAD LMLILEDSLL NDSIWSMKLA RLYLISDILN NCNQNFKSYI QWCLPKIFSN
LDQLLPYKEK ILKLLQCWRE QNLFDQKYLK GLELSFLMKE QSIQIESISS QLYKEKLAHV
DDETLDRICR IKGLCSQGPR DTLIQRLVQH KFYNRANHDV SLEQVSKFVQ VYRYIVEKVF
VIYTIIKSKD QQTIISSQKQ SFTTQIQMMQ EFLKLLQKRY KNIISRDGEE IDAVDERIYE
FNKHIEIQRA ERNLYLNIDG KDLTEEDIRT IDDKKVQVVT NYELTYLQPK PPLPPPPTDP
IEIEVQQYRD ILFQSGLYDP FKIEEFSKSK RAQLVKADQL KKERERQLLL KQQQEQRERE
KERERERERE REKEREKERE KEREREKERE REKERERQRH HNRNNKRSSS SSGSDKRNQK
KRNMIRNIIR EDPDQEAIQE KRKLKTKRDE QNIIKHKKYN YKIYLFIKPL KNFEFVLEQL
TLGKRIDGRD PLQLRMIQSH FGPQITGAVE LSLGETSVSR MSPNPIRPSE GFLKFHLDLQ
VLRDTGYMHN PIKLDMEIEK YIEKVIKGSK ALDTESLCIL SGKNVWSIDV NVALINNDGN
LLDAMYLCCI FSQQHFRRPQ VSVSLQGVKV EVEKRLVPLS IHHIPLSLTQ AILELNEQTI
LLQDSCLEEE DSEWKNHLWC KHLQ
//