ID A0A182M0M1_9DIPT Unreviewed; 1505 AA.
AC A0A182M0M1;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
OS Anopheles culicifacies.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles; culicifacies species complex.
OX NCBI_TaxID=139723 {ECO:0000313|EnsemblMetazoa:ACUA006557-PA, ECO:0000313|Proteomes:UP000075883};
RN [1] {ECO:0000313|Proteomes:UP000075883}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=A-37 {ECO:0000313|Proteomes:UP000075883};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Howell P., Walton C., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles culicifacies species A.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ACUA006557-PA}
RP IDENTIFICATION.
RC STRAIN=A-37 {ECO:0000313|EnsemblMetazoa:ACUA006557-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AXCM01001019; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 139723.A0A182M0M1; -.
DR EnsemblMetazoa; ACUA006557-RA; ACUA006557-PA; ACUA006557.
DR VEuPathDB; VectorBase:ACUA006557; -.
DR OrthoDB; 167902at2759; -.
DR Proteomes; UP000075883; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 2.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR008847; Suf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF05843; Suf; 1.
DR SMART; SM00386; HAT; 5.
DR SMART; SM00316; S1; 6.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 2.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50126; S1; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT DOMAIN 83..164
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 970..1048
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1142..1168
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 993..1007
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1009..1041
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1505 AA; 170290 MW; 994A4748AA9ADF6E CRC64;
MVGVEPAFPR GNKVQKRGLT NRVTKRKFSP KNYGATVLPE EKALRLRPKE RREVMLSEKE
AEITMAQQAF DATNLSVATM REGMLVLACV KRIAKAHIEL LLPGCISGIV PISMISDAYS
LRLKDMINTG STNCPTLNDL YRVGDLVYVT LLSKESSRLT FSLKPNDLHK SYVTSQLVPG
LVLSATIVRK EDHGYSMDIG VRNVRGFLSD ESLGQNRDDV GRNLVCSIES VSRLDSNPTV
LLKAFDPEAP WVLNMEDADM ETIVPGCRMM FTVGEQVDHG LRGMLFEDMV PAYVNENMLT
KPTSRPDEYS MFKKIPATLL YVMPVTKQVF VSLKPYSKNR VEMNVNNGPG TIIEKAYVKA
VKDAGVWFQF GNKCRALLRW KTIMGETAGN VDKSVVMEKF QVGMTCKLAI MHYNPLEDTY
IVSNKSTFFD EEIRSSEDIV IGNLYTAHVL KTIPGGAFVS VGYARGSLVK TYYDHEHPVK
VHDKVLVRAV MRELDSPFVK FTNHPALVDE KASILHDWDQ LDATRNDQSF HGVIFKQKTD
TVFVRFFNDI VGVIQKPRAI ANQNITLMAR LNVNTVHKFT VLGFDKATKQ LELQPVSTAE
KIQSASRLVK ARISCVHAAG VDILTNDGQN GTIPSECLSE FGEHNSLYMR LLREGQSVMA
IQTNPDTYSM RLISYFRTHP RQIESVQRGA LLKGSCTNVN GVLYITPLLT NFSKQIEVKT
KENRGTVQDG SIMMMRVLNV KKSPNSRYDL DVSTALLDVC ENGTKDVFNF TAEYLKDVKK
LIKRYQVEKF GFANYNLGDL VNCVVESIVP SSNQVTVEVH SKRKKSKNVT KGIATAILPS
HPATSYTVGQ KVPGRVVWID VERKLLHVCL DQPLVKSILP NSSLTGRKYT PDAQSCWVLY
ANNYVQVCCL QANPPNPLVI VPVKYHYNDL MEKVNNTSKS VTVRLVRNLD DMIFGINSKE
MKLYENMKDD RDGEGQLEGM TNQNAEEDAN SSYEIMDYND DDNDNSNEEP LNGWRVRTPV
TMSTSNGKKK KNGSTSDLSK KGQGEPKLKR MIAVEAKESV SQMQLPLKKK APLKEDPETA
TAAKRTKKTI LKKDKKSIVE LHSNGANAKK SNLKKKKMPL VLEQLDGACD FYLHQLDGTE
DITPSSNNKM GARKRKHTTE GNGLPGAVNF WDSTPVYKRT VSDSSDDETH SSDDEQCETV
GKMRITAKER FEAMKKEEER LRKIEDELAN PSADPHTPDQ FDRLVLAQPN NSMLWIRYMA
FHMESAELDK ARAVGRKALK AIHFREETDR LNVWMALLNL EIRYETVDSF KEVLQEAVQY
NDAFKVYSRA VDILIDCQKH AEVQEMLELL LKKFRKQNDM WFLVADAWYR IGQGSKVKPL
LSQALKSLPA REHIGMIVKF AFLHNRNENR DEAHLLFEQI LTSYPKRTDI WSQYIDMLVK
DNLVGNARQI LERAIMQRLP MKNMKTLYTK YVNFEEKHGD RDSVRRVKQM ATDYVQAQLN
NAGIN
//