ID H2SPC0_TAKRU Unreviewed; 2791 AA.
AC H2SPC0;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 3.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=UTP20 small subunit processome component {ECO:0000313|Ensembl:ENSTRUP00000014257.3};
GN Name=UTP20 {ECO:0000313|Ensembl:ENSTRUP00000014257.3};
OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Takifugu.
OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000014257.3, ECO:0000313|Proteomes:UP000005226};
RN [1] {ECO:0000313|Ensembl:ENSTRUP00000014257.3, ECO:0000313|Proteomes:UP000005226}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21551351;
RA Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., Hosoya S.,
RA Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.;
RT "Integration of the genetic map and genome assembly of fugu facilitates
RT insights into distinct features of genome evolution in teleosts and
RT mammals.";
RL Genome Biol. Evol. 3:424-442(2011).
RN [2] {ECO:0000313|Ensembl:ENSTRUP00000014257.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 31033.ENSTRUP00000014257; -.
DR Ensembl; ENSTRUT00000014322.3; ENSTRUP00000014257.3; ENSTRUG00000005849.3.
DR eggNOG; KOG1823; Eukaryota.
DR GeneTree; ENSGT00390000016813; -.
DR HOGENOM; CLU_000327_0_1_1; -.
DR InParanoid; H2SPC0; -.
DR OMA; EGLMAMF; -.
DR OrthoDB; 5482709at2759; -.
DR TreeFam; TF105652; -.
DR Proteomes; UP000005226; Chromosome 18.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR046523; UTP20_C.
DR InterPro; IPR011430; UTP20_N.
DR PANTHER; PTHR17695:SF11; SMALL SUBUNIT PROCESSOME COMPONENT 20 HOMOLOG; 1.
DR PANTHER; PTHR17695; UNCHARACTERIZED; 1.
DR Pfam; PF20416; UTP20_C; 1.
DR Pfam; PF07539; UTP20_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000005226}.
FT DOMAIN 918..1544
FT /note="U3 small nucleolar RNA-associated protein 20 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF07539"
FT DOMAIN 1831..2050
FT /note="U3 small nucleolar RNA-associated protein 20 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20416"
FT REGION 778..799
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 870..914
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1713..1785
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2060..2095
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2597..2623
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 885..902
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1747..1761
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2070..2084
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2791 AA; 314452 MW; D1EECF833E2BECC0 CRC64;
MNKSKSSHHK SENTFRFLTF AERLANVNID VIHRIDRTGS YAEEVETYFS EGLTKWRDLN
LTVHFTTFLK EVSSKSQSFN MLVFHQKAIV ESLKTHLSVK DSLAYQPLLD LVVQLARDLQ
TDFYPHFPDF FILITSLLDT KDTELLEWAF TCLSYLYKYL WRLMVKDMSN IYSLYSTLLA
HKKEHIRLFA AESFSFLMRK VPDIDALLSH MFTDLQQHPE KAEGAGQLLF EMCKGVRHMF
HSCAAKTFPV ALRKLGPSTS PGPSLPWDTV RDALDHMAQT AANHIDKEHF LVLWEALELS
VLEVSGIVEA KGQKAEEAAE QLERLLFILH TLVSYRDGAK ITKPESVCQT VLQLTQSSTL
PASCSRLLLQ ITSSLLLGEN ITLPTSLIKE MIQKVFSSTL GKDLILEFTK EMFTMKEFEQ
LFLPTMLRFV AGLFRSGDSL SRNSGLNVLV SLILAKAPPP TDGSMAFETY PLLFTGQTTG
SFSQKESSSI TNQPRVPEMV LSLIPSPGEG GKFTDLSLLW SSLVLLPHLR PLDAIAVVPA
VTALLNQLLC EIEAGKLAKA GLYVARQALS CLLTFDHSAE SLSLITVDQI NSVLRSFPTD
LSALLLGDLY YTRLSLSGVS EHLSHRALLD LHQILHANLS SNVSKVRLLT LRILSQFEAE
LPPQSEGEEN VEAQPVFALC LQAELVPATV QDYREKLLHL RKLRHDLVQR SLPQGPHTTF
QQVPLRYLIA MLYVNFRPLW EAVIELLVSH ARGMDNKEFW GVFHEHLEKV AGLAEKQLQD
DEEDHDESTG GTRAEPGCDV IESGDVGVLF LEQLKLATEP NERTDFPNFR NLLWQSMVQF
PDRVEPRSRE LSPLLLRFIR NEFYPADPLV APTQDLRKQD DAPEESGVDE EEEEGDDDEQ
EAEESGRQQK KSVPRGIAAK QLITHLKVFS KFLNPRSLYL ESSLSELYNQ LLCHQDQNIQ
RVALECVLTY RDPNIVPYKE NLERLLQDKH FKEEIVHFNI SEETGVVNAS HRATLIPLLM
RILFGRLRSK AGSKFQGKAS AASRSSIILR FLAGCQTEEL GIFIDLLLEP ISHYSQGSCL
AAVDKAVAET DAGSILPLGR QHSLLNIINV VIHKLGHLIH NYLPKVLQIL LCVTASVSTL
LDRRDQLRPG CISPLKNLRR LGILRIQDFF DNFDWYDFSP DECDAIFQAV VWPQVCRLPT
ESTYSPTPLL KLIHLWCKNA RYFPLLAKQR PNQPECDVLR NVFALLSAKN TSLGTITMVM
DIADSLATTD YSVGTEIEKE LTVNDCVFPQ PEEGALISAD TLGQGSRLLL PHISHLLGYL
SGVVRNTDRL KRKKFRVQVA KELNILSKIS RFVSEKEQSS VLIGLLLPYL QRANNPQETE
IDILATVQNL LRQSLKPSAF LQPLSKLFSI IQNKLPRQAL TNVFQTLSDL DPSLSYITDM
ASKLNAFDSR HLDEIYFDVR LTAFQDATRR VKEMSTLDLN YISTLIHNCF HTYEIGDMSL
GDNATFCLSA VITQLVAVEA GEQIYKDIVQ HTILDAVIKG LRSKTESVQH EYTSVLACLV
KTFPSKKEFR DLVQLTNYSD PESDFFEHMK HIQIHRRGRA LRKLAKQLDD GHVMMTPRSL
QNYIMPYAMT ALLDEKMLKH ENMISASVEV VGAVCRRLTW SKYLYYLKHF IHILQTAQAE
QKLAVSLLVT VLEAFHFDHE TLIREIEAAK SREIGSSRVG PDEEERLESD ASDGEEPMEV
DGKPTQSDVP METNSAVVES VSKDAASRGA ERAAASKPAA VHSGLPHSKD ELEALIKAIH
NTVNNSVLPR LHKCLTAKVQ RDEEHKAVKS KDVKEEEVGR IPIAFAMVKL MQTLPPHIME
ANLPGILIKV CVLLRNRFQE VRDVVRGTLV KITETLGIRY LQYLLKEMQS VLMKGYQVHV
LTFTVYQLLS VLKPALKSGD LDSCMNMLIS IFNNELFGAV AEEKEVKGIV SKLMEARHSK
SMDSYELLAQ FCSKESITSL ILPLKEILET SSSLKVCNRV AAVLRRIILG LLVNDGMTSK
DILLLCHGLI SESLPLLTKR DRDKASAKPP PDPRLPPPSC LLLPPTPKRG GRRAPVSSRT
NMHILVDAGL KLLHLSLKKS KVTSSETSAL EMLDPFVLLL LDCLNSMHVK VITEALVAFT
WLLKFPLPAV EQNANQLTKQ LFVLLKDYSK AGAARGENYL LVQNCFKAIT ILVKNVKSNK
ISETQLQVLL GYAEEDIYDQ SRQATAFGLL KAILSRKLIV PEMEEVLNKV AKLSVTGGNT
MIRVHCRQIY LKYLLDYPLG KKLVNHLDFV VSQLHYEHEA GRESVLEMLA YIFQTFPQNL
LQQHSGLFFA PLSLVVINDS SARCKKMAAL AIKALLAQLN VDHQNSLFTL VNTWLNADKV
SLQRLGAQIC GLFVEVEEEK FARRLDDLLP LLEREINPNN YEDIEEEQDE KGADRLLFSL
LTLISKLNKH CGLLELSKPH DTLCNIWAHI EAHLRYPHCW VWLTASQLFG QLFAAHQPEH
LVAVSGGEGG DSSSQPSATT FICTGLDKKI RELALSFCHQ LQSKFLESVS GEQVIKNLLF
VGKVIYLISP ESDVVSSLGE VEREQGENEQ DEEEGDEKDD RPPSLMWLMR KLSLMAKREA
ANTPKVPLKR TCVFKFMGAI AMDLGKDNVG PYLTTIITPL YRELDSTYAE QDPTLKNLAQ
ELIELLKKQV GLEKFSLAFS AVQKEFSQRR AARKRHRAIQ AVANPDIAAK KKLKKHRNKI
EAKKRKIEFL RPGYKAKKQR SHTLKDLAMV Q
//