ID C7GM52_YEAS2 Unreviewed; 2493 AA.
AC C7GM52;
DT 13-OCT-2009, integrated into UniProtKB/TrEMBL.
DT 13-OCT-2009, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE SubName: Full=Utp20p {ECO:0000313|EMBL:EEU08137.1};
GN Name=UTP20 {ECO:0000313|EMBL:EEU08137.1};
GN ORFNames=C1Q_01318 {ECO:0000313|EMBL:EEU08137.1};
OS Saccharomyces cerevisiae (strain JAY291) (Baker's yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetaceae; Saccharomyces.
OX NCBI_TaxID=574961 {ECO:0000313|EMBL:EEU08137.1, ECO:0000313|Proteomes:UP000008073};
RN [1] {ECO:0000313|EMBL:EEU08137.1, ECO:0000313|Proteomes:UP000008073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JAY291 {ECO:0000313|EMBL:EEU08137.1,
RC ECO:0000313|Proteomes:UP000008073};
RX PubMed=19812109; DOI=10.1101/gr.091777.109;
RA Argueso J.L., Carazzolle M.F., Mieczkowski P.A., Duarte F.M., Netto O.V.,
RA Missawa S.K., Galzerani F., Costa G.G., Vidal R.O., Noronha M.F.,
RA Dominska M., Andrietta M.G., Andrietta S.R., Cunha A.F., Gomes L.H.,
RA Tavares F.C., Alcarde A.R., Dietrich F.S., McCusker J.H., Petes T.D.,
RA Pereira G.A.;
RT "Genome structure of a Saccharomyces cerevisiae strain widely used in
RT bioethanol production.";
RL Genome Res. 19:2258-2270(2009).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EEU08137.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACFL01000048; EEU08137.1; -; Genomic_DNA.
DR Proteomes; UP000008073; Unassembled WGS sequence.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 3.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR046523; UTP20_C.
DR InterPro; IPR011430; UTP20_N.
DR PANTHER; PTHR17695:SF11; SMALL SUBUNIT PROCESSOME COMPONENT 20 HOMOLOG; 1.
DR PANTHER; PTHR17695; UNCHARACTERIZED; 1.
DR Pfam; PF20416; UTP20_C; 1.
DR Pfam; PF07539; UTP20_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 4: Predicted;
FT DOMAIN 822..1408
FT /note="U3 small nucleolar RNA-associated protein 20 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF07539"
FT DOMAIN 1602..1818
FT /note="U3 small nucleolar RNA-associated protein 20 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20416"
FT REGION 2458..2493
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2493 AA; 287561 MW; C7765D25412069BA CRC64;
MAKQRQTTKS SKRYRYSSFK ARIDDLKIEP ARNLEKRVHD YVESSHFLAS FDQWKEINLS
AKFTEFAAEI EHDVQTLPQI LYHDKKIFNS LVSFINFHDE FSLQPLLDLL AQFCHDLGPD
FLKFYEEAIK TLINLLDAAI EFESSNVFEW GFNCLAYIFK YLSKFLVKKL VLTCDLLIPL
LSHSKEYLSR FSAEALSFLV RKCPVPNLRE FVRSVFEKLE GDDEQTNLYE GLLILFTESM
TSTQETLHSK AKAIMSVLLH EALTKSSPER SVSLLSDIWM NISKYASIES LLPVYEVMYQ
DFNDSFDATN IDRILEVLTT IVFSESGRKI PDWNKIIILI ERIMSQSENC ASLSQDKVAF
LFALFIRNSD VKTLTLFHQK LFNYALTNIS DCFLEFFQFA LRLSYERVFS FNGLKFLQLF
LKKNWQSQGK KIALFFLEVD DKPELQKVRE VNFPEEFILS IRDFFVTAEI NDSNDLFEIY
WRAIIFKYSK LQNTEIIIPL LERIFSTFAS PDNFTKDMVG TLLKIYRKED DASGNNLLKT
ILDNYENYKE SLNFLRGWNK LVSNLHPSES LKGLMSHYPS LLLSLTDNFM LPDGKIRYET
LELMKTLMIL QGMQVPDLLS SCMVIEEIPL TLQNARDLTI RIKNVGAEFG KTKTDKLVSS
FFLKYLFGLL TVRFSPVWTG VFDTLPNVYT KDEALVWKLV LSFIKLPDEN QNLDYYQPLL
EDGANKVLWD SSVVRLRDTI DTFSHIWSKY STQNTSIIST TIERRGNTTY PILIRNQALK
VMLSIPQVAE NHFVDIAPFV FNDFKTYKDE EDMENERVIT GSWTEVDRNV FLKTLSKFKN
IKNVYSATEL HDHLMVLLGS RNTDVQKLAL DALFAYKNPT LNKYRDNLKN LLDDTLFKDE
ITTFLTENGS QSIKAEDEKV VMPYVLRIFF GRAQVPPTSG QKRSRKIAVI SVLPNFKKPY
INDFLSLASE RLDYNYFFGN GHQINSSKAT LKTIRRMTGF VNIVNSTLSV LRTNFPLHTN
SVLQPLIYSI AMAYYVLDTE STEEVHLRKM ASNLRQQGLK CLSSVFEFVG NAFDWSTSME
DIYAVVVKPR ISHFSDENLQ QPSSLLRLFL YWAHNPSLYQ FLYYDEFATA TALMDTISNQ
HVKEAVIGPI IEAADSIIRN PVNDDHYVDL VTLICTSCLK ILPSLYVKLS DSNSISTFLN
LLVSITEMGF IQDDHVRSRL ISSLISILKG KLKKLQENDT QKILKILKLI VFNYNCSWSD
IEELYTTISS LFKTFDERNL RVSLTELFIE LGRKVPELES ISKLVADLNS YSSSRMHEYD
FPRILSAFKG LIEDGYKSYS ELEWLPLLFT FLHFINDKEE LALRTNASHA IMKFIDFINE
KPNLNEASKS ISMLKDILLP SIRIGLRDSL EEVQSEYVSV LSYMVKNTKY FTDFEDMAIL
LYNGDEEADF FTNVNHIQLH RRQRAIKRLG EHAHQLKDNS ISHYLIPMIE HYVFSDDERY
RNIGNETQIA IGGLAQHMSW NQYKALLRRY ISMLKTKPNQ MKQAVQLIVQ LSVPLRETLR
IVRDGAESKL TLSKFPSNLD EPSNFIKQEL YPTLSKILGT RDDETIIERM PIAEALVNIV
LGLTNDDITN FLPSILTNIC QVLRSKSEEL RDAVRVTLGK ISIILGAEYL VFVIKELMAT
LKRGSQIHVL SYTVHYILKS MHGVLKHSDL DTSSSMIVKI IMENIFGFAG EEKDSENYHT
KVKEIKSNKS YDAGEILASN ISLTEFGTLL SPVKALLMVR INLRNQNKLS ELLRRYLLGL
NHNSDSESES ILKFCHQLFQ ESEMSNSPQI PKKKVKDQVD EKEDFFLVNL ESKSYTINSN
SLLLNSTLQK FALDLLRNVI TRHRSFLTVS HLEGFIPFLR DSLLSENEGV VISTLRILIT
LIRLDFSDES SEIFKNCARK VLNIIKVSPS TSSELCQMGL KFLSAFIRHT DSTLKDTALS
YVLGRVLPDL NEPSRQGLAF NFLKALVSKH IMLPELYDIA DTTREIMVTN HSKEIRDVSR
SVYYQFLMEY DQSKGRLEKQ FKFMVDNLQY PTESGRQSVM ELINLIITKA NPALLSKLSS
SFFLALVNVS FNDDAPRCRE MASVLISTML PKLENKDLEI VEKYIAAWLK QVDNASFLNL
GLRTYKVYLK SIGFEHTIEL DELAIKRIRY ILSDTSVGSE HQWDLVYSAL NTFSSYMEAT
ESVYKHGFKD IWDGIITCLL YPHSWVRQSA ANLVHQLIAN KDKLEISLTN LEIQTIATRI
LHQLGAPSIP ENLANVSIKT LVNISILWKE QRTPFIMDVS KQTGEELKYT TAIDYMVTRI
GGIIRSDEHR MDSFMSKKAC IQLLALLVQV LDEDEVIAEG EKILLPLYGY LETYYSRAVD
EEQEELRTLS NECLKILEDK LQVSDFTKIY TAVKQTVLER RKERRSKRAI LAVNAPQISA
DKKLRKHARS REKRKHEKDE NGYYQRRNKR KRV
//