ID A0A1E4SZJ0_9ASCO Unreviewed; 668 AA.
AC A0A1E4SZJ0;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 22-FEB-2023, entry version 22.
DE RecName: Full=NUC153 domain-containing protein {ECO:0000259|Pfam:PF08159};
GN ORFNames=CANARDRAFT_28669 {ECO:0000313|EMBL:ODV84935.1};
OS [Candida] arabinofermentans NRRL YB-2248.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Pichiaceae; Ogataea; Ogataea/Candida clade.
OX NCBI_TaxID=983967 {ECO:0000313|EMBL:ODV84935.1, ECO:0000313|Proteomes:UP000094801};
RN [1] {ECO:0000313|Proteomes:UP000094801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NRRL YB-2248 {ECO:0000313|Proteomes:UP000094801};
RG DOE Joint Genome Institute;
RA Riley R., Haridas S., Wolfe K.H., Lopes M.R., Hittinger C.T., Goker M.,
RA Salamov A., Wisecaver J., Long T.M., Aerts A.L., Barry K., Choi C.,
RA Clum A., Coughlan A.Y., Deshpande S., Douglass A.P., Hanson S.J.,
RA Klenk H.-P., Labutti K., Lapidus A., Lindquist E., Lipzen A.,
RA Meier-Kolthoff J.P., Ohm R.A., Otillar R.P., Pangilinan J., Peng Y.,
RA Rokas A., Rosa C.A., Scheuner C., Sibirny A.A., Slot J.C., Stielow J.B.,
RA Sun H., Kurtzman C.P., Blackwell M., Grigoriev I.V., Jeffries T.W.;
RT "Comparative genomics of biotechnologically important yeasts.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the ESF1 family.
CC {ECO:0000256|ARBA:ARBA00009087}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KV453854; ODV84935.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1E4SZJ0; -.
DR STRING; 983967.A0A1E4SZJ0; -.
DR OrthoDB; 131620at2759; -.
DR Proteomes; UP000094801; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0006364; P:rRNA processing; IEA:InterPro.
DR InterPro; IPR039754; Esf1.
DR InterPro; IPR012580; NUC153.
DR PANTHER; PTHR12202:SF0; ESF1 HOMOLOG; 1.
DR PANTHER; PTHR12202; UNCHARACTERIZED; 1.
DR Pfam; PF08159; NUC153; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000094801}.
FT DOMAIN 592..617
FT /note="NUC153"
FT /evidence="ECO:0000259|Pfam:PF08159"
FT REGION 1..36
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 50..180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 226..260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 402..535
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..87
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 109..126
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..177
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 226..247
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 459..482
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 492..535
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 668 AA; 76353 MW; 360F10D800C9D20F CRC64;
MAKGDKKGKS GQQPVTLDPR FAQVYSDPRF KSQKRKDLRV KVDDRFSKEE LGISKSTAKV
DKYGRKLTEQ KSNASDLFDK YYKTEEDDKD ESDESDDEEK LEKTKDQSSS EEEDDEEEEE
EEGEEAALSA LDRARGIGAD YSDLDTSDED DDDSSDEEVV KESELEIEEE EDIEEADPTS
TFACVNMDWD YITSTDLYAT FSSFVPTGGK ILSVSLYPSE YGKARMQQEE IEGPPRDLFK
SSKPETNDSD DDSDDSDEEV DIEKATKELY QEDDGADFNS KELRSYQLQR LRYYYAIVKC
DSVATSQKIY EACNGTEYES TANLFDLRYV PEDMTFDDKP RDQCSKVSAG YKPKEFITDA
LQHSKVKLTW DETPAERLNL ATKAFSQKEI EDMDFKAYLA SDSEDSADED GEEMRNKYKS
LASGSGKVGS FDIFGENDDG EGDIDMEITF TPGLSNSGKK AEEEVKDKDE STISTFQNKE
KERRKKRKEK IKELKKEQME NKKLEKQELR KERSGKSAKK GGDKNSSKED LKEKADLELL
MMDETNKETQ HFAMKDVLKA EKLKNKKKTS KKDKLKAADL NIDTDIKIDE GDDRFNEIFE
DHAFAIDPTN TEFKKTEVMD KLLKSGLKKH SGKKSAKIDK KKRKVEEVEE KETVSDLVKK
IKRKSGKK
//