ID A0A1E4T775_9ASCO Unreviewed; 619 AA.
AC A0A1E4T775;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=Protein SQS1 {ECO:0000256|ARBA:ARBA00018964};
DE Flags: Fragment;
GN ORFNames=CANARDRAFT_186122 {ECO:0000313|EMBL:ODV87606.1};
OS [Candida] arabinofermentans NRRL YB-2248.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Pichiaceae; Ogataea; Ogataea/Candida clade.
OX NCBI_TaxID=983967 {ECO:0000313|EMBL:ODV87606.1, ECO:0000313|Proteomes:UP000094801};
RN [1] {ECO:0000313|Proteomes:UP000094801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NRRL YB-2248 {ECO:0000313|Proteomes:UP000094801};
RG DOE Joint Genome Institute;
RA Riley R., Haridas S., Wolfe K.H., Lopes M.R., Hittinger C.T., Goker M.,
RA Salamov A., Wisecaver J., Long T.M., Aerts A.L., Barry K., Choi C.,
RA Clum A., Coughlan A.Y., Deshpande S., Douglass A.P., Hanson S.J.,
RA Klenk H.-P., Labutti K., Lapidus A., Lindquist E., Lipzen A.,
RA Meier-Kolthoff J.P., Ohm R.A., Otillar R.P., Pangilinan J., Peng Y.,
RA Rokas A., Rosa C.A., Scheuner C., Sibirny A.A., Slot J.C., Stielow J.B.,
RA Sun H., Kurtzman C.P., Blackwell M., Grigoriev I.V., Jeffries T.W.;
RT "Comparative genomics of biotechnologically important yeasts.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the SQS1 family.
CC {ECO:0000256|ARBA:ARBA00010306}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KV453848; ODV87606.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1E4T775; -.
DR STRING; 983967.A0A1E4T775; -.
DR OrthoDB; 1333919at2759; -.
DR Proteomes; UP000094801; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:UniProtKB-UniRule.
DR Gene3D; 3.30.1370.50; R3H-like domain; 1.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR001374; R3H_dom.
DR InterPro; IPR036867; R3H_dom_sf.
DR PANTHER; PTHR14195; G PATCH DOMAIN CONTAINING PROTEIN 2; 1.
DR PANTHER; PTHR14195:SF2; GH10944P; 1.
DR Pfam; PF01585; G-patch; 1.
DR SMART; SM00443; G_patch; 1.
DR PROSITE; PS50174; G_PATCH; 1.
DR PROSITE; PS51061; R3H; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000094801}.
FT DOMAIN 441..504
FT /note="R3H"
FT /evidence="ECO:0000259|PROSITE:PS51061"
FT DOMAIN 576..619
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 48..201
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 216..244
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 320..354
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 126..140
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 144..178
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 181..201
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 334..351
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:ODV87606.1"
FT NON_TER 619
FT /evidence="ECO:0000313|EMBL:ODV87606.1"
SQ SEQUENCE 619 AA; 71003 MW; 3E83BA66356D47B7 CRC64;
IDYTGKHRGE MLKRPLRKMP IEFIKAKEIY DPSADLYKKI AKFKIADPVE KVSPSPLPNI
EQKQLEEEFE GSSSDASEES DQENNSFSHH HDEQIFDDDD DDDNVNGFLN DLLRQEASQV
SVPIKKIEIQ DDKEDKDETM GVSVVTVPKN NHDSSVDSTK TLPDVSSIEN QSLHLQSSDE
QQSDEEDDDD DDDNDDEEYD EQADFEAAAW IANHSEDSDI EEPDPEKSSS LAPTIVEDSN
EDPKFGFLPE DYEAFDVSQI QVTNIRLSAD DASYYVKAPI LFNTDDYNWW SKESFVEELI
DNGLAHYRVD AFLEYLTSHL TQPDPQKPEE SDYDLGEESS EEESDDDDDG MADLINFSRN
QPRILDDPID IGTATLKTKG KGKKKQLRPD QIEDLEMRQE LMNQFMFRQS SKKNKHDAKQ
LARKADKLMG ENFMLEKYPF ELHVRDMKFE YDEFYDDAQR TAMRFPPLDP HGLKTLKKLA
DYYNLKSRKF GKGPKSYVVA IKSKRTYMYR PDKPQINRIL KQRPIFKRSD VRTTKEEALQ
LKNKRKKSAK ELAEKGDKYK YKEGELVAAF APEIGAGNIG RKLLEKMGWA TGEALGAEGN
KGIIEPIQAK MKMTKIGIK
//