ID G0VKM4_NAUCC Unreviewed; 740 AA.
AC G0VKM4;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 13-SEP-2023, entry version 49.
DE RecName: Full=WH2 domain-containing protein {ECO:0000259|PROSITE:PS51082};
GN Name=NCAS0J00820 {ECO:0000313|EMBL:CCC72061.1};
GN OrderedLocusNames=NCAS_0J00820 {ECO:0000313|EMBL:CCC72061.1};
OS Naumovozyma castellii (strain ATCC 76901 / BCRC 22586 / CBS 4309 / NBRC
OS 1992 / NRRL Y-12630) (Yeast) (Saccharomyces castellii).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetaceae; Naumovozyma.
OX NCBI_TaxID=1064592 {ECO:0000313|EMBL:CCC72061.1, ECO:0000313|Proteomes:UP000001640};
RN [1] {ECO:0000313|EMBL:CCC72061.1, ECO:0000313|Proteomes:UP000001640}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 76901 / BCRC 22586 / CBS 4309 / NBRC 1992 / NRRL Y-12630
RC {ECO:0000313|Proteomes:UP000001640};
RX PubMed=22123960; DOI=10.1073/pnas.1112808108;
RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., Byrne K.P.,
RA Wolfe K.H.;
RT "Evolutionary erosion of yeast sex chromosomes by mating-type switching
RT accidents.";
RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Type strain:CBS 4309;
RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., Byrne K.P.,
RA Wolfe K.H.;
RT "Genome sequence of Naumovozyma castellii.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HE576761; CCC72061.1; -; Genomic_DNA.
DR RefSeq; XP_003678400.1; XM_003678352.1.
DR AlphaFoldDB; G0VKM4; -.
DR STRING; 1064592.G0VKM4; -.
DR GeneID; 11525804; -.
DR KEGG; ncs:NCAS_0J00820; -.
DR eggNOG; KOG4462; Eukaryota.
DR HOGENOM; CLU_386454_0_0_1; -.
DR InParanoid; G0VKM4; -.
DR OMA; PMRQSSN; -.
DR OrthoDB; 1442810at2759; -.
DR Proteomes; UP000001640; Chromosome 10.
DR GO; GO:0003779; F:actin binding; IEA:InterPro.
DR CDD; cd22064; WH2_WAS_WASL; 1.
DR InterPro; IPR003124; WH2_dom.
DR Pfam; PF02205; WH2; 1.
DR SMART; SM00246; WH2; 2.
DR PROSITE; PS51082; WH2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001640}.
FT DOMAIN 33..50
FT /note="WH2"
FT /evidence="ECO:0000259|PROSITE:PS51082"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 47..452
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 481..652
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..19
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 120..147
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 169..213
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 223..278
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 326..433
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 481..505
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 506..521
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 549..570
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 581..604
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 617..632
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 633..647
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 740 AA; 74255 MW; 574D547E6C0C85A8 CRC64;
MAGPPAPPPP PPPPGVFAAA APAAKPAASV MQGRDALLGD IRKGMRLKKA ETNDRSAPEV
GGGVVSASSA PPSAPKMSAP PIPGSAPAST MGGPPQLGDI FAGGIPKLKH TNADQKNIST
PPPSAPPIPQ ANAPSTPQMP TNRPPHMPSG RPMKRSHQKK SSVSSVAPSV PSAPPPPPMP
SQAAPIPVSA PPIPSMSAPP APPTAPPPPA IPITTAKKDS NAPPVAPPAP PMPTISSPAA
PPTPPVPSMN APPAPPPPSI PVMKKESPTT PSSPPPSSGL PFLAEINARR TNKGVVDDEV
VKKVQTTKVA APASTKKKTK TPSAPKIAAP PLPTSAPPLP SSAPPLPTAA PPLPSSNAPP
PPPVPSFDSP PTPSIPADIP SAPPLPPAAP PAPALPKLQA PAAPAAPAAP APPAPPAPAL
SIPSAPPPPP AVFESISVSS SAPPAAPISS GGALPFLAEI QKKRDDRFVV AEDSNYTTKD
HLATTATPVS KTSTPMVTAP STIPPSVNET PTPPPAAAPP LPSGNGMSFL GEIESKLKIH
HEPSTPESSA PAIPAPPQQG APPIPMAPPP VSIETSNAAP SLPSAPPPPP IISMAPPSAP
PPPTTTDEYA YDDLTSTHSP PPPPPMSAPA PVPPETSSST GSSTPGVKHR LFGNGESTLQ
HTVNQHTNAP DIDVGTYTIS GSNDTTGVKL NSGKITIDDS RFKWSNLDQI PKPRPFQGKI
KLYPSGKGSS VPLDLSLFTT
//