ID A0A3P8YHJ0_ESOLU Unreviewed; 1060 AA.
AC A0A3P8YHJ0;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 02-DEC-2020, sequence version 2.
DT 24-JAN-2024, entry version 22.
DE SubName: Full=WD repeat and HMG-box DNA binding protein 1 {ECO:0000313|Ensembl:ENSELUP00000015581.2};
OS Esox lucius (Northern pike).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Esociformes;
OC Esocidae; Esox.
OX NCBI_TaxID=8010 {ECO:0000313|Ensembl:ENSELUP00000015581.2, ECO:0000313|Proteomes:UP000265140};
RN [1] {ECO:0000313|Ensembl:ENSELUP00000015581.2, ECO:0000313|Proteomes:UP000265140}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25069045;
RA Rondeau E.B., Minkley D.R., Leong J.S., Messmer A.M., Jantzen J.R.,
RA von Schalburg K.R., Lemon C., Bird N.H., Koop B.F.;
RT "The genome and linkage map of the northern pike (Esox lucius): conserved
RT synteny revealed between the salmonid sister group and the Neoteleostei.";
RL PLoS ONE 9:e102089-e102089(2014).
RN [2] {ECO:0000313|Ensembl:ENSELUP00000015581.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P8YHJ0; -.
DR STRING; 8010.ENSELUP00000015581; -.
DR Ensembl; ENSELUT00000024694.2; ENSELUP00000015581.2; ENSELUG00000015575.2.
DR GeneTree; ENSGT00390000002030; -.
DR Proteomes; UP000265140; LG15.
DR Bgee; ENSELUG00000015575; Expressed in ovary and 10 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21993; HMG-box_WDHD1; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR048591; Ctf4-like_C.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022100; Mcl1_mid.
DR InterPro; IPR013979; TIF_beta_prop-like.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19932; WD REPEAT AND HMG-BOX DNA BINDING PROTEIN; 1.
DR PANTHER; PTHR19932:SF10; WD REPEAT AND HMG-BOX DNA-BINDING PROTEIN 1; 1.
DR Pfam; PF20946; Ctf4_C; 1.
DR Pfam; PF08662; eIF2A; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12341; Mcl1_mid; 1.
DR Pfam; PF00400; WD40; 2.
DR SMART; SM00398; HMG; 1.
DR SMART; SM00320; WD40; 5.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 2.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000265140};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT REPEAT 9..41
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 216..243
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 956..1024
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 956..1024
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 790..811
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 831..880
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 896..960
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 992..1060
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 857..880
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 907..921
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 925..939
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1005..1048
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1060 AA; 115763 MW; 6224C52B1CE19E1C CRC64;
MPCERKPMRY GHSEGQTDVC FDDTGNFIVT CGSDGDVRIW ESLDDDDPKS ISVGEKAYSC
ALKNGKLVTA AANNTVQMHT FPEGEPDGIL TRFTTNATHV AYNSSGSRVA AGSSDFMVKV
VEVSDSSQQK TLRGHNAPVL SVTFDPKDEF LTQVISWPLL QKSNDVTNSK SLCRLAWQPR
VAKLLAVPVE TTVQLYQRDS WALVSTLSDD LGTQPINVVA WSPCGKFLAT GSIGGLLTVW
DVEGKLCVDR QKHEKGYTVC GLAWHPSGTQ IAYTDTEGCL GLLDGLSSSS SSSTMAIKST
KVTGSSNLLR EIFILESGSP AKNLVALDDE DDDDLVPATG RARNRSAFLD DENSLDPGSL
KLGLDKLGNY EDDAGSALVL PAASAAPLHP VYEGPMPTPH QKAFQPGSTP AHLMHRFMMW
NSVGIVRGYN DERDNAVDVE FHDTAIHHAM HLTNSLGHTL ADLSHEAVLL ACPSTDELAS
KLQCLHFSSW DTNKEWMVDL PKGEDVRAVC LGQGWAAAAT SALMLRLFSI GGVQREIFSL
PGPVVCMAGY GEQLLIVYHR GTGFDGDQAL GVQLLQFGRK RQQVIQGEPL PISRRSHLSW
LGFTAEGTPC FIDSEGVVRL LNRSLGNTWT PVCNTRDHCK GKSDHYWVVG VHENPQQLRC
IPCKGSRYPP TLPRPAVAVL PFKLPLCQTT TEKGQMEEQY WRSVLLQNHF GFLSSSGYEI
DEEAQSRAQK EQQELLMKMF ALSCKLEREF RCVELAELMT QNVVTLAIRY ASRSRRMALA
QRLSELALEK ANQLQGPEEE EEPLQYTRRN TGSSVFNKDC VDLSIGLNPF AKGSPASPDK
PSPKPGEFHP AVSGGTGRSS AGGQPRATNV LDSMTSSSRK PTVLLGSSVS AGKLSKTPVL
KPLAPRPKSK TQSTLLNMSA SKAANKKPAE EREPAVDRLK PTEALPPASP ADNVENTKPK
TGFQLWLEEN RKSILADNPD LEETDVIKEA MGRFRNLSAG DRLSWTERAK GDEGDLKKRK
RAEESQGDEN GQNHSDENSA KKKKPLDTSA KLSAFAFSKN
//