GenomeNet

Database: UniProt
Entry: A0A3P8YHJ0_ESOLU
LinkDB: A0A3P8YHJ0_ESOLU
Original site: A0A3P8YHJ0_ESOLU 
ID   A0A3P8YHJ0_ESOLU        Unreviewed;      1060 AA.
AC   A0A3P8YHJ0;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   02-DEC-2020, sequence version 2.
DT   24-JAN-2024, entry version 22.
DE   SubName: Full=WD repeat and HMG-box DNA binding protein 1 {ECO:0000313|Ensembl:ENSELUP00000015581.2};
OS   Esox lucius (Northern pike).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Esociformes;
OC   Esocidae; Esox.
OX   NCBI_TaxID=8010 {ECO:0000313|Ensembl:ENSELUP00000015581.2, ECO:0000313|Proteomes:UP000265140};
RN   [1] {ECO:0000313|Ensembl:ENSELUP00000015581.2, ECO:0000313|Proteomes:UP000265140}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=25069045;
RA   Rondeau E.B., Minkley D.R., Leong J.S., Messmer A.M., Jantzen J.R.,
RA   von Schalburg K.R., Lemon C., Bird N.H., Koop B.F.;
RT   "The genome and linkage map of the northern pike (Esox lucius): conserved
RT   synteny revealed between the salmonid sister group and the Neoteleostei.";
RL   PLoS ONE 9:e102089-e102089(2014).
RN   [2] {ECO:0000313|Ensembl:ENSELUP00000015581.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A3P8YHJ0; -.
DR   STRING; 8010.ENSELUP00000015581; -.
DR   Ensembl; ENSELUT00000024694.2; ENSELUP00000015581.2; ENSELUG00000015575.2.
DR   GeneTree; ENSGT00390000002030; -.
DR   Proteomes; UP000265140; LG15.
DR   Bgee; ENSELUG00000015575; Expressed in ovary and 10 other cell types or tissues.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd21993; HMG-box_WDHD1; 1.
DR   Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR   Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR   InterPro; IPR048591; Ctf4-like_C.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   InterPro; IPR022100; Mcl1_mid.
DR   InterPro; IPR013979; TIF_beta_prop-like.
DR   InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR   InterPro; IPR036322; WD40_repeat_dom_sf.
DR   InterPro; IPR001680; WD40_rpt.
DR   PANTHER; PTHR19932; WD REPEAT AND HMG-BOX DNA BINDING PROTEIN; 1.
DR   PANTHER; PTHR19932:SF10; WD REPEAT AND HMG-BOX DNA-BINDING PROTEIN 1; 1.
DR   Pfam; PF20946; Ctf4_C; 1.
DR   Pfam; PF08662; eIF2A; 1.
DR   Pfam; PF00505; HMG_box; 1.
DR   Pfam; PF12341; Mcl1_mid; 1.
DR   Pfam; PF00400; WD40; 2.
DR   SMART; SM00398; HMG; 1.
DR   SMART; SM00320; WD40; 5.
DR   SUPFAM; SSF47095; HMG-box; 1.
DR   SUPFAM; SSF50978; WD40 repeat-like; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
DR   PROSITE; PS50082; WD_REPEATS_2; 2.
DR   PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000265140};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW   ProRule:PRU00221}.
FT   REPEAT          9..41
FT                   /note="WD"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT   REPEAT          216..243
FT                   /note="WD"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT   DOMAIN          956..1024
FT                   /note="HMG box"
FT                   /evidence="ECO:0000259|PROSITE:PS50118"
FT   DNA_BIND        956..1024
FT                   /note="HMG box"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT   REGION          790..811
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          831..880
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          896..960
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          992..1060
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        857..880
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        907..921
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        925..939
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1005..1048
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1060 AA;  115763 MW;  6224C52B1CE19E1C CRC64;
     MPCERKPMRY GHSEGQTDVC FDDTGNFIVT CGSDGDVRIW ESLDDDDPKS ISVGEKAYSC
     ALKNGKLVTA AANNTVQMHT FPEGEPDGIL TRFTTNATHV AYNSSGSRVA AGSSDFMVKV
     VEVSDSSQQK TLRGHNAPVL SVTFDPKDEF LTQVISWPLL QKSNDVTNSK SLCRLAWQPR
     VAKLLAVPVE TTVQLYQRDS WALVSTLSDD LGTQPINVVA WSPCGKFLAT GSIGGLLTVW
     DVEGKLCVDR QKHEKGYTVC GLAWHPSGTQ IAYTDTEGCL GLLDGLSSSS SSSTMAIKST
     KVTGSSNLLR EIFILESGSP AKNLVALDDE DDDDLVPATG RARNRSAFLD DENSLDPGSL
     KLGLDKLGNY EDDAGSALVL PAASAAPLHP VYEGPMPTPH QKAFQPGSTP AHLMHRFMMW
     NSVGIVRGYN DERDNAVDVE FHDTAIHHAM HLTNSLGHTL ADLSHEAVLL ACPSTDELAS
     KLQCLHFSSW DTNKEWMVDL PKGEDVRAVC LGQGWAAAAT SALMLRLFSI GGVQREIFSL
     PGPVVCMAGY GEQLLIVYHR GTGFDGDQAL GVQLLQFGRK RQQVIQGEPL PISRRSHLSW
     LGFTAEGTPC FIDSEGVVRL LNRSLGNTWT PVCNTRDHCK GKSDHYWVVG VHENPQQLRC
     IPCKGSRYPP TLPRPAVAVL PFKLPLCQTT TEKGQMEEQY WRSVLLQNHF GFLSSSGYEI
     DEEAQSRAQK EQQELLMKMF ALSCKLEREF RCVELAELMT QNVVTLAIRY ASRSRRMALA
     QRLSELALEK ANQLQGPEEE EEPLQYTRRN TGSSVFNKDC VDLSIGLNPF AKGSPASPDK
     PSPKPGEFHP AVSGGTGRSS AGGQPRATNV LDSMTSSSRK PTVLLGSSVS AGKLSKTPVL
     KPLAPRPKSK TQSTLLNMSA SKAANKKPAE EREPAVDRLK PTEALPPASP ADNVENTKPK
     TGFQLWLEEN RKSILADNPD LEETDVIKEA MGRFRNLSAG DRLSWTERAK GDEGDLKKRK
     RAEESQGDEN GQNHSDENSA KKKKPLDTSA KLSAFAFSKN
//
DBGET integrated database retrieval system