ID I3L908_PIG Unreviewed; 1046 AA.
AC I3L908;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=WD repeat and HMG-box DNA binding protein 1 {ECO:0000313|Ensembl:ENSSSCP00000020519.2};
GN Name=WDHD1 {ECO:0000313|Ensembl:ENSSSCP00000020519.2,
GN ECO:0000313|VGNC:VGNC:94904};
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Suina; Suidae; Sus.
OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000020519.2, ECO:0000313|Proteomes:UP000008227};
RN [1] {ECO:0000313|Ensembl:ENSSSCP00000020519.2, ECO:0000313|Proteomes:UP000008227}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Duroc {ECO:0000313|Ensembl:ENSSSCP00000020519.2,
RC ECO:0000313|Proteomes:UP000008227};
RG Porcine genome sequencing project;
RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSSCP00000020519.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; I3L908; -.
DR Ensembl; ENSSSCT00000027382.2; ENSSSCP00000020519.2; ENSSSCG00000005052.5.
DR VGNC; VGNC:94904; WDHD1.
DR GeneTree; ENSGT00390000002030; -.
DR HOGENOM; CLU_004219_0_0_1; -.
DR TreeFam; TF105988; -.
DR ChiTaRS; WDHD1; pig.
DR Proteomes; UP000008227; Chromosome 1.
DR Bgee; ENSSSCG00000005052; Expressed in hindlimb bud and 46 other cell types or tissues.
DR ExpressionAtlas; I3L908; baseline and differential.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21993; HMG-box_WDHD1; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR048591; Ctf4-like_C.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022100; Mcl1_mid.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19932; WD REPEAT AND HMG-BOX DNA BINDING PROTEIN; 1.
DR PANTHER; PTHR19932:SF10; WD REPEAT AND HMG-BOX DNA-BINDING PROTEIN 1; 1.
DR Pfam; PF20946; Ctf4_C; 1.
DR Pfam; PF12341; Mcl1_mid; 1.
DR Pfam; PF00400; WD40; 2.
DR SMART; SM00398; HMG; 1.
DR SMART; SM00320; WD40; 4.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 2.
DR PROSITE; PS50294; WD_REPEATS_REGION; 2.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000008227};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT REPEAT 9..41
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 132..173
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 932..987
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 932..987
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 247..290
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 740..938
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 984..1046
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 793..807
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 818..875
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 902..916
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 917..938
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 991..1023
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1024..1046
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1046 AA; 117213 MW; 8EABC8AF93A40252 CRC64;
MPATQKPMRY GHTEGHTDVC FDDSGSCIVT CGSDGDVRIW EDLDDDDPKS INVGEKAYSC
ALKNGKLVTA VSNNTIQVHT FPEGVPDGIL TRFTTNANHV IFNEDGTKIA AGSSDFLVKV
VDVMDCSQQK TFRGHDAPVL SLSFDPKDIF LASSSCDGSV KVWQISDQTC AISWPLLQKC
NDVINAKSIC RLAWQPKSGK LLAVPVEKSV KLYRRETWSN QFDLSDNFIS QVSSRVERDY
NDLFDGDDTG NVGDFPNDNA VETHSFSKET RNDEEDDDDL MLASGRPRQR SHILEDDENS
VDVTMLKTGS SRLKEEEEDD QAGNIHSLPL ITSQKPFYDG PMPTPRQKPF QSGSTPLHLT
HRFMVWNSIG IIRCYNDEQD NAIDVEFHDT SIHHATHLSN TLNYTVADLS HEAILLACES
TDELASKLHC LHFSSWDSSK EWIVDMPPNE DIEAVCLGQG WAAAATSSLL LRLFTIGGVQ
KEVFSLPGPV VSMAGHGEQL IIVYHRGTGF DGDQCLGIQL LQLGKKKKQI LHGDPLPLTR
KSYLVWLGFS AEGTPCYVDS EGIVRMLNRG LGYTWTPICN TREHCKGKSD HYWVVGIHEN
PQHLRCIPCK GSRFPPTLPR PAIAVLSFKL PYCQTGTEKG QMEEQYWRSV LFHNHLDYLA
KSGYEYEEST KNQAIKEQQE LLMKMLALSC KLEREFRCVE LADLMTQNAV NLAIKYASRS
RRLILAQKLS ELAVEKAAEL AAPQAEEEEE EEEDFRKKLN AGYSNTPTEW SQPRLRNRVE
EDTEDTEEVD DIEEKPEIHK HRQNPFFKSA DSTDVPAIKS GAVTSSSQGR INPFKVSGSS
KEPANSFSSV RSTNILDNMN KSKKSALNRA TNNEKSPVIK PLVPKPKSKQ ASAASFFQKR
TSQTDKTEEI KEENTKNSSS ETPAVCPQNT ENQRPKTGFQ MWLEENRSNI LSDNPDFSDE
ADIIKEGMVR FRVLSAEERK AWAAKAKGGI ASDGAEEKKR KRVVDESHET ENQEEKTKEN
LNLPKKQKPL NLSANQKLSA FAFKQE
//