ID A0A2B4S6F7_STYPI Unreviewed; 1015 AA.
AC A0A2B4S6F7;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE SubName: Full=WD repeat and HMG-box DNA-binding protein 1 {ECO:0000313|EMBL:PFX24112.1};
GN Name=wdhd1 {ECO:0000313|EMBL:PFX24112.1};
GN ORFNames=AWC38_SpisGene11313 {ECO:0000313|EMBL:PFX24112.1};
OS Stylophora pistillata (Smooth cauliflower coral).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Scleractinia;
OC Astrocoeniina; Pocilloporidae; Stylophora.
OX NCBI_TaxID=50429 {ECO:0000313|EMBL:PFX24112.1, ECO:0000313|Proteomes:UP000225706};
RN [1] {ECO:0000313|Proteomes:UP000225706}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Voolstra C.R., Li Y., Liew Y.J., Baumgarten S., Zoccola D., Flot J.-F.,
RA Tambutte S., Allemand D., Aranda M.;
RT "Comparative analysis of the genomes of Stylophora pistillata and Acropora
RT digitifera provides evidence for extensive differences between species of
RT corals.";
RL bioRxiv 0:0-0(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PFX24112.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSMT01000186; PFX24112.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2B4S6F7; -.
DR STRING; 50429.A0A2B4S6F7; -.
DR Proteomes; UP000225706; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21993; HMG-box_WDHD1; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR024977; Apc4-like_WD40_dom.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022100; Mcl1_mid.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19932; WD REPEAT AND HMG-BOX DNA BINDING PROTEIN; 1.
DR PANTHER; PTHR19932:SF10; WD REPEAT AND HMG-BOX DNA-BINDING PROTEIN 1; 1.
DR Pfam; PF12894; ANAPC4_WD40; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12341; Mcl1_mid; 1.
DR Pfam; PF00400; WD40; 3.
DR SMART; SM00398; HMG; 1.
DR SMART; SM00320; WD40; 5.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 2.
DR PROSITE; PS50294; WD_REPEATS_REGION; 2.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267,
KW ECO:0000313|EMBL:PFX24112.1};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000225706};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT REPEAT 130..163
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 237..278
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 906..977
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 906..977
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 706..779
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 797..915
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 948..987
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 711..726
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 734..766
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 799..834
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 847..874
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 881..898
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1015 AA; 111644 MW; 388FFD8A7BC3D22F CRC64;
MPVDRKPMRF AHSEGQTDVC YDETGGYLLT CGSDGDVRIY KDFDDSDPES VRAGENVTVL
LVKNKKIVTA SDNSVQVYTF PEGSPDGILT RFTASVNHIC FNQSGSVLAA GASDFNIKVV
TVADGSQKVL RGHEAPVLSV TMDPKDQYAA SSSCDGTVKI WNLGDQDLKD VMPTCLAARA
WQVSLSVSHS VSQSVSQSVS QSVSQSVSQS VSQSAIHRAS EWVYERDTWK NTFNLEKDGH
SELVSLVSWS PCGNYIASAS INGEMFIWKV ATQAVIERIT HESGHTICGL AWNPKGNKEI
AYTDNQGQFG VCENVIPNDE VKSSNHPVTA NDFMDDSLVT AAMDADSDDE QLIIGKKKQR
SKSTAWIDEE ADDDDDDDEF KDIRKLKAAL AAPLEFGDGG NDGDTGSEVS APQAPKKEVI
HTPFIPVLQP PFQPGSTPVH LMHRFMVWNS VGIVRCHTED EISSIEVEFH DTSTHHPLHL
TNHLNHTMAA LSSSAVLLAC KAREDSPSKL VCIHFGSWDN SKEWTVVLPE GESIQAVATG
SSFAAVATDK RFLRIFTIGG VQRDILSLPG AVVCMSGYKN QLMVVFHINN PLPGEQALAL
KLLDLKHNQE IIVEERVLLS EKSTLGWLGF SEEGSPVTVD SAGVVRLLSR SFGTSWSPVC
VTKSNVSLAI KYAARSRRMN LANRLDELAR EKAKLEAEEE FEDDFQQIDW PVRGNKTSTS
RMGGHRPSNT LREVEQEEEE EEMEETGGDD GMDDDDYDDF DDDDDEQTSR KEGGVSSFAF
FSACNKQRKG LARLNSLDLG SKPISTANSN PPSSPLPSSN PSQGRSNPFR ISSPGKKASS
VCGKSFFDSV EEEKKASLQR KSKISPDTKP VQKRKKGKQT TLLKTPQEKE KETQNKKTSP
ERNGPPSKKV NGFTLWFEEN KEGLSNENPE LTGTHLVKSA MGQWKALDED EKMEWNNKAK
GTAHEKEQGE KKRKREKSDE ENEDTLNTIN LAKKTKESGV TVANSKLAGF AYNKN
//