ID A0A091R531_MERNU Unreviewed; 1119 AA.
AC A0A091R531;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE SubName: Full=WD repeat and HMG-box DNA-binding protein 1 {ECO:0000313|EMBL:KFQ34621.1};
GN ORFNames=N331_11266 {ECO:0000313|EMBL:KFQ34621.1};
OS Merops nubicus (Northern carmine bee-eater).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Coraciiformes; Meropidae; Merops.
OX NCBI_TaxID=57421 {ECO:0000313|EMBL:KFQ34621.1, ECO:0000313|Proteomes:UP000052967};
RN [1] {ECO:0000313|EMBL:KFQ34621.1, ECO:0000313|Proteomes:UP000052967}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N331 {ECO:0000313|EMBL:KFQ34621.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK713157; KFQ34621.1; -; Genomic_DNA.
DR RefSeq; XP_008943766.1; XM_008945518.1.
DR AlphaFoldDB; A0A091R531; -.
DR GeneID; 103777952; -.
DR KEGG; mnb:103777952; -.
DR CTD; 11169; -.
DR OrthoDB; 3686044at2759; -.
DR Proteomes; UP000052967; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21993; HMG-box_WDHD1; 1.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR048591; Ctf4-like_C.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022100; Mcl1_mid.
DR InterPro; IPR013979; TIF_beta_prop-like.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19932; WD REPEAT AND HMG-BOX DNA BINDING PROTEIN; 1.
DR PANTHER; PTHR19932:SF10; WD REPEAT AND HMG-BOX DNA-BINDING PROTEIN 1; 1.
DR Pfam; PF20946; Ctf4_C; 1.
DR Pfam; PF08662; eIF2A; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12341; Mcl1_mid; 1.
DR Pfam; PF00400; WD40; 2.
DR SMART; SM00398; HMG; 1.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 3.
DR PROSITE; PS50294; WD_REPEATS_REGION; 3.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267,
KW ECO:0000313|EMBL:KFQ34621.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000052967};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT REPEAT 9..41
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 132..173
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 236..267
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 1006..1060
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 1006..1060
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 328..359
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 387..413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 848..1015
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1056..1119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 880..948
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 961..977
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 978..1005
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1056..1099
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1100..1119
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1119 AA; 124644 MW; 7313705282176130 CRC64;
MPSAQKPMRY GHTEGHTDVC FDDSGSCIVT CGSDGDVRIW ENLDDDDPKS ITVGEKAYSC
ALKNGRLITA VSNNTVQIHT FPEGAPDGIL TRFTMNANHV IFNRDGTKVA AGSSDFMIKV
VEVADSSKQK TFRGHDAPVL SLSFDPRDVY LASASCDGSV RVWKMADQTC TTTWPLLQNC
NDVINAKSIC RLGWQPGSGK FLAVPVDKVV KLYRRETWDS QCSLSDTFIT QPLNVVAWSP
SGEYLAAGSV DGSIVVWNVE KQECIERMKH EKSYSVCGLA WHPKYSQIAY TDTEGNLGLL
ENIGDGKKAN DKVTSTVTKD YNDLFDGEDD DYLNGDTIEP QSSPTAGADE DVDDLMPTSG
PLRRAIIDDD DNSLDIGLIK ANSNLLEKDD EDDDQSGGFP ALPTSTQKHF YDGPMPTPRQ
KPFQSGSTPA HLMHRFMVWN SVGIIRCYND EQDNAIDVEF HDTSVHHATH LPNSLNHTMA
DLSTEAILLA CESSEELASK LHCIHFSSWD ANKEWTVDMP KDEDIEAICL GQGWAACSTT
ALLVRVFTVG GVQKEIFSLP GPVVSMAGHG DQLMVVYHRG TGFDGDQCLG VQLMELGKRK
KQILHGEPLS LTRKSYLIWI GFSAEGTPCY VDSEGIVRML NRGLGNTWIP VCNTREHCKG
KSDHYWVVGI HENPQQLRCI PCKGARFPPT LPRPAVAILP FKLPYCQVTT EKGQMEEQYW
RSVVFHNHAD YLSKNGYEVD ENAKSQVVKE QQELLMKLFA LSCRLERESR CLELAELMTQ
HVVNLAVKYA SRSRRLNLAQ RLSEMAVEKA TELATALEDE EEEEDFRKHL TAGYSNSATE
WSRLPVRNVQ QDQDVGDTEE TDGYEEAEET PEVHKQRPNP FSKGLTSAEV TTPKSAVITP
SSQGRVNPFK VSSNKKDSVV PSANVLDTMS RYSKKTSLSG SRAVNKQNSP LIKPLIPKPK
SKQTSAASFF QPRTPNTTEK AVEEREEKAG NESQEVKDTA QENTENRRPK TGFQMWLEEN
RANILADNPD LNEAEVIKES MSRFRMLTAE ERIVWTEKAK GGTVHDVAED KKRKRPTDDE
DEAKKNPEQK SEDSNLSKKT KPLDQPTNVR LSAFAFKQS
//