ID A0A9Q1I9K2_CONCO Unreviewed; 1363 AA.
AC A0A9Q1I9K2;
DT 13-SEP-2023, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 1.
DT 28-JAN-2026, entry version 10.
DE RecName: Full=WD repeat and HMG-box DNA-binding protein 1 {ECO:0000256|ARBA:ARBA00069769};
DE AltName: Full=Acidic nucleoplasmic DNA-binding protein 1 {ECO:0000256|ARBA:ARBA00080131};
GN ORFNames=COCON_G00017090 {ECO:0000313|EMBL:KAJ8289050.1};
OS Conger conger (Conger eel) (Muraena conger).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Anguilliformes; Congridae; Conger.
OX NCBI_TaxID=82655 {ECO:0000313|EMBL:KAJ8289050.1, ECO:0000313|Proteomes:UP001152803};
RN [1] {ECO:0000313|EMBL:KAJ8289050.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Concon-B {ECO:0000313|EMBL:KAJ8289050.1};
RX PubMed=36758078;
RA Parey E., Louis A., Montfort J., Bouchez O., Roques C., Iampietro C.,
RA Lluch J., Castinel A., Donnadieu C., Desvignes T., Floi Bucao C.,
RA Jouanno E., Wen M., Mejri S., Dirks R., Jansen H., Henkel C., Chen W.J.,
RA Zahm M., Cabau C., Klopp C., Thompson A.W., Robinson-Rechavi M.,
RA Braasch I., Lecointre G., Bobe J., Postlethwait J.H., Berthelot C.,
RA Roest Crollius H., Guiguen Y.;
RT "Genome structures resolve the early diversification of teleost fishes.";
RL Science 379:572-575(2023).
CC -!- FUNCTION: Core replisome component that acts as a replication
CC initiation factor. Binds directly to the CMG complex and functions as a
CC hub to recruit additional proteins to the replication fork.
CC {ECO:0000256|ARBA:ARBA00056293}.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleoplasm
CC {ECO:0000256|ARBA:ARBA00004642}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAJ8289050.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JAFJMO010000001; KAJ8289050.1; -; Genomic_DNA.
DR OrthoDB; 427368at2759; -.
DR Proteomes; UP001152803; Unassembled WGS sequence.
DR GO; GO:0043596; C:nuclear replication fork; IEA:TreeGrafter.
DR GO; GO:0005654; C:nucleoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:TreeGrafter.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006281; P:DNA repair; IEA:TreeGrafter.
DR GO; GO:0006261; P:DNA-templated DNA replication; IEA:InterPro.
DR GO; GO:0000278; P:mitotic cell cycle; IEA:TreeGrafter.
DR CDD; cd21993; HMG-box_WDHD1; 1.
DR CDD; cd00200; WD40; 1.
DR FunFam; 1.10.30.10:FF:000028; WD repeat and HMG-box DNA-binding protein 1; 1.
DR FunFam; 2.130.10.10:FF:001715; WD repeat and HMG-box DNA-binding protein 1; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR055339; HMG-box_WDHD1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022252; SOCS4/SOCS5_dom.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR InterPro; IPR057646; WD40_WDHD1_1st.
DR InterPro; IPR022100; WDHD1/CFT4_beta-prop_2nd.
DR InterPro; IPR048591; WDHD1/CFT4_hel.
DR PANTHER; PTHR19932; WD REPEAT AND HMG-BOX DNA BINDING PROTEIN; 1.
DR PANTHER; PTHR19932:SF10; WD REPEAT AND HMG-BOX DNA-BINDING PROTEIN 1; 1.
DR Pfam; PF20946; Ctf4_C; 1.
DR Pfam; PF24815; HMG_WDHD1; 1.
DR Pfam; PF12341; Mcl1_mid; 1.
DR Pfam; PF12610; SOCS; 1.
DR Pfam; PF24817; WD40_WDHD1_1st; 1.
DR SMART; SM00398; HMG; 1.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 2.
DR PROSITE; PS50294; WD_REPEATS_REGION; 2.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP001152803};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT REPEAT 242..274
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 365..406
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 1255..1332
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 1255..1332
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 23..58
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 540..669
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1052..1264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1296..1363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 540..553
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 562..576
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1099..1118
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1156..1169
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1170..1184
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1229..1239
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1296..1309
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1324..1343
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1363 AA; 149683 MW; 55433E7F3C097AC8 CRC64;
MSEKKSRNSD TRPKCIRSWS ADSYVWSSKK RSRSAQNGPG ASVIEGEVLE DQAGRSASCP
RRRRERKCSC AGPGEGDPEL PCRKALTRRS LRQKFQDAVG QCFPLRTHHH HHHHHHHQAG
ASRPFSVLLW SKRKIHVSEL MQDKCPFSSK SELAHCWHLI KKHVSQPGGS IAIAADQKGG
AFPASSFSSP PPLLSWEEIA GRGPPEEAAA WTTGTPPRGR IYRAFKSSIH WKEMPAERKP
MRYGHSEGHT DVCFDDTGKY IVTCGSDGDV RIWESLDDDD PKSVNVGEKV YSLALKNGKL
VTAVSNNTVQ IHTFPEGDPD GILTRFTSNA NHVAFNCSGS RVAAGSSDFM VKVVEVADSS
QQKTLRGHEA PVLSVNFDPK DEFLGTSSCD GSVAVWKIED QVQVSSWKIL QKSNDVSNAK
SLCRLSWQPG SGKLLAVPVD TTVQLYERGT WSHVSTLSDD LIAQPVNVVA WSPCGKFLAA
GAVGGFLTVW NVETKLCVER EKHEKGYTVC GLAWHPSGKQ IAYTDTEGCL GLLDSVLASS
SSSSSSSAKS TSKVAPNKEA GDYDDLFDGD GDDGPLEENL SGTRSPAKNA VAEDDDDDDL
MPATGRPRNR SSFLDDDENS LDTGSVKLGS DKYGDDDGAS NILPSVAPVA PRPVYDGPMP
TPPQKPFQPG STPGHLMHRF MMWNSVGIVR GYSDELDNAI DVEFHDSSVH HAMHLSNSLG
HTLADLSQEA VLLACEGTEE LASKLQCLHF SSWDTNKEWM VDLPKGEDVR AVCLGQGWVA
AATSALLVRV FSVGGVQREV FSLPGPVVCM TGHGEQILIV YHRGTGFDGD QALGVQLLNL
GQRRRQVIHG EPLPLSRKSH LSWMGFSAEG TPCFVDSEGV VRLLNRSLGN TWTPVCNTRE
HCKGKSDHYW VVGVHENPQQ LRCIPCKGAR FPPTLPRPAV AILPFQLPLC QTSTEKGQME
EQYWRSVLFH NHFDFLSASG YEVSEEGRTR AQNEQQELLM KMFALSCKLE REFRCVELAE
LMTQRVVTLA VRYASRSRRM ALAQRLGELA MEKAAAQREE EQEEEEPDHT SGRQAPGYSR
AAVECAESPQ WGRRPEPARE EEECEEDDGQ QQMEEEEAAE SRKPPLNPFN KGATSPEKPS
PKPASKEGRV NPFKVSGSGK SPGPSWGQSR VTNVLDTMTP STRKPSLVAG SGAKQNKGPV
LKPLAPRPKS KTQPTLLQMS VPRPASRGQQ KEPAVEKRSV AGGQSPASPA DNSENRKPKT
GFQLWLEENR KGILADNPDL DETDVVKEAM GKFRALSSED RLSWTERAKG PSSEVAELKK
RKRREEENEE VKNDEGNERE SSAKKKKPLD ASAKLSAFAF NKD
//