Database: UniProt
Entry: A0A9Q1I9K2_CONCO
ID   A0A9Q1I9K2_CONCO        Unreviewed;      1363 AA.
AC   A0A9Q1I9K2;
DT   13-SEP-2023, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2023, sequence version 1.
DT   28-JAN-2026, entry version 10.
DE   RecName: Full=WD repeat and HMG-box DNA-binding protein 1 {ECO:0000256|ARBA:ARBA00069769};
DE   AltName: Full=Acidic nucleoplasmic DNA-binding protein 1 {ECO:0000256|ARBA:ARBA00080131};
GN   ORFNames=COCON_G00017090 {ECO:0000313|EMBL:KAJ8289050.1};
OS   Conger conger (Conger eel) (Muraena conger).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Anguilliformes; Congridae; Conger.
OX   NCBI_TaxID=82655 {ECO:0000313|EMBL:KAJ8289050.1, ECO:0000313|Proteomes:UP001152803};
RN   [1] {ECO:0000313|EMBL:KAJ8289050.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Concon-B {ECO:0000313|EMBL:KAJ8289050.1};
RX   PubMed=36758078;
RA   Parey E., Louis A., Montfort J., Bouchez O., Roques C., Iampietro C.,
RA   Lluch J., Castinel A., Donnadieu C., Desvignes T., Floi Bucao C.,
RA   Jouanno E., Wen M., Mejri S., Dirks R., Jansen H., Henkel C., Chen W.J.,
RA   Zahm M., Cabau C., Klopp C., Thompson A.W., Robinson-Rechavi M.,
RA   Braasch I., Lecointre G., Bobe J., Postlethwait J.H., Berthelot C.,
RA   Roest Crollius H., Guiguen Y.;
RT   "Genome structures resolve the early diversification of teleost fishes.";
RL   Science 379:572-575(2023).
CC   -!- FUNCTION: Core replisome component that acts as a replication
CC       initiation factor. Binds directly to the CMG complex and functions as a
CC       hub to recruit additional proteins to the replication fork.
CC       {ECO:0000256|ARBA:ARBA00056293}.
CC   -!- SUBCELLULAR LOCATION: Nucleus, nucleoplasm
CC       {ECO:0000256|ARBA:ARBA00004642}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAJ8289050.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JAFJMO010000001; KAJ8289050.1; -; Genomic_DNA.
DR   OrthoDB; 427368at2759; -.
DR   Proteomes; UP001152803; Unassembled WGS sequence.
DR   GO; GO:0043596; C:nuclear replication fork; IEA:TreeGrafter.
DR   GO; GO:0005654; C:nucleoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0003682; F:chromatin binding; IEA:TreeGrafter.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0006281; P:DNA repair; IEA:TreeGrafter.
DR   GO; GO:0006261; P:DNA-templated DNA replication; IEA:InterPro.
DR   GO; GO:0000278; P:mitotic cell cycle; IEA:TreeGrafter.
DR   CDD; cd21993; HMG-box_WDHD1; 1.
DR   CDD; cd00200; WD40; 1.
DR   FunFam; 1.10.30.10:FF:000028; WD repeat and HMG-box DNA-binding protein 1; 1.
DR   FunFam; 2.130.10.10:FF:001715; WD repeat and HMG-box DNA-binding protein 1; 1.
DR   Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR   Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR   InterPro; IPR055339; HMG-box_WDHD1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   InterPro; IPR022252; SOCS4/SOCS5_dom.
DR   InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR   InterPro; IPR036322; WD40_repeat_dom_sf.
DR   InterPro; IPR001680; WD40_rpt.
DR   InterPro; IPR057646; WD40_WDHD1_1st.
DR   InterPro; IPR022100; WDHD1/CFT4_beta-prop_2nd.
DR   InterPro; IPR048591; WDHD1/CFT4_hel.
DR   PANTHER; PTHR19932; WD REPEAT AND HMG-BOX DNA BINDING PROTEIN; 1.
DR   PANTHER; PTHR19932:SF10; WD REPEAT AND HMG-BOX DNA-BINDING PROTEIN 1; 1.
DR   Pfam; PF20946; Ctf4_C; 1.
DR   Pfam; PF24815; HMG_WDHD1; 1.
DR   Pfam; PF12341; Mcl1_mid; 1.
DR   Pfam; PF12610; SOCS; 1.
DR   Pfam; PF24817; WD40_WDHD1_1st; 1.
DR   SMART; SM00398; HMG; 1.
DR   SMART; SM00320; WD40; 6.
DR   SUPFAM; SSF47095; HMG-box; 1.
DR   SUPFAM; SSF50978; WD40 repeat-like; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
DR   PROSITE; PS50082; WD_REPEATS_2; 2.
DR   PROSITE; PS50294; WD_REPEATS_REGION; 2.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00267};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP001152803};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW   ProRule:PRU00221}.
FT   REPEAT          242..274
FT                   /note="WD"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT   REPEAT          365..406
FT                   /note="WD"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT   DOMAIN          1255..1332
FT                   /note="HMG box"
FT                   /evidence="ECO:0000259|PROSITE:PS50118"
FT   DNA_BIND        1255..1332
FT                   /note="HMG box"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT   REGION          23..58
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          540..669
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1052..1264
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1296..1363
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        540..553
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        562..576
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1099..1118
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1156..1169
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1170..1184
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1229..1239
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1296..1309
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1324..1343
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1363 AA;  149683 MW;  55433E7F3C097AC8 CRC64;
     MSEKKSRNSD TRPKCIRSWS ADSYVWSSKK RSRSAQNGPG ASVIEGEVLE DQAGRSASCP
     RRRRERKCSC AGPGEGDPEL PCRKALTRRS LRQKFQDAVG QCFPLRTHHH HHHHHHHQAG
     ASRPFSVLLW SKRKIHVSEL MQDKCPFSSK SELAHCWHLI KKHVSQPGGS IAIAADQKGG
     AFPASSFSSP PPLLSWEEIA GRGPPEEAAA WTTGTPPRGR IYRAFKSSIH WKEMPAERKP
     MRYGHSEGHT DVCFDDTGKY IVTCGSDGDV RIWESLDDDD PKSVNVGEKV YSLALKNGKL
     VTAVSNNTVQ IHTFPEGDPD GILTRFTSNA NHVAFNCSGS RVAAGSSDFM VKVVEVADSS
     QQKTLRGHEA PVLSVNFDPK DEFLGTSSCD GSVAVWKIED QVQVSSWKIL QKSNDVSNAK
     SLCRLSWQPG SGKLLAVPVD TTVQLYERGT WSHVSTLSDD LIAQPVNVVA WSPCGKFLAA
     GAVGGFLTVW NVETKLCVER EKHEKGYTVC GLAWHPSGKQ IAYTDTEGCL GLLDSVLASS
     SSSSSSSAKS TSKVAPNKEA GDYDDLFDGD GDDGPLEENL SGTRSPAKNA VAEDDDDDDL
     MPATGRPRNR SSFLDDDENS LDTGSVKLGS DKYGDDDGAS NILPSVAPVA PRPVYDGPMP
     TPPQKPFQPG STPGHLMHRF MMWNSVGIVR GYSDELDNAI DVEFHDSSVH HAMHLSNSLG
     HTLADLSQEA VLLACEGTEE LASKLQCLHF SSWDTNKEWM VDLPKGEDVR AVCLGQGWVA
     AATSALLVRV FSVGGVQREV FSLPGPVVCM TGHGEQILIV YHRGTGFDGD QALGVQLLNL
     GQRRRQVIHG EPLPLSRKSH LSWMGFSAEG TPCFVDSEGV VRLLNRSLGN TWTPVCNTRE
     HCKGKSDHYW VVGVHENPQQ LRCIPCKGAR FPPTLPRPAV AILPFQLPLC QTSTEKGQME
     EQYWRSVLFH NHFDFLSASG YEVSEEGRTR AQNEQQELLM KMFALSCKLE REFRCVELAE
     LMTQRVVTLA VRYASRSRRM ALAQRLGELA MEKAAAQREE EQEEEEPDHT SGRQAPGYSR
     AAVECAESPQ WGRRPEPARE EEECEEDDGQ QQMEEEEAAE SRKPPLNPFN KGATSPEKPS
     PKPASKEGRV NPFKVSGSGK SPGPSWGQSR VTNVLDTMTP STRKPSLVAG SGAKQNKGPV
     LKPLAPRPKS KTQPTLLQMS VPRPASRGQQ KEPAVEKRSV AGGQSPASPA DNSENRKPKT
     GFQLWLEENR KGILADNPDL DETDVVKEAM GKFRALSSED RLSWTERAKG PSSEVAELKK
     RKRREEENEE VKNDEGNERE SSAKKKKPLD ASAKLSAFAF NKD
//