ID F6QKZ0_HORSE Unreviewed; 771 AA.
AC F6QKZ0;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 75.
DE SubName: Full=Cell division cycle 5 like {ECO:0000313|Ensembl:ENSECAP00000008340.3};
GN Name=CDC5L {ECO:0000313|VGNC:VGNC:16304};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000008340.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000008340.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000008340.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000008340.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000008340.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the CEF1 family.
CC {ECO:0000256|ARBA:ARBA00010506}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F6QKZ0; -.
DR STRING; 9796.ENSECAP00000008340; -.
DR PaxDb; 9796-ENSECAP00000008340; -.
DR Ensembl; ENSECAT00000010764.4; ENSECAP00000008340.3; ENSECAG00000010081.4.
DR VGNC; VGNC:16304; CDC5L.
DR GeneTree; ENSGT00550000074922; -.
DR HOGENOM; CLU_009082_0_0_1; -.
DR InParanoid; F6QKZ0; -.
DR OMA; KMGMAGE; -.
DR TreeFam; TF101061; -.
DR Proteomes; UP000002281; Chromosome 20.
DR Bgee; ENSECAG00000010081; Expressed in brainstem and 23 other cell types or tissues.
DR ExpressionAtlas; F6QKZ0; baseline.
DR GO; GO:0000974; C:Prp19 complex; IBA:GO_Central.
DR GO; GO:0005681; C:spliceosomal complex; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00167; SANT; 1.
DR CDD; cd11659; SANT_CDC5_II; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR047242; CDC5L/Cef1.
DR InterPro; IPR021786; Cdc5p/Cef1_C.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR047240; SANT_CDC5L_II.
DR PANTHER; PTHR45885; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR PANTHER; PTHR45885:SF1; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR Pfam; PF11831; Myb_Cef; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51294; HTH_MYB; 2.
DR PROSITE; PS50090; MYB_LIKE; 2.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 1..58
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 3..54
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 55..104
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 59..108
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 108..143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 246..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 409..459
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 108..142
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 246..261
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 409..437
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 771 AA; 88388 MW; 2CEEE6C916B219C8 CRC64;
MPRIMIKGGV WRNTEDEILK AAVMKYGKNQ WSRIASLLHR KSAKQCKARW YEWLDPSIKK
TEWSREEEEK LLHLAKLMPT QWRTIAPIIG RTAAQCLEHY EFLLDKAAQR DNEEETTDDP
RKLKPGEIDP NPETKPARPD PIDMDEDELE MLSEARARLA NTQGKKAKRK AREKQLEEAR
RLAALQKRRE LRAAGIEIQK KRKKKRGVDY NAEIPFEKKP ALGFYDTSEE NYQALDADFR
KLRQQDLDGE LRSEKEGRDR KKDKQHLKRK KESDLPSAIL QTSGVSEFTK KRSKLVLPAP
QISDAELQEV VKVGQASEIA RQTAEESGIT NSASSTLLSE YNVTNNSIAL RTPRTPASQD
RILQEAQNLM ALTNVDTPLK GGLNTPLHES DFSGVTPQRQ VVQTPNTVLS TPFRTPSHGS
EGLTPRSGTT PKPVINSTPG RTPLRDKLNI NPEDGMADYS DPSYVKQMER ESREHLRLGL
LGLPAPKNDF EIVLPENAEK ELEDREIDDT YIEDAADVDA RKQAIREAER VKEMKRMHKA
VQKDLPRPSE VNETILRPLN VEPPLTDLQK SEELIKKEMI TMLHYDLLHH PYEPSGNKKG
KTVGFGTNNS EHIAYLEHNP YEKFSKEELK KAQDILVQEM EVVKQGMSHG ELSSEAYNQV
WEECYSQVLY LPGQSRYTRA NLASKKDRIE SLEKRLEINR GHMTTEAKRA AKMEKKMKIL
LGGYQSRAMG LMKQLNDLWD QIEQAYLELR TFEELKKHED SAIPRRLEAV F
//