ID A0A3Q2I9E5_HORSE Unreviewed; 1367 AA.
AC A0A3Q2I9E5;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 25.
DE RecName: Full=Homeobox protein cut-like {ECO:0000256|RuleBase:RU361129};
GN Name=CUX1 {ECO:0000313|Ensembl:ENSECAP00000044650.3,
GN ECO:0000313|VGNC:VGNC:112271};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000044650.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000044650.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000044650.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000044650.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000044650.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the CUT homeobox family.
CC {ECO:0000256|ARBA:ARBA00008190, ECO:0000256|RuleBase:RU361129}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000044650; -.
DR PaxDb; 9796-ENSECAP00000044650; -.
DR Ensembl; ENSECAT00000046517.3; ENSECAP00000044650.3; ENSECAG00000022645.4.
DR VGNC; VGNC:112271; CUX1.
DR GeneTree; ENSGT00940000159751; -.
DR InParanoid; A0A3Q2I9E5; -.
DR OMA; LESKPYH; -.
DR Proteomes; UP000002281; Chromosome 13.
DR Bgee; ENSECAG00000022645; Expressed in articular cartilage of joint and 23 other cell types or tissues.
DR ExpressionAtlas; A0A3Q2I9E5; baseline.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 3.
DR InterPro; IPR003350; CUT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR PANTHER; PTHR14043; CCAAT DISPLACEMENT PROTEIN-RELATED; 1.
DR PANTHER; PTHR14043:SF4; HOMEOBOX PROTEIN CUT-LIKE 1; 1.
DR Pfam; PF02376; CUT; 3.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM01109; CUT; 3.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 3.
DR PROSITE; PS51042; CUT; 3.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Transcription {ECO:0000256|RuleBase:RU361129};
KW Transcription regulation {ECO:0000256|RuleBase:RU361129}.
FT DOMAIN 553..640
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 916..1003
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 1099..1186
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 1224..1284
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 1226..1285
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 407..466
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 522..561
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 671..693
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 752..910
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1018..1092
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1194..1229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1293..1367
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 112..355
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 446..466
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 522..557
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 677..693
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 822..836
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 846..892
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 893..910
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1018..1060
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1065..1084
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1367 AA; 150083 MW; 6C69947161AB91BD CRC64;
MAANVGSMFQ YWKRFDLQQL QRELDATATV LANRQDESEQ SRKRLIEQSR EFKKNTPEDL
RKQVAPLLKS FQGEIDALSK RSKEAEAAFL NVYKRLIDVP DPVPALELGQ QLQVKVQRLH
DIETENQKLR ETLEEYNKEF AEVKNQEVTI KALKEKIREY EQTLKNQAET IALEKEQKLQ
NDFAEKERKL QETQMSTTSK LEEAEHKVQT LQTALEKTRT ELFDLKTKYD EEITAKADEM
EMIMTDLERA NQRAEVAQRE AETLREQLSS ANHSLQLASQ IQKAPDVEQA IEVLTRSSLE
VELAAKEREI AQLVEDVQRL QASLSKLREN SASQISQLEQ QLSAKNSTLK QLEEKLKGQA
DYEEVKKELN ILKSMEFAPS EGAGTQDASK PLEVLLLEKN RSLQSENAAL RISNSDLSGS
ARRKGKDQPE SRRPGPLPAS PPPQLPRNTG EQASNTNGTH QFSPAGLTQD FFSSSLASPS
LPLASTGKFA LNSLLQRQLM QSFYSKAVQE AGSTSMIFPT GPYSTNSISS QSPLQQSPDV
NGMAPSPSQS ESAGSVSEGE EIDTAEIARQ VKEQLIKHNI GQRIFGHYVL GLSQGSVSEI
LARPKPWNKL TVRGKEPFHK MKQFLSDEQN ILALRSIQGR QRGNITTRVR ASETGSDEAI
KSILEQAKRE LQVQKAAEPA QPSSSSSSGS SDDAIRSILQ QARREMEAQQ AALDPALKQT
PLSQTDIAIL TPKLISTSPI SSGYSPLAIS LKKPPAAPDS SASALPNPPA LKKEAQDTPG
LDLQGAADPA QGVLRHVKNE LGRSGVWKDH WWSTVQPERK SAVPPEEPKG EEASGGKEKG
GGSQTRAERG QLQGPSSSEY WKEWPSAESP YSQSSELSLT GASRSETPQN SPLPSSPIVP
LSKPAKPSVP PLTPEQYEIY MYQEVDTIEL TRQVKEKLAK NGICQRIFGE KVLGLSQGSV
SDMLSRPKPW SKLTQKGREP FIRMQLWLNG ELGQGVLPVQ GQQQGPVLHS VTSLQDPLQQ
GCVSSESTPK TSASCSPAPE SPMSSSESVK SLTELVQQPC PPIETSKDGK PPEPSDPPAS
DSQPTTPLPL SGHSALSIQE LVAMSPELDT YGITKRVKEV LTDNNLGQRL FGETILGLTQ
GSVSDLLARP KPWHKLSLKG REPFVRMQLW LNDPNNVEKL MDMKRMEKKA YMKRRHSSVS
DSQPCEPPSV GIDYSQGASP QPQHQLKKPR VVLAPEEKEA LKRAYQQKPY PSPKTIEELA
TQLNLKTSTV INWFHNYRSR IRRELFIEEI QAGSQGQAGA SDSPSARSGR AAPGSEGDSC
DGVEAAEGPG AADAEESGGP AAAAKSQGGP AEAAAAPEEQ EEAPRPA
//