ID F6TYZ0_HORSE Unreviewed; 530 AA.
AC F6TYZ0;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 2.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=Transcription factor CP2 {ECO:0000313|Ensembl:ENSECAP00000018420.2};
GN Name=TFCP2 {ECO:0000313|Ensembl:ENSECAP00000018420.2,
GN ECO:0000313|VGNC:VGNC:24035};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000018420.2, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000018420.2, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000018420.2,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000018420.2}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000018420.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU01313}.
CC -!- SIMILARITY: Belongs to the grh/CP2 family. CP2 subfamily.
CC {ECO:0000256|ARBA:ARBA00010852}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F6TYZ0; -.
DR STRING; 9796.ENSECAP00000018420; -.
DR PaxDb; 9796-ENSECAP00000018420; -.
DR Ensembl; ENSECAT00000022302.3; ENSECAP00000018420.2; ENSECAG00000020650.4.
DR CTD; 7024; -.
DR VGNC; VGNC:24035; TFCP2.
DR GeneTree; ENSGT00940000157629; -.
DR HOGENOM; CLU_015127_2_0_1; -.
DR InParanoid; F6TYZ0; -.
DR OMA; GFNSAHS; -.
DR OrthoDB; 1363858at2759; -.
DR TreeFam; TF314132; -.
DR Proteomes; UP000002281; Chromosome 6.
DR Bgee; ENSECAG00000020650; Expressed in retina and 23 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0001228; F:DNA-binding transcription activator activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd09589; SAM_TFCP2; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR007604; CP2.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR041418; SAM_3.
DR InterPro; IPR040167; TF_CP2-like.
DR InterPro; IPR037599; TFCP2_SAM.
DR PANTHER; PTHR11037:SF11; ALPHA-GLOBIN TRANSCRIPTION FACTOR CP2; 1.
DR PANTHER; PTHR11037; TRANSCRIPTION FACTOR CP2; 1.
DR Pfam; PF04516; CP2; 1.
DR Pfam; PF18016; SAM_3; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR PROSITE; PS51968; GRH_CP2_DB; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU01313};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU01313};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 89..328
FT /note="Grh/CP2 DB"
FT /evidence="ECO:0000259|PROSITE:PS51968"
FT REGION 266..296
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 322..353
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 267..295
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..337
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 530 AA; 60433 MW; 1A469B6C3FBEF101 CRC64;
MIGWRRHGGW WRSSASLVGV SWGNEGARMA WALKLPLADE VIESGLVQDF DASLSGIGQE
LGAGAYSMSD VLALPIFKQE ESSLPPDNEN KILPFQYVLC AATSPAVKLH DETLTYLNQG
QSYEIRMLDN RKLGELPEIN GKLVKSIFRV VFHDRRLQYT EHQQLEGWRW NRPGDRILDI
DIPMSVGIID PRANPTQLNT VEFLWDPAKR TSVFIQVHCI STEFTMRKHG GEKGVPFRVQ
IDTFKENENG EYTEHLHSAS CQIKVFKPKG ADRKQKTDRE KMEKRTPHEK EKYQPSYETT
ILTECSPWPE ITYVNNSPSP GFNSSHSSFS LGEGNGSPNH QPEPPPPVTD NLLPTTTPQE
AQQWLHRNRF STFTRLFTNF SGADLLKLTR DDVIQICGPA DGIRLFNALK GRMVRPRLTI
YVCQESLQLR EQQQQQQQQQ QKHEDGDSNG SFFVYHAIYL EELTAVELTE KIAQLFSISP
RQISQIYKQG PTGIHVLISD EMIQNFQEEA CFILDTMKAE TNDSYHIILK
//