ID W5NYK3_SHEEP Unreviewed; 725 AA.
AC W5NYK3;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0000259|PROSITE:PS50157};
GN Name=CTCF {ECO:0000313|Ensembl:ENSOARP00000003251.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000003251.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000003251.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000003251.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000003251.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01028295; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5NYK3; -.
DR Ensembl; ENSOART00000003312.1; ENSOARP00000003251.1; ENSOARG00000003053.1.
DR HOGENOM; CLU_002678_77_1_1; -.
DR Proteomes; UP000002356; Chromosome 14.
DR Bgee; ENSOARG00000003053; Expressed in saliva-secreting gland and 53 other cell types or tissues.
DR ExpressionAtlas; W5NYK3; baseline and differential.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 7.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR24408; ZINC FINGER PROTEIN; 1.
DR PANTHER; PTHR24408:SF34; ZINC FINGER PROTEIN 672; 1.
DR Pfam; PF00096; zf-C2H2; 7.
DR SMART; SM00355; ZnF_C2H2; 10.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 7.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 6.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 9.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00042}.
FT DOMAIN 266..293
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 294..321
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 322..350
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 351..378
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 379..407
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 408..436
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 438..466
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 468..495
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 552..570
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 180..211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 570..715
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 610..631
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 632..648
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 656..677
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 725 AA; 82183 MW; CBD15268FFB5DBE0 CRC64;
MEGEAVEAIV EESETFIKGK ERKTYQRRRE GGQEEDACHL PQNQTDGGEV VQDVNSSVQM
VMMEQLDPTL LQMKTEVMEG AVAPEAEAAV DDTQIITLQV VNMEEQPINI GELQLVQVPV
PVTVPVATTS VEELQGAYEN EVSKEGLAES EPVICHTLPL PEGFQVVKVG ANGEVETLEQ
GELPPQEDPS WQKDPDYQPP AKKTKKTKKS KLRYTEEGKD VDVSVYDFEE EQQEGLLSEV
NAEKVVGNMK PPKPTKIKKK GVKKTFQCEL CSYTCPRRSN LDRHMKSHTD ERPHKCHLCG
RAFRTVTLLR NHLNTHTGTR PHKCPDCDMA FVTSGELVRH RRYKHTHEKP FKCSMCDYAS
VEVSKLKRHI RSHTGERPFQ CSLCSYASRD TYKLKRHMRT HSEGEKPYEC YICHARFTQS
GTMKMHILQK HTENVAKFHC PHCDTVIARK SDLGVHLRKQ HSYIEQGKKC RYCDAVFHER
YALIQHQKSH KNEKRFKCDQ CDYACRQVGT FIVRKRLMSH QCWVLSKGFG EAALYPISFK
RYHDPNFVPA AFVCSKCGKT FTRRNTMARH ADNCAGPDGV EGENGGETKK SKRGRKRKMR
SKKEDSSDSE ENAEPDLDDN EDEEEPAVEI EPEPEPQPVT PAPPPAKKRR GRPPGRTNQP
KQTQPTAIIQ VEDQNTGAIE NIIVEVKKEP DAEPAEGEEE EAQPAATDAP NGDLTPEMIL
SMMDR
//