GenomeNet

Database: UniProt
Entry: W5NYK3_SHEEP
LinkDB: W5NYK3_SHEEP
Original site: W5NYK3_SHEEP 
ID   W5NYK3_SHEEP            Unreviewed;       725 AA.
AC   W5NYK3;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 42.
DE   RecName: Full=C2H2-type domain-containing protein {ECO:0000259|PROSITE:PS50157};
GN   Name=CTCF {ECO:0000313|Ensembl:ENSOARP00000003251.1};
OS   Ovis aries (Sheep).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC   Caprinae; Ovis.
OX   NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000003251.1, ECO:0000313|Proteomes:UP000002356};
RN   [1] {ECO:0000313|Ensembl:ENSOARP00000003251.1, ECO:0000313|Proteomes:UP000002356}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000003251.1,
RC   ECO:0000313|Proteomes:UP000002356};
RX   PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA   Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA   Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA   Wang W., Xun X.;
RT   "The sheep genome reference sequence: a work in progress.";
RL   Anim. Genet. 41:449-453(2010).
RN   [2] {ECO:0000313|Ensembl:ENSOARP00000003251.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMGL01028295; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; W5NYK3; -.
DR   Ensembl; ENSOART00000003312.1; ENSOARP00000003251.1; ENSOARG00000003053.1.
DR   HOGENOM; CLU_002678_77_1_1; -.
DR   Proteomes; UP000002356; Chromosome 14.
DR   Bgee; ENSOARG00000003053; Expressed in saliva-secreting gland and 53 other cell types or tissues.
DR   ExpressionAtlas; W5NYK3; baseline and differential.
DR   Gene3D; 3.30.160.60; Classic Zinc Finger; 7.
DR   InterPro; IPR036236; Znf_C2H2_sf.
DR   InterPro; IPR013087; Znf_C2H2_type.
DR   PANTHER; PTHR24408; ZINC FINGER PROTEIN; 1.
DR   PANTHER; PTHR24408:SF34; ZINC FINGER PROTEIN 672; 1.
DR   Pfam; PF00096; zf-C2H2; 7.
DR   SMART; SM00355; ZnF_C2H2; 10.
DR   SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 7.
DR   PROSITE; PS00028; ZINC_FINGER_C2H2_1; 6.
DR   PROSITE; PS50157; ZINC_FINGER_C2H2_2; 9.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW   ProRule:PRU00042}.
FT   DOMAIN          266..293
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          294..321
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          322..350
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          351..378
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          379..407
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          408..436
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          438..466
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          468..495
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          552..570
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   REGION          180..211
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          570..715
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        610..631
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        632..648
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        656..677
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   725 AA;  82183 MW;  CBD15268FFB5DBE0 CRC64;
     MEGEAVEAIV EESETFIKGK ERKTYQRRRE GGQEEDACHL PQNQTDGGEV VQDVNSSVQM
     VMMEQLDPTL LQMKTEVMEG AVAPEAEAAV DDTQIITLQV VNMEEQPINI GELQLVQVPV
     PVTVPVATTS VEELQGAYEN EVSKEGLAES EPVICHTLPL PEGFQVVKVG ANGEVETLEQ
     GELPPQEDPS WQKDPDYQPP AKKTKKTKKS KLRYTEEGKD VDVSVYDFEE EQQEGLLSEV
     NAEKVVGNMK PPKPTKIKKK GVKKTFQCEL CSYTCPRRSN LDRHMKSHTD ERPHKCHLCG
     RAFRTVTLLR NHLNTHTGTR PHKCPDCDMA FVTSGELVRH RRYKHTHEKP FKCSMCDYAS
     VEVSKLKRHI RSHTGERPFQ CSLCSYASRD TYKLKRHMRT HSEGEKPYEC YICHARFTQS
     GTMKMHILQK HTENVAKFHC PHCDTVIARK SDLGVHLRKQ HSYIEQGKKC RYCDAVFHER
     YALIQHQKSH KNEKRFKCDQ CDYACRQVGT FIVRKRLMSH QCWVLSKGFG EAALYPISFK
     RYHDPNFVPA AFVCSKCGKT FTRRNTMARH ADNCAGPDGV EGENGGETKK SKRGRKRKMR
     SKKEDSSDSE ENAEPDLDDN EDEEEPAVEI EPEPEPQPVT PAPPPAKKRR GRPPGRTNQP
     KQTQPTAIIQ VEDQNTGAIE NIIVEVKKEP DAEPAEGEEE EAQPAATDAP NGDLTPEMIL
     SMMDR
//
DBGET integrated database retrieval system