ID W5P1H8_SHEEP Unreviewed; 634 AA.
AC W5P1H8;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0000259|PROSITE:PS50157};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000004277.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000004277.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000004277.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000004277.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01033362; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01033363; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 9940.ENSOARP00000004277; -.
DR PaxDb; 9940-ENSOARP00000004277; -.
DR Ensembl; ENSOART00000004352.1; ENSOARP00000004277.1; ENSOARG00000004005.1.
DR eggNOG; KOG1721; Eukaryota.
DR HOGENOM; CLU_002678_51_0_1; -.
DR OMA; HRPEAPC; -.
DR Proteomes; UP000002356; Chromosome 15.
DR Bgee; ENSOARG00000004005; Expressed in gastric lymph node and 55 other cell types or tissues.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 6.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR24384; FINGER PUTATIVE TRANSCRIPTION FACTOR FAMILY-RELATED; 1.
DR PANTHER; PTHR24384:SF226; ZINC FINGER PROTEIN 408; 1.
DR Pfam; PF00096; zf-C2H2; 4.
DR SMART; SM00355; ZnF_C2H2; 6.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 3.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 6.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 7.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00042}.
FT DOMAIN 359..386
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 387..414
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 415..442
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 443..465
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 477..498
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 499..526
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 527..554
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 1..44
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 93..115
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 182..356
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 98..115
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 230..253
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 276..305
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 634 AA; 67790 MW; 32EC5460403EA6E8 CRC64;
GMEATEELLL GRGRLQLAPD PRLGPDSGWS PPSEGCAPGL KDVPPGPTRA ILALKSLPRG
LALGPSLTAE QRLGVWCVGE PLPPGLLWGP LEEESVSEQK ANGVEPKRKE DESLGSWGDV
SLGSWGDVCA CEQSAGWTSL VQRGRLEGEG NVAPVRIGER LHLQVSRLVL PGFELWLWPQ
LLSEGPSPTQ PRPEEAASAA AEVESAVGQE AASPGEDTAE AYPDPGHQSP PSIQAESVVS
PDLTPQTQAL VPEESQPLGP LPADGSMDEE DLLQTQMPPE PQSSSTPQQG PARGEASSSS
SDRAPQLCAH LVKKLRSPRA KTSEPGAQVT GEPQRPGFPA RLRSPPGPAG GSPKQGRRYR
CGECGKAFLQ LCHLKKHAFV HTGHKPFLCT ECGKSYSSEE SFKAHMLGHR GVRPFPCPQC
DKAYGTRRDL REHQVVHSGA RPFSCEQCGK AFARRPSLRL HRKTHQVAAA PAPRPXGKAL
RDPHTLRAHE RLHSGERPFP CPQCGRAYTL ATKLRRHLKS HLADKPHRCP TCGMGYALLQ
SLRRHQLSHQ ARAPASPPCV PPAAPEPTVV LLQDAGSEGD SAPAGDIFEV TISQGQEKCF
VAPGEPGPPP GLVLIHKDLA ISAWAEVVEV ETGS
//