ID A0A3Q2KRW8_HORSE Unreviewed; 1251 AA.
AC A0A3Q2KRW8;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Zinc finger protein 521 {ECO:0000313|Ensembl:ENSECAP00000026315.1};
GN Name=ZNF521 {ECO:0000313|Ensembl:ENSECAP00000026315.1,
GN ECO:0000313|VGNC:VGNC:25337};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000026315.1, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000026315.1, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000026315.1,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000026315.1}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000026315.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3Q2KRW8; -.
DR Ensembl; ENSECAT00000035607.2; ENSECAP00000026315.1; ENSECAG00000006546.3.
DR VGNC; VGNC:25337; ZNF521.
DR GeneTree; ENSGT00940000159287; -.
DR Proteomes; UP000002281; Chromosome 8.
DR Bgee; ENSECAG00000006546; Expressed in cerebellum and 21 other cell types or tissues.
DR ExpressionAtlas; A0A3Q2KRW8; baseline.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 14.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR24406; TRANSCRIPTIONAL REPRESSOR CTCFL-RELATED; 1.
DR PANTHER; PTHR24406:SF30; ZINC FINGER PROTEIN 423; 1.
DR Pfam; PF00096; zf-C2H2; 5.
DR Pfam; PF13912; zf-C2H2_6; 5.
DR SMART; SM00355; ZnF_C2H2; 29.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 10.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 26.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 24.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 58..85
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 86..113
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 114..141
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 142..169
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 186..214
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 221..249
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 250..277
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 377..405
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 417..445
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 453..476
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 574..596
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 604..626
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 634..658
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 662..690
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 692..720
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 723..750
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 749..772
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 826..853
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 870..897
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 899..926
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1078..1106
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1165..1192
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1196..1224
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1226..1251
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 1..48
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 297..335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 803..822
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1047..1077
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1108..1128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 25..40
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1047..1061
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1251 AA; 141044 MW; 4D469293548FF607 CRC64;
MSRRKQAKPR SLKDGVDVED DPTCSWPASS PSSKDQTSPS HGEGCDFGEE EGGPGLPYPC
QFCDKSFSRL SYLKHHEQSH SDKLPFKCTY CSRLFKHKRS RDRHIKLHTG DKKYHCSECD
AAFSRSDHLK IHLKTHTSNK PYKCAICRRG FLSSSSLHGH MQVHERNKDG SQSGSRMEDW
KMKDTQKCSQ CEEGFDFPED LQKHIAECHP ECSPNEDRAA LQCVYCHELF VEETSLMNHM
EQMHGGEKKN SCSICSESFH TVEELYSHMD SHQQPESCNH SNSPSLVTVG YTSVSSTTPD
SNLSVDSSTM VEAAPPIPKS RGRKRAAQQT PDMTVPSSKQ AKVTYSCIYC NKQLFSSLAV
LQIHLKTMHL DKPEQAHICQ YCLEVLPSLY NLNEHLKQVH EAQDPGLIVS AMPAIVYQCN
FCSEVVNDLN TLQEHIRCSH GFANPAAKDS NAFFCPHCYM GFLTDSSLEE HIRQVHCDLS
GSRFGSPVLG TPKEPVVEVY SCSYCTNSPI FNSVLKLNKH IKENHKNIPL ALNYIHNGKK
SRALSPLSPV AIEQTSLKMM QAVGGAPARP AGEYICNQCG AKYTSLDSFQ THLKTHLDTV
LPKLTCPQCN KEFPNQESLL KHVTIHFMIT STYYICESCD KQFTSVDDLQ KHLLDMHTFV
FFRCTLCQEV FDSKVSIQLH LAVKHSNEKK VYRCTSCNWD FRNETDLQLH VKHNHLENQG
KVHKCIFCGE SFGTEVELQC HITTHSKKYN CKFCSKAFHA IILLEKHLRE KHCVFETKTP
NCGANGASEQ VQKEEVELQT LLTNSQESHN SHDGSEEDVD TSEPMYGCDI CGAAYTMETL
LQNHQLRDHN IRPGESAIVK KKAELIKGNY KCNVCSRTFF SENGLREHMQ THLGPVKHYM
CPICGERFPS LLTLTEHKVT HSKSLDTGNC RICKMPLQSE EEFLEHCQMH PDLRNSLTGF
RCVVCMQTVT STLELKIHGT FHMQKTGNGS AVQTTGRGQH VQKLYKCASC LKEFRSKQDL
VKLDINGLPY GLCAGCVNLS KSGSPGINIP PGTNRPGLGQ NENLSALEGK GKAGGLKTRC
SSCNVKFESE SELQNHIQTV HRELVPDSNS TQLKTPQVSP MPRISPSQSD EKKTYQCIKC
QMVFYNEWDI QVHVANHMID EGLNHECKLC SQTFDSPAKL QCHLIEHSFE GMGGTFKCPV
CFTVFVQANK LQQHIFSAHG QEDKIYDCTQ CPQKFFFQTE LQNHTMTQHS S
//