ID A0A3Q2E0J1_CYPVA Unreviewed; 974 AA.
AC A0A3Q2E0J1;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=Zinc fingers and homeoboxes protein 1 {ECO:0000256|ARBA:ARBA00040117};
GN Name=ZHX1 {ECO:0000313|Ensembl:ENSCVAP00000025622.1};
OS Cyprinodon variegatus (Sheepshead minnow).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Cyprinodontidae;
OC Cyprinodon.
OX NCBI_TaxID=28743 {ECO:0000313|Ensembl:ENSCVAP00000025622.1, ECO:0000313|Proteomes:UP000265020};
RN [1] {ECO:0000313|Ensembl:ENSCVAP00000025622.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the ZHX family. {ECO:0000256|ARBA:ARBA00007440}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_015256438.1; XM_015400952.1.
DR AlphaFoldDB; A0A3Q2E0J1; -.
DR STRING; 28743.ENSCVAP00000025622; -.
DR Ensembl; ENSCVAT00000001206.1; ENSCVAP00000025622.1; ENSCVAG00000010524.1.
DR GeneID; 107101874; -.
DR KEGG; cvg:107101874; -.
DR CTD; 11244; -.
DR GeneTree; ENSGT00950000182893; -.
DR OMA; IMRIRSR; -.
DR OrthoDB; 5350395at2759; -.
DR Proteomes; UP000265020; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00086; homeodomain; 5.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 5.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR024578; Homez_homeobox_dom.
DR InterPro; IPR041057; ZHX_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR15467:SF4; ZINC FINGERS AND HOMEOBOXES PROTEIN 1; 1.
DR PANTHER; PTHR15467; ZINC-FINGERS AND HOMEOBOXES RELATED; 1.
DR Pfam; PF00046; Homeodomain; 4.
DR Pfam; PF11569; Homez; 1.
DR Pfam; PF18387; zf_C2H2_ZHX; 1.
DR SMART; SM00389; HOX; 5.
DR SMART; SM00355; ZnF_C2H2; 2.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 5.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000265020};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 111..139
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 321..364
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 483..533
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 643..693
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 723..783
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 323..365
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 485..534
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 645..694
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 725..784
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..62
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 447..484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 599..624
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 699..727
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 800..835
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 887..974
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 20..34
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 458..476
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 599..622
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 709..723
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 806..826
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 894..918
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 919..942
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 943..957
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 974 AA; 107612 MW; 09217A0411A26D41 CRC64;
MSSRRKSTTP CMVPTQEALD SDQEVEDVMD ITDPEDSNEG AALYSEVPSL SEDPAEDADV
RPDAVPDPYL DLNTAEGGYE CKYCSFQTSE LNLFTMHVDT EHPDVVLNAS YVCVECDYHT
KSYDTLQAHN ARLHPGEDNF TRTMVKRNNE TIFQQTVNDL TFDGSFVKVE EDEAEVTSRK
AIPLSKTPIM RIKSRPEPKK FAAAQKLPMD VIKVESDDEY DENQEPPTLS PAPTALAAST
PRLIPVSTPL QVQAVPQGIM VNSSNMLQIK GGSSGGSVLP PGTLAQVLSA LQTQQSPQTQ
LLIPISSIPT YNSAMDNNVL LVSAYNRFPY PSVSEIIGLS AQTKFSEEQI KVWFSAQRLK
HGVSWTPEEV EEARRKKFNG TVQTVPQTIT VIPANIGAST NGLQSIFQTC QIVGQPGLVL
TQVAGSGSTV PVASPITLTV AGVPGNQPKA AEASAPESNA DVSASSASTS LSLDAGATKP
KKSKEQLAEL KASYSRRQFA TEEEISRLME VTKLSKRAIK KWFSDTRYNQ RNSRDHHSLL
LSETSSSRAA LSGAGRAARS SSNSFTENIN SETLSDLSTT TVGTPTTATM TTTTIVIDSS
DDASDSSPTT ANAPSSSVSA SDPRVKFRRA FPDFTPQRFK EKTTEQLQIL EASFQKSDMP
SDEELSRLRS ETKLTRREVD AWFTERRKTP SAALLRDSDT EMEEKVKPTT TPAVSQEGQT
TTPPAGRKIL KKTPEQLHVL KKAFVRTQWP TSEEYDQMAE ETGLPRTYIV NWFGDTRYAC
KNSNLKWYYL YQSGKVDEAL NGAGKGQKKS RKRSRGWSRR TRRPSPCKRS PQGGAGLIKV
KSGKTFLKDY YLKHRALSEK DLDDLVTKSK MSYEQVRDWF SDTARRVEEG KEPFSDEDGE
EEDEEEEEEE EEEVATTECR DSEGEMEAKE QGEASSKDEE VKDETEDPSE EEEKGVQEDP
EGVGQSQPQT EEQT
//