ID I3KAT9_ORENI Unreviewed; 3192 AA.
AC I3KAT9;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 68.
DE SubName: Full=Zinc finger homeobox 4 {ECO:0000313|Ensembl:ENSONIP00000018234.2};
GN Name=ZFHX4 {ECO:0000313|Ensembl:ENSONIP00000018234.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000018234.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000018234.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8128.ENSONIP00000018234; -.
DR Ensembl; ENSONIT00000018251.2; ENSONIP00000018234.2; ENSONIG00000014495.2.
DR eggNOG; KOG1146; Eukaryota.
DR GeneTree; ENSGT00940000159542; -.
DR HOGENOM; CLU_000245_0_0_1; -.
DR InParanoid; I3KAT9; -.
DR OMA; DESKTGM; -.
DR TreeFam; TF323288; -.
DR Proteomes; UP000005207; Linkage group LG9.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 4.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 4.
DR Gene3D; 1.10.10.60; Homeodomain-like; 4.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45891:SF2; ZINC FINGER HOMEOBOX PROTEIN 4; 1.
DR Pfam; PF00046; Homeodomain; 4.
DR Pfam; PF00096; zf-C2H2; 1.
DR Pfam; PF12874; zf-met; 2.
DR SMART; SM00389; HOX; 4.
DR SMART; SM00355; ZnF_C2H2; 21.
DR SMART; SM00451; ZnF_U1; 7.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 7.
DR SUPFAM; SSF46689; Homeodomain-like; 4.
DR PROSITE; PS00027; HOMEOBOX_1; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 11.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 6.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 1073..1096
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1217..1248
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1269..1298
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1630..1658
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1783..1843
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 1880..1940
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 1968..1997
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2246..2306
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2318..2346
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2562..2622
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 1785..1844
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 1882..1941
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2248..2307
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2564..2623
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 126..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 315..370
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 385..468
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1009..1047
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1161..1199
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1295..1381
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1507..1583
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1677..1727
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1999..2018
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2025..2118
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2208..2252
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2346..2429
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2453..2494
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2517..2565
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2706..2728
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2756..2828
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2927..2996
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3063..3088
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3109..3162
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 353..370
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 385..399
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1161..1175
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1507..1531
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1532..1571
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1681..1725
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2025..2076
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2087..2101
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2102..2118
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2222..2252
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2353..2367
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2385..2429
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2531..2554
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2711..2728
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2772..2820
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2927..2968
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2969..2996
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3065..3088
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3128..3152
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3192 AA; 353338 MW; 5775D1C56312BA53 CRC64;
PHGTDTLLVL LSKQTLGSDC SAPATSPVAP AKEIPCNECS NSFSSLQKYM EHHCPNARLP
TTGGHEEGEE IEGMIGDESE DEGDREVGAA EVGEMNSEME MEEDSDVENL CGEIIYQPDG
SAFILEDSKD GQSGLSPGGE RKSLAADHPS FPNTSAGGLA GADGAAKNSC VSKDVPNNVD
LSKFEGCVAD GRRKPVLMCF LCKLSFGYSR SFVTHAVHDH RMTLNEQEQK LLSNKHISAI
IQGIGKDKEP LISFLEPKKT PNSVLPHFPS PANFLGPDTG LRGLWNAFHS SGENADSLQA
GFAFLKDLSQ YPPIKREPGM VGRESPEQDE DAYSSGGGAD MEAEEEEEQV MTMTGQRADS
TSSKDFPLLN QSISPLSNSV LKLNSDSKGP ASASLSSLPV SEKLEMEKRR DGLSPSSNPL
DMMTLRRDDE SPGPLHQHTG NPSTPGTPGT PGTPGPGEGS PGSGVECPKC DTVLGSSRSL
GGHMTMMHSR NSCKTLKCPK CNWHYKYQQT LDAHMKEKHP ESGGSCVYCR TGQAHPRLAR
GESYTCGYKP FRCEVCNYST TTKGNLSIHM QSDKHLNNVQ TLQNGGSEAQ YNHNHANPVP
SASLGGGCGA PSPSKPKQKP TWRCEVCDYE TNVARNLRIH MTSEKHMHNM MLLQQNMKQI
QHSLHLGLAP AEAELYQYYL AQNMGLADPS LSSLSPPIND PSLRLYQCAV CNCYSTDSLE
ALNAHVNAER SLAEEEWRCV VGDVYQCKLC SYNTQLKANF QLHCKTDKHM QKHQLVAHIK
EGGKANQWRL KCVAIGNPVH LKCNACDYYS NSVDKLRLHA TNQRHESAIR LYKVKTLFTH
AVSCKNQFSC VYYCTLCDYS TKARLNLVQH ARSARHQQNE GLRKLQLHQQ GLGGDEDGLS
LHELFHVKEC PSSQGLCFFN LLCFKDANRM QLHVMSQHSM QPVIRCPLCQ DVLSNKIHLQ
LHLTHLHSVA PDCVEKLILT KILTDKNLTL YFLRVGNISK DELANQDKND LDLQREELKP
PKEASEAPDW KRASGLRHDS KSPDNLQEHL SELQRLQQQQ QQLSVSDRHV YKYRCNHCSL
AFKTMQKLQI HSQYHAIRAA TMCSLCQRSF RTFLALRKHL ENGHPELSEA EVQQLIGNLP
LNGDITESEA RALEEAQAFE HDLDKDDEMD QEEKPSPTGS DSSSLLDDMG AEPKRTLPFR
KGPNFTMEKF LDPSRPYKCT VCKESFTQKN ILLVHYNSVS HLHKLKKVLQ EASSPVPQEP
SNSIDNKPFK CNICNVAYSQ SSTLEIHMRS VLHQTKARTA KNDMSSSSSS SSISGTSGPV
SSKSPGPSTQ GNASNLDAAR SGTPSSNKEN TVEPKEPNSS NNSKQAATTD HVSAQASSQQ
SAQSSAQLQL QLQHELQQQA AFFQPQFLNP AFLPHFPMTP EALLQFQQPQ FLFPFYIPGA
EFNLSPELAL HSAAAFGMPG LTGSFLEDLK QQMQQQHQLQ QAQVQQQVQQ QQQQQQQASQ
SQSQILQQKN QQLQNQKPKM EASTVSASEI QMSRDAEDHL EKQENRAKME NGGDALNDVG
KDNKDPKKSK FPEPLIPPPR IVSGARGNAA KALLENFGFE LVIQYNENRQ KNQRKSKDGV
EQVEMNTDKL ECGMCGKLFS NMLILKSHQE HVHSHFFPYV ELEKFAQQYR EAYDKLYPIN
PSSPETPPPP PPPPPPPPPP PPAPTPPPPP PPPPTAPPAP PQVQLPVSLD LPLFPPLMMQ
SVQHPGLPPQ LALQLPTMDS LSTDLTQLCQ QQLGLDPNFL RHSQFKRPRT RITDEQLKIL
RANFDINNSP NEEQIQEMSE KSGLPQKVIK HWFRNTLFKE RQRSKDSPYN FSIPPITTLE
DIRLEPQLSA QEYHRADSMM NKRSSRTRFT DYQLRVLQDF FDTNAYPKDD EIEQLSTVLN
LPTRVIVVWF QNARQKARKS YENQADSKDS EKKELTNERY IRTSNMQYQC KKCNIVFPRI
FDLITHQKKL CYKDEDEEGN EDNQCEEYSD SPEQGPFKIT QASLDMPKQS QTGTPSSSGS
SSPVMSSPRT TMGKTSPKPD LTLETEVKQT ETAPSPVIKS LPEPRPSKAC TPQPPPQKAP
QAQMSRPHSQ PQAAAVPSSP LSLALSSLTN SLPHQMLQYQ CDQCKIAFPT LELWQEHQHM
HFLAAQNQFL HSQFLERPID MPYMIFDPNN PLMASQLLSG GLSQIPSQGS SGLASAAGSG
AMKRKLDDKE ESANDKDGGN SSEEQHRDKR LRTTITPEQL EILYDKYLLD SNPTRKMLDH
IAREVGLKKR VVQVWFQNTR ARERKGQFRA IGPSQSHKKC PFCRALFKAK SALDSHIRSR
HWHEAKQAGF SLPPSPMMNQ DNERGESPNK YNFFDYPQLP TKTEPNEYEL PTASSTPVKP
SEAQVKNFLS PSSLKAENCD ETEGPNINSA EVSSYDLSKM DFDETSSINT AISDATTGDE
YNNNEVESLT ANGGDKLSDN KSGLMPNSDS GNERFQFSMV SPALSFSGKD CDSYFSSRDD
EFDENNDRSE SSSLADPSSP SPFGAGNPFS KSGKGSSAGD RPGHKRFRTQ MSNLQLKVLK
ACFSDYRTPT MQECEMLGNE IGLPKRVVQV WFQNARAKEK KFKINIGKPF MISQGSPEGP
RPECTLCGVK YTARMSVRDH IFSKQHITKV QETMGNQVDR EKDYLAPTTV RQLMAQQELD
RMKKAGEGLG LPGQQQQTSV DNNNALHGLS LPSGYPGLSG LPPVLLPGVN GPSSLPVFPP
NTPVSASNEL KQQGKDSERD NSKKPGDKPP QMKVKDRENE SSSRPETPSM TKKREKPCPA
PGKPGNETSL DAAQLQALQN ALAAGDPSSF FGGQFLPYFL PGFPNCFSPQ LPRGVQTGGY
FPPLCGMESL FPYGPAAVPQ AAMAAGLSPT ALLQQYQQYQ QSLADSLQKQ QQQQKQPEQQ
QKQQQQQKPT PVKTPSSVQS TTTNSFKPKE AIDAKDDNSK GSSTESTKEE PKTDTKITMD
FPDAVIVPST VLRKAVSNYT CVACNVVVNG NEALGQHLKS SLHKEKTIKQ AMRNAKEHTR
LLPHSVCSPT PNTTSTSQSA ASSNNTFPHL SRLSMKSWPN VLFQATTARK AASSSPSASP
PPPLSSPSTV TSTSCSTSGV PTSLPTESCS DESDNELSQK LDDLDNALEV KAKPASGLDA
SFSSIRMDMF SV
//