ID A0A493TQY8_ANAPP Unreviewed; 3652 AA.
AC A0A493TQY8;
DT 05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Zinc finger homeobox 3 {ECO:0000313|Ensembl:ENSAPLP00000028293.1};
GN Name=ZFHX3 {ECO:0000313|Ensembl:ENSAPLP00000028293.1};
OS Anas platyrhynchos platyrhynchos (Northern mallard).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC Anatinae; Anas.
OX NCBI_TaxID=8840 {ECO:0000313|Ensembl:ENSAPLP00000028293.1, ECO:0000313|Proteomes:UP000016666};
RN [1] {ECO:0000313|Ensembl:ENSAPLP00000028293.1, ECO:0000313|Proteomes:UP000016666}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Hou Z.-C., Zhou Z.-K., Zhu F., Hou S.-S.;
RT "A new Pekin duck reference genome.";
RL Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSAPLP00000028293.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSAPLT00000039335.1; ENSAPLP00000028293.1; ENSAPLG00000002327.2.
DR GeneTree; ENSGT00940000156149; -.
DR Proteomes; UP000016666; Chromosome 12.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 4.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 5.
DR Gene3D; 1.10.10.60; Homeodomain-like; 4.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45891:SF4; ZINC FINGER HOMEOBOX PROTEIN 3; 1.
DR Pfam; PF00046; Homeodomain; 4.
DR Pfam; PF00096; zf-C2H2; 2.
DR SMART; SM00389; HOX; 4.
DR SMART; SM00355; ZnF_C2H2; 22.
DR SMART; SM00451; ZnF_U1; 7.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 6.
DR SUPFAM; SSF46689; Homeodomain-like; 4.
DR PROSITE; PS00027; HOMEOBOX_1; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 13.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 9.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000016666};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 722..751
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1330..1360
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1371..1393
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1515..1546
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1566..1595
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1960..1988
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2120..2180
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2217..2277
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2305..2334
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2507..2529
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2616..2676
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2688..2718
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2921..2981
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 2122..2181
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2219..2278
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2618..2677
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2923..2982
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..78
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 98..126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 412..533
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 590..616
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1147..1180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1272..1323
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1460..1505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1599..1655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1845..1914
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2007..2058
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2181..2219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2353..2374
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2388..2484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2596..2624
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2748..2773
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2818..2850
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2889..2924
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3114..3224
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3340..3408
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3507..3652
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1689..1730
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 18..36
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 412..426
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..450
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 451..485
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 507..533
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1163..1180
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1606..1655
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1853..1878
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2010..2053
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2408..2455
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2456..2471
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2818..2849
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2889..2917
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3114..3171
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3172..3186
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3188..3216
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3340..3362
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3369..3383
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3521..3535
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3536..3585
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3586..3608
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3652 AA; 399636 MW; 811D05492F6CFE60 CRC64;
MEGCDSPVIP GKDNGCGIPQ HQQWTDLNST HLPDTAGSME QPAAESRGPL DSLRVPFPER
VPESSAAAGP GTEPAGKEVS CGQCAASFAG LQSYMEHRCA GARPPPPLRG ESGSESSEDG
DEESDVENLA GEIVYQPDGS AYIVESLSQL VQSGGACGSG SGALPSLLPN SLSKQGEPSA
AAPVYPQIIN TFHIASSFGK WFEGSDQAFP NTSALAGISP VLHSFRVFDV RHKSNKDYLN
SDGSAKSSCV SKDVPNNVDL SKFDGFVLYG KRKPILMCFL CKLSFGYVRS FVTHAVHDHR
MTLSEEERKI LSNKNISAII QGIGKDKEPL VSFLEPKNKT FQHPLVSTAN LIGPGHSFYG
KFSGIRMEGE QALQAGPAGG AEPPPSGVLA PGTLLNLGGL TSSALKTPIT SVPLGPLASS
PTKSSEGKDP GVAEGEKQEG GDQDSLSEKA EPVEEVEEEE EEEVEEEEEE EEEEEEEEDE
DDEGCKGLFP SELEDELEDR PQEDSGAVAG GSSSSSKKDL ALSNQSISNS PLMPNVLQTL
SRGTASTSSN SASSFVFDGA NRRNHLSFNN EGGGANVAEG SRRLDFIDES ANKDNATAPE
PNESTEGEDG SYISHHQHAG PLCELGGGEC PSGSGVECPK CDTVLGSSRS LGGHMTMMHS
RNSCKTLKCP KCNWHYKYQQ TLEAHMKEKH PEPGGSCVYC KSGQPHPRLA RGESYTCGYK
PFRCEVCNYS TTTKGNLSIH MQSDKHLNNM QNLQNGGGEQ VFSHTAGAAA AAAAAAAAAA
ANIGSTCGAP SPTKPKTKPT WRCEVCDYET NVARNLRIHM TSEKHMHNMM LLQQNMSQIQ
HNRHLGLGSL PSPAEAELYQ YYLAQNMNLP NLKMDSTSSD AQFMMGGFQL DPTNPMATMT
PSLVGGEIPL DMRLGGGQLV SEELMNLGES FTQTNDPSLK LFQCAVCNKF TTDNLDMLGL
HMNVERSLPE DEWKAVMGDS YQCKLCRYNT QLKANFQLHC KTDKHVQKYQ LVAHIKEGGK
ANEWRLKCVA IGNPVHLKCN ACDYYTNSLE KLRLHTVNSR HEASLKLYKH LQHHESGVEG
ESCYYHCVLC NYSTKAKLNL IQHVRSMKHQ RSESLRKLQR LQKGLPEEEE DLGQIFTIRK
CPAADAAPAS QAEKELTEPP ASSKRISFPS SSESPLSFKW SKTSEETKSE QMYQCPYCKY
SNTDVNRLRV HAMTQHSVQP MLRCPLCQDM LNNKIHLQLH LTHLHSVSPD CVEKLIMTVT
TPEVMMPSSM FLPAATPEKD GNSTTEELGK QPEISEDSGK SVLPSEGAEH SSDPKPVAAD
QSSARDDSGF LCWKKGCNQV FKSSAALQTH FNEVHAKRPQ LPVSDRHVYK YRCNQCSLAF
KTIEKLQLHS QYHVIRAATM CCLCQRSFRT FQALKKHLET SHLELSEADI QQLYGGLLVN
GDLLAMGDPS LAEDHTIIVE EDKEEESDLE DKQSPTGSDS GSVQEDSGSE PKRALPFRKG
PNFTMEKFLD PSRPYKCTVC KESFTQKNIL LVHYNSVSHL HKLKRALQES ATGQPEPTSS
PDNKPFKCNT CNVAYSQSST LEIHMRSVLH QTKARAAKLE AAGGSGSSNG AGNSSSSSLS
LGSSTPSPVS TNNNNTFTTT NTSNSGTTPI PSLLNQVSSD SLGIAPIGNP VSTSISSPSE
PKEVNRKKLA DMIASRQQQQ QQQQQQQQQQ QQQQQQQQQQ AQTLAQAQAQ VQAHLQQELQ
QQAALLQSQL FNPALLPHFP MTTETLLQLQ QQQHLLFPFY IPSAEFQLNP EVSLPVTSGA
LTLTGTGPSL LEDLKAQVQL PQQSHPQLLQ QQQGQLSLSQ PHVLIQQSQH PEKKNKSVVK
EKEKETPRER ESAERGDNNA ASKESLPDNL KPKDKKDFVP GSNSEPSLLP PRIASDARGN
ATKALLENFG FELVIQYNEN KQKVQKKNGK TEQGENLEKL ECDTCGKFFS NILILKSHQE
HVHQHYFPFK QLERFAKQYR EHYDKLYPLR PQTPEPPPPP PPPPPPLPPA PPQPASTPTI
PTSAPPITSP TIAPAQPSVP LTQLSMPMEL PIFSPLMMQT MPLQTLPAQL PPQLGPVDPL
PADLAQLYQH QLNPSLLQQQ QNKRPRTRIT DDQLRVLRQY FDINNSPSEE QIKEMADKSG
LPQKVIKHWF RNTLFKERQR NKDSPYNFSN PPITSLEELK IDSRPPSPEP QKQEYWGSKR
SSRTRFTDYQ LRVLQDFFDA NAYPKDDEFE QLSNLLNLPT RVIVVWFQNA RQKARKNYEN
QGEGKDGERR ELTNDRYIRT SNLNYQCKKC SLVFQRIFDL IKHQKKLCYK DEDEDGQDDS
QNEDSMDAME LLTPTSSSCS TPMPSQAYST PTSSANATSS AFLQLTAEAD DSSTYSSKVE
ATDEKPKQSE PPSTQQNQTQ EKQVQPKQES QQQQDQGEQK TNSAHQKISQ LSSPSSLQQP
PPPPQPPQCS LSQSSPSPSQ LSHLSLKPIH TSTPQQLANL PPQLIPYQCD QCKLAFPSFE
HWQEHQQLHF LSAQNQFIHP PFLDRTLDMP FMLFDPSNPL LASQLLSGTL PQIPASSATS
PSTPTSTMNT LKRKLEEKAS ASPGENDSGT GGEEPQRDKR LRTTITPEQL EILYQKYLLD
SNPTRKMLDH IAHEVGLKKR VVQVWFQNTR ARERKGQFRA VGPAQAHRRC PFCRALFKAK
TALEAHIRSR HWHEAKRAGY NLTLSAMLLD CDGGLQMKGD IFDGASFSHM PPTSSDGQSV
PLSPVNKSME LSPRTLLSPS SIKVEGIEDF ESPSMSSVNL SFDQTKLDND DCSSVNTAIT
DTTTGDEGNA DNDSATGIAT ETKSASGPSE GLTKAAMIAM SEYEDRLSSG LVSPAPSFYS
KEYDNEGTVD YSETSSLADP CSPSPGASGS ASKSGESGDR PGQKRFRTQM TNLQLKVLKS
CFNDYRTPTM LECEVLGNDI GLPKRVVQVW FQNARAKEKK SKLSMAKHFG INQTSYEGPK
TECTLCGIKY SARLSVRDHI FSQQHISKVK ETIGSQLDKE KEYFDPATVR QLMAQQELDR
IKKANEVLGL AAQQQGMFDN TPLQALNLPA AYPALQGIPP VLLPGLNSPS LPGFTPSNTA
LTSPKPNLMG LPSTSVPSPG LPTSGLPNKQ PPAALSSPTP AQATAAVAPQ QPPATTPQPQ
QQQPPPPQRK EKESEKAKEK EKAPKGKGDP LPGPKKEKGE APAGALSAPL PAMEYAVEPA
QLQALQAALN SDPTTLLTSQ FLPYFVPGFS PYYAPQIPGA LQSGYLQPMY GMEGLFPYSP
ALSQALLGLS PGSLLQQYQQ YQQSLQEALQ QQQRQAQQQQ QQQQQQQKAQ QPKGSQTPVP
TGAATPDKDP AKEPPKQEEQ KNAPREVSPL LPKPPEEPEA EGKGADSLCD PFIVPKVQYK
LVCRKCQAGF GDQEAASDHL KSLCFFGQSM ANLQEMVLHV PTGSGQGSGS YQCVACESTV
CGDEALSQHL ESAPHKHRTI TRAARNAKEH PSLLPHSACP PDPSTASTSQ SAAHSNDSPP
PPPPPPPPPP RPPPASPHAS RKPWPPAAPR APPGKPPPPP FPPLSSSSTV TSSSCSTSGV
QPSMPTDDYS EESDTDLSQK SDGPASPAEG PRDPGCPKDS GLASVGMDTF RL
//