ID A0A2I0M2E4_COLLI Unreviewed; 1830 AA.
AC A0A2I0M2E4;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Zinc finger homeobox 3 {ECO:0000313|EMBL:PKK23842.1};
GN Name=ZFHX3 {ECO:0000313|EMBL:PKK23842.1};
GN ORFNames=A306_00010154 {ECO:0000313|EMBL:PKK23842.1};
OS Columba livia (Rock dove).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Columbiformes; Columbidae; Columba.
OX NCBI_TaxID=8932 {ECO:0000313|EMBL:PKK23842.1, ECO:0000313|Proteomes:UP000053872};
RN [1] {ECO:0000313|EMBL:PKK23842.1, ECO:0000313|Proteomes:UP000053872}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Blood {ECO:0000313|EMBL:PKK23842.1};
RX PubMed=23371554; DOI=10.1126/science.1230422;
RA Shapiro M.D., Kronenberg Z., Li C., Domyan E.T., Pan H., Campbell M.,
RA Tan H., Huff C.D., Hu H., Vickrey A.I., Nielsen S.C., Stringham S.A.,
RA Hu H., Willerslev E., Gilbert M.T., Yandell M., Zhang G., Wang J.;
RT "Genomic diversity and evolution of the head crest in the rock pigeon.";
RL Science 339:1063-1067(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PKK23842.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKCR02000046; PKK23842.1; -; Genomic_DNA.
DR STRING; 8932.A0A2I0M2E4; -.
DR InParanoid; A0A2I0M2E4; -.
DR Proteomes; UP000053872; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 2.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 5.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45891:SF4; ZINC FINGER HOMEOBOX PROTEIN 3; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00096; zf-C2H2; 2.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00355; ZnF_C2H2; 15.
DR SMART; SM00451; ZnF_U1; 5.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 4.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 9.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 6.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000053872};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 359..388
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 997..1027
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1038..1060
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1182..1213
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1233..1262
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1593..1621
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1701..1761
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 1703..1762
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 44..250
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 784..844
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 945..987
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1121..1166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1269..1339
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1454..1548
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1781..1800
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 68..88
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 89..124
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 155..208
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 799..814
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1273..1318
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1454..1485
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1486..1511
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1830 AA; 202820 MW; 7841824DCC546B7F CRC64;
MVNSVAFAWK ASRPSRPWQP XXXXXXXXXX XLLNLGGLTS SALKTPITSV PLGPLASSPT
ASAEGKDPGA AEGEKQEGGD QDSLSEKAEP VEEVEEEEEE DVEEEEEEEE EEEEEEEDED
DEGCKGLFPS ELEDELEERP QEDAGAVAGG GSSKKDLALS NQSISNSPLM PNVLQTLSRG
TASTSSNSAS SFVFDGANRR NHLSFNNEGG GASVAEGSRR LDFIDESANK DNATAPEPNE
SAEGEDGSYI SHHQHAGPLC ELGGGECPSG SGVECPKCDT VLGSSRSLGG HMTMMHSRNS
CKTLKCPKCN WHYKYQQTLE AHMKEKHPEP GGSCVYCKSG QPHPRLARGE SYTCGYKPFR
CEVCNYSTTT KGNLSIHMQS DKHLNNMQNL QNGGGEQVFS HTAGAAAAGG AGGGARAAAN
IGSTCGAPSP TKPKTKPTWR CEVCDYETNV ARNLRIHMTS EKHMHNMMLL QQNMSQIQHN
RHLGLGSLPS PAEAELYQYY LAQNMNLPNL KMDSTSSDAQ FMMGGFQLDP TNPMATMAPS
LVGGEIPLDM RLGGGQLVSE ELMNLGESFT QTNDPSLKLF QCAVCNKFTT DNLDMLGLHM
NVERSLPEDE WKAVMGDSYQ CKLCRYNTQL KANFQLHCKT DKHVQKYQLV AHIKEGGKAN
EWRLKCVAIG NPVHLKCNAC DYYTNSLEKL RLHTVNSRHE ASLKLYKHLQ HHESGVEGES
CYYHCVLCNY STKAKLNLIQ HVRSMKHQRS ESLRKLQRLQ KGLPEEEEDL GQIFTIRKCP
AAEAEESVED VEGPNETAGD AEEPAKEQES GGDKDQSKWT APASQAEKEL TEPPASSKRI
SFPSSSESPL SFKWSKTSEE TKSEQMYQCP YCKYSNTDVN RLRVHAMTQH SVQPMLRCPL
CQDMLNNKIH LQLHLTHLHS VSPDCVEKLI MTVTTPEVMM PSSMFLPAAA PEKDGNSAAE
ELGKQPEISE XXXSAEHSGD PKPVPADAGC AREDAGFLCW KKGCSQAFKS SAALQTHFNE
VHAKRPQLPV SDRHVYKYRC NQCSLAFKTI EKLQLHSQYH VIRAATMCCL CQRSFRTFQA
LKKHLETSHL ELSEADIQQL YGGLLVNGDL LTMGDPSLAE DHTIIVEEDK EEESDLEDKQ
SPTGSDSGSV QEDSGSEPKR ALPFRKGPNF TMEKFLDPSR PYKCTVCKES FTQKNILLVH
YNSVSHLHKL KRALQESATG QPEPTSSPDN KPFKCNTCNV AYSQSSTLEI HMRSVLHQTK
ARAAKLEAAG GSSNGVGSGS SSSGLALGSS TPSPVSASNS NTFTLGNPVS TNISSPSEPK
EANRKKLADM IASRQQQQQQ QQQQQQQQQQ QQAQTLAQAQ AQVQAHLQQE LQQQAALLQS
QLFNPALLPH FPMTTETLLQ LQQQQHLLFP FYIPSAEFQL NPEVSLPVTS GALTLTGTGP
SLLEDLKAQV QLPQQSHPQL LQQQQGQLSL SQPHSVLIQQ SQHPEKKNKS IVKEKEKETP
REREGAERGE NNVASKESLP DNLKPKEKKD FVAGNSSEPS LLPPRIASDA RGNATKALLE
NFGFELVIQY NENKQKVQKK NGKTEQGENL EKLECDTCGK FFSNILILKS HQEHVHQHYF
PFNSAPPITS PTMAPAQPSV PLTQLSMPME LPIFSPLMMQ TMPLQTLPAQ LPPQLGPVDP
LPADLAQLYQ HQLNPSLLQQ QQNKRPRTRI TDDQLRVLRQ YFDINNSPSE EQIKEMADKS
GLPQKVIKHW FRNTLFKERQ RNKDSPYNFS NPPITSLEEL KIDSRPPSPE PQKQEYWGSK
RSSRTRFTDY QLRVLQDFFD ANAYPKDDEL
//