ID A0A3Q3W8Z5_MOLML Unreviewed; 663 AA.
AC A0A3Q3W8Z5;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=Zinc fingers and homeoboxes protein 1 {ECO:0000256|ARBA:ARBA00040117};
OS Mola mola (Ocean sunfish) (Tetraodon mola).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Molidae; Mola.
OX NCBI_TaxID=94237 {ECO:0000313|Ensembl:ENSMMOP00000008372.1, ECO:0000313|Proteomes:UP000261620};
RN [1] {ECO:0000313|Ensembl:ENSMMOP00000008372.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Sequence-specific transcription factor which is part of a
CC developmental regulatory system that provides cells with specific
CC positional identities on the anterior-posterior axis.
CC {ECO:0000256|ARBA:ARBA00003263}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3Q3W8Z5; -.
DR Ensembl; ENSMMOT00000008525.1; ENSMMOP00000008372.1; ENSMMOG00000003370.1.
DR Proteomes; UP000261620; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00086; homeodomain; 4.
DR Gene3D; 1.10.10.60; Homeodomain-like; 5.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR15467:SF4; ZINC FINGERS AND HOMEOBOXES PROTEIN 1; 1.
DR PANTHER; PTHR15467; ZINC-FINGERS AND HOMEOBOXES RELATED; 1.
DR Pfam; PF00046; Homeodomain; 4.
DR SMART; SM00389; HOX; 4.
DR SUPFAM; SSF46689; Homeodomain-like; 4.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000261620}.
FT DOMAIN 142..185
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 311..371
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 396..446
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 481..541
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 144..186
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 313..372
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 398..447
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 483..542
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..55
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 439..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 605..663
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 20..49
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 444..478
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 625..645
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 646..663
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 663 AA; 74000 MW; 00DFE2A27CC98173 CRC64;
MSSRRKSTTP CMVLPAHVVE QNEPEEKTER KKEEEVEEEK GKEGAEEGPT AGELDQAVVV
VPTPPDTGET PHSALYRRLW TFKTSLTRSV CLIVITDEPN VSAVLSAFQA QHTAAAAAQS
QLLIPLSSIP SYSAAMDTNP LLGSTYKKFP YPSMAEINSL AAQTQFTEEQ IKVWFSAQRL
KHGVSWTPEE VEEARRKQFN GTVHTVPQTI TVIPAHQLSA AANGLQSILQ TCQIVGQPGL
VFTQVGPGGN LPVTSPITLT VAGMPSQSQS SNRVSCQPTL TNSDLKRATT VQPPSLSPQC
LCVCLCNTLF ADTFSLRPKK SKEQLAELKA SYLKNHFVTD AEIARLMNLT NLTKGEIKKW
FSDTRYNQRN SKNSHTRPKT WNPFPDFTLQ KFKEKTPEQL VVLEESFEKG STPSDEELSR
LRTETKLTRR EIDAWFTERR KMPSVSGGTG TSSSSLSASL SRRGSQTPPG GRSKQLPSTS
NKDLKDKNKK TPEQLHILKS AFVRTQWPTP EEYDQLAEES GLPRSYIVSW FGDSRYSWKN
SNLKWFFQYQ SWGRSRTRKQ PRRSASSGRE ILKEYYLKYR FLNEQDLDEL VTKTNMSYEQ
VIKSGDKAKK SVGASKASEE QAAAGMGADE DDEGDEEDGD DTDDSEVWEP SRSVRKSLSV
SED
//