ID H3BAX7_LATCH Unreviewed; 267 AA.
AC H3BAX7;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 68.
DE SubName: Full=Homeobox C12 {ECO:0000313|Ensembl:ENSLACP00000019048.1};
GN Name=HOXC12 {ECO:0000313|Ensembl:ENSLACP00000019048.1};
OS Latimeria chalumnae (Coelacanth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Coelacanthiformes; Coelacanthidae; Latimeria.
OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000019048.1, ECO:0000313|Proteomes:UP000008672};
RN [1] {ECO:0000313|Proteomes:UP000008672}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT "The draft genome of Latimeria chalumnae.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLACP00000019048.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Sequence-specific transcription factor which is part of a
CC developmental regulatory system that provides cells with specific
CC positional identities on the anterior-posterior axis.
CC {ECO:0000256|ARBA:ARBA00003263}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFYH01067053; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_005995686.1; XM_005995624.2.
DR AlphaFoldDB; H3BAX7; -.
DR STRING; 7897.ENSLACP00000019048; -.
DR Ensembl; ENSLACT00000019181.1; ENSLACP00000019048.1; ENSLACG00000016761.1.
DR GeneID; 102367130; -.
DR KEGG; lcm:102367130; -.
DR CTD; 562600; -.
DR eggNOG; KOG0487; Eukaryota.
DR GeneTree; ENSGT00940000161307; -.
DR HOGENOM; CLU_087968_1_0_1; -.
DR InParanoid; H3BAX7; -.
DR OMA; YPMHSRT; -.
DR OrthoDB; 5317093at2759; -.
DR TreeFam; TF351604; -.
DR Proteomes; UP000008672; Unassembled WGS sequence.
DR Bgee; ENSLACG00000016761; Expressed in post-anal tail muscle and 1 other cell type or tissue.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR PANTHER; PTHR46440:SF2; HOMEOBOX PROTEIN HOX-C12; 1.
DR PANTHER; PTHR46440; HOMEOBOX PROTEIN HOX-D12-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000008672}.
FT DOMAIN 197..257
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 199..258
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 104..205
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 130..190
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 267 AA; 30183 MW; BC755FA01E4093F6 CRC64;
MGEHNLLNPG FVGPLVNIHT GDAFYFPNFR TSGGQLAGLP SLSYPRRDNV CSLPWTSSEP
CNGYPQPYLS NPVSINPSFN RACDIARAEE NKCYYRDACS ENSSLKREER ARDSSLVPHE
PGIPNGMNAS FSKYDYSNGE TMTQDPSSCQ SLESDSNSSL LNEGSKNSSN QPSTMSSPIS
NGNSLSTAGA PWYPMHTRSR KKRKPYSKLQ LAELEGEFMV NEFITRQRRR ELSDRLNLSD
QQVKIWFQNR RMKKKRLLLR EQALSFF
//