Database: UniProt
Entry: A0A452FQ65_CAPHI
ID   A0A452FQ65_CAPHI        Unreviewed;       738 AA.
AC   A0A452FQ65;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   24-JAN-2024, entry version 24.
DE   SubName: Full=SIX homeobox 5 {ECO:0000313|Ensembl:ENSCHIP00000026476.1};
GN   Name=SIX5 {ECO:0000313|Ensembl:ENSCHIP00000026476.1};
OS   Capra hircus (Goat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC   Caprinae; Capra.
OX   NCBI_TaxID=9925 {ECO:0000313|Ensembl:ENSCHIP00000026476.1, ECO:0000313|Proteomes:UP000291000};
RN   [1] {ECO:0000313|Ensembl:ENSCHIP00000026476.1, ECO:0000313|Proteomes:UP000291000}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T.,
RA   Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M.,
RA   Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.,
RA   Schroeder S.G., Liu G.E., Dunham M., Shendure J., Sonstegard T.S.,
RA   Phillippy A.M., Van Tassell C.P., Smith T.P.;
RT   "Polished mammalian reference genomes with single-molecule sequencing and
RT   chromosome conformation capture applied to the Capra hircus genome.";
RL   Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCHIP00000026476.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC       ECO:0000256|RuleBase:RU000682}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LWLT01000020; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_017918113.1; XM_018062624.1.
DR   AlphaFoldDB; A0A452FQ65; -.
DR   STRING; 9925.ENSCHIP00000026476; -.
DR   Ensembl; ENSCHIT00000034339.1; ENSCHIP00000026476.1; ENSCHIG00000022800.1.
DR   GeneID; 102190256; -.
DR   KEGG; chx:102190256; -.
DR   CTD; 147912; -.
DR   GeneTree; ENSGT00940000162237; -.
DR   OrthoDB; 5265771at2759; -.
DR   Proteomes; UP000291000; Chromosome 18.
DR   Bgee; ENSCHIG00000022800; Expressed in testis and 17 other cell types or tissues.
DR   GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR   GO; GO:0005794; C:Golgi apparatus; IEA:Ensembl.
DR   GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR   GO; GO:0001228; F:DNA-binding transcription activator activity, RNA polymerase II-specific; IEA:Ensembl.
DR   GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:Ensembl.
DR   GO; GO:0002088; P:lens development in camera-type eye; IEA:Ensembl.
DR   GO; GO:0160024; P:Leydig cell proliferation; IEA:Ensembl.
DR   GO; GO:0045892; P:negative regulation of DNA-templated transcription; IEA:Ensembl.
DR   GO; GO:1902723; P:negative regulation of skeletal muscle satellite cell proliferation; IEA:Ensembl.
DR   GO; GO:0007286; P:spermatid development; IEA:Ensembl.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR031701; SIX1_SD.
DR   PANTHER; PTHR10390; HOMEOBOX PROTEIN SIX; 1.
DR   PANTHER; PTHR10390:SF40; HOMEOBOX PROTEIN SIX5; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   Pfam; PF16878; SIX1_SD; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000291000}.
FT   DOMAIN          208..259
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        210..260
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          1..81
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          248..305
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          361..440
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          614..646
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        272..286
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        416..440
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        616..633
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   738 AA;  75110 MW;  377376E0C38F8D8A CRC64;
     MATLPAEPSA RPAAGGEAVV AAAATEEEEE EARQLLQTLQ AAEGEAAAAA GAGAGETAVK
     VEGPGSPGVP GSPPEAAAEP PMGLRFSPEQ VACVCEALLQ AGHAGRLSRF LGALPPAERL
     RGSDPVLRAR ALVAFQRGEY AELYRLLESR PFPAAHHAFL QDLYLRARYH EAERARGRAL
     GAVDKYRLRK KFPLPKTIWD GEETVYCFKE RSRAALKACY RGNRYPTPDE KRRLATLTGL
     SLTQVSNWFK NRRQRDRTGG GGGAPCKSES DGNPTTEDES SRSPEDLERG AAPAAAEGPA
     PGSIFLAGAS PPAPCPASSS ILVNGSFLAA GSSPAVLLNG SPVIINSLAL GEASGLGPLL
     LTGGAPAPQP SPQGPSEGKT SLVLDPQTGE VRLEEAQPEA PETKGAQVTA SGPPGEEVPT
     PLPQVVPGPP TAATFPLPPG PVTSMAAPQV VPLSPPPGYP AGLGPTSPLL NLPQVVPTSQ
     VVTLPQAVGP LQLLAAGPGS PVKVAGASGP ANVHLINSGV GVTALQLPSA TTPGNFLLAN
     SVSGSPIVTG VAVQQGKIIL TATFPTSMLV SQVLPPAPSL ALPLKPDTAI SVPEGALPVA
     TSPALPEAHA LGALPVQQPP QPPPTPATPT PSLPFSPDSS GLLPGFPAPP TEGLLLSPAA
     VPIWPAGLEL SASTEGLLEE EKGLGTQAPH TVLRLPDPDP EGLLLGATAG GEVDEGLEAE
     TKVLTQLQSV PVEEPLEL
//