ID H0VIH1_CAVPO Unreviewed; 420 AA.
AC H0VIH1;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2012, sequence version 1.
DT 27-MAR-2024, entry version 78.
DE SubName: Full=Homeobox containing 1 {ECO:0000313|Ensembl:ENSCPOP00000010003.2};
GN Name=HMBOX1 {ECO:0000313|Ensembl:ENSCPOP00000010003.2};
OS Cavia porcellus (Guinea pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Caviidae;
OC Cavia.
OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000010003.2, ECO:0000313|Proteomes:UP000005447};
RN [1] {ECO:0000313|Proteomes:UP000005447}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2N {ECO:0000313|Proteomes:UP000005447};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSCPOP00000010003.2}
RP IDENTIFICATION.
RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000010003.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKN02003346; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003347; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003348; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003349; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003350; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003351; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003352; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003353; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003354; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02003355; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_003479437.1; XM_003479389.3.
DR STRING; 10141.ENSCPOP00000031260; -.
DR Ensembl; ENSCPOT00000011228.3; ENSCPOP00000010003.2; ENSCPOG00000011124.4.
DR Ensembl; ENSCPOT00000033516.1; ENSCPOP00000021452.1; ENSCPOG00000011124.4.
DR Ensembl; ENSCPOT00000036403.1; ENSCPOP00000031482.1; ENSCPOG00000011124.4.
DR Ensembl; ENSCPOT00000039887.1; ENSCPOP00000031260.1; ENSCPOG00000011124.4.
DR GeneID; 100715846; -.
DR KEGG; cpoc:100715846; -.
DR CTD; 79618; -.
DR VEuPathDB; HostDB:ENSCPOG00000011124; -.
DR eggNOG; ENOG502QQSR; Eukaryota.
DR GeneTree; ENSGT00940000154928; -.
DR HOGENOM; CLU_052355_1_0_1; -.
DR OMA; CLAVMEX; -.
DR OrthoDB; 5399075at2759; -.
DR TreeFam; TF320327; -.
DR Proteomes; UP000005447; Unassembled WGS sequence.
DR Bgee; ENSCPOG00000011124; Expressed in testis and 12 other cell types or tissues.
DR GO; GO:0005813; C:centrosome; IEA:Ensembl.
DR GO; GO:0000781; C:chromosome, telomeric region; IEA:Ensembl.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0016604; C:nuclear body; IEA:Ensembl.
DR GO; GO:0003691; F:double-stranded telomeric DNA binding; IEA:Ensembl.
DR GO; GO:0042802; F:identical protein binding; IEA:Ensembl.
DR GO; GO:0044877; F:protein-containing complex binding; IEA:Ensembl.
DR GO; GO:1990837; F:sequence-specific double-stranded DNA binding; IEA:Ensembl.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:InterPro.
DR GO; GO:0032212; P:positive regulation of telomere maintenance via telomerase; IEA:Ensembl.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd00093; HTH_XRE; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 1.
DR InterPro; IPR001387; Cro/C1-type_HTH.
DR InterPro; IPR040363; HMBOX1.
DR InterPro; IPR006899; HNF-1_N.
DR InterPro; IPR044869; HNF-1_POU.
DR InterPro; IPR044866; HNF_P1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR PANTHER; PTHR14618:SF0; HOMEOBOX-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR14618; HOMEODOX-CONTAINING PROTEIN 1 HMBOX1; 1.
DR Pfam; PF04814; HNF-1_N; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 1.
DR PROSITE; PS51937; HNF_P1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51936; POU_4; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000005447};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 18..49
FT /note="HNF-p1"
FT /evidence="ECO:0000259|PROSITE:PS51937"
FT DOMAIN 145..241
FT /note="POU-specific atypical"
FT /evidence="ECO:0000259|PROSITE:PS51936"
FT DOMAIN 265..340
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 267..341
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 56..136
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 353..385
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 64..136
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 420 AA; 47194 MW; D141FEFFAC30508C CRC64;
MLSSFPVALL ETMSHYTDEP RFTIEQIDLL QRLRRTGMTK HEILHALETL DRLDQEHSDK
FGRRSSYGGS SYGNNTHNVP ASSSTATAST QTQHSGMSPS PSNSYDTSPQ PCTTNQNGRE
NSERLSTSNG KMSPTRYHAN SMGQRTYSFE ASEEDLDVDD KVEELMRRDS SVIKEEIKAF
LANRRISQAV VAQVTGISQS RISHWLLQLG SDLSEQKKRA FYRWYQLEKT NPGATLSMRP
APIPIEDLEW RQTPPPVSAT PGTLRLRRGS RFTWRKECLA VMESYFNGNQ YPDEAKREEI
ANACNAVIQK PGKKLSDLER VTSLKVYNWF ANRRKEIKRR ANIEAAILES HGIDVQSPGG
HSNSDDVDGN DYSEQDDSTS HSDHQDPISL AVEMAAVNHT ILALAQQGTN EIKTEALDDD
//