GenomeNet

Database: UniProt
Entry: H0VIH1_CAVPO
LinkDB: H0VIH1_CAVPO
Original site: H0VIH1_CAVPO 
ID   H0VIH1_CAVPO            Unreviewed;       420 AA.
AC   H0VIH1;
DT   22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT   22-FEB-2012, sequence version 1.
DT   27-MAR-2024, entry version 78.
DE   SubName: Full=Homeobox containing 1 {ECO:0000313|Ensembl:ENSCPOP00000010003.2};
GN   Name=HMBOX1 {ECO:0000313|Ensembl:ENSCPOP00000010003.2};
OS   Cavia porcellus (Guinea pig).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Caviidae;
OC   Cavia.
OX   NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000010003.2, ECO:0000313|Proteomes:UP000005447};
RN   [1] {ECO:0000313|Proteomes:UP000005447}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=2N {ECO:0000313|Proteomes:UP000005447};
RX   PubMed=21993624; DOI=10.1038/nature10530;
RA   Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA   Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA   Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA   Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA   Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA   Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA   Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA   Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA   Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA   Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA   Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA   Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA   Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA   Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA   Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT   "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL   Nature 478:476-482(2011).
RN   [2] {ECO:0000313|Ensembl:ENSCPOP00000010003.2}
RP   IDENTIFICATION.
RC   STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000010003.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAKN02003346; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003347; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003348; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003349; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003350; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003351; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003352; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003353; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003354; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAKN02003355; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_003479437.1; XM_003479389.3.
DR   STRING; 10141.ENSCPOP00000031260; -.
DR   Ensembl; ENSCPOT00000011228.3; ENSCPOP00000010003.2; ENSCPOG00000011124.4.
DR   Ensembl; ENSCPOT00000033516.1; ENSCPOP00000021452.1; ENSCPOG00000011124.4.
DR   Ensembl; ENSCPOT00000036403.1; ENSCPOP00000031482.1; ENSCPOG00000011124.4.
DR   Ensembl; ENSCPOT00000039887.1; ENSCPOP00000031260.1; ENSCPOG00000011124.4.
DR   GeneID; 100715846; -.
DR   KEGG; cpoc:100715846; -.
DR   CTD; 79618; -.
DR   VEuPathDB; HostDB:ENSCPOG00000011124; -.
DR   eggNOG; ENOG502QQSR; Eukaryota.
DR   GeneTree; ENSGT00940000154928; -.
DR   HOGENOM; CLU_052355_1_0_1; -.
DR   OMA; CLAVMEX; -.
DR   OrthoDB; 5399075at2759; -.
DR   TreeFam; TF320327; -.
DR   Proteomes; UP000005447; Unassembled WGS sequence.
DR   Bgee; ENSCPOG00000011124; Expressed in testis and 12 other cell types or tissues.
DR   GO; GO:0005813; C:centrosome; IEA:Ensembl.
DR   GO; GO:0000781; C:chromosome, telomeric region; IEA:Ensembl.
DR   GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR   GO; GO:0016604; C:nuclear body; IEA:Ensembl.
DR   GO; GO:0003691; F:double-stranded telomeric DNA binding; IEA:Ensembl.
DR   GO; GO:0042802; F:identical protein binding; IEA:Ensembl.
DR   GO; GO:0044877; F:protein-containing complex binding; IEA:Ensembl.
DR   GO; GO:1990837; F:sequence-specific double-stranded DNA binding; IEA:Ensembl.
DR   GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR   GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:InterPro.
DR   GO; GO:0032212; P:positive regulation of telomere maintenance via telomerase; IEA:Ensembl.
DR   CDD; cd00086; homeodomain; 1.
DR   CDD; cd00093; HTH_XRE; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 1.
DR   InterPro; IPR001387; Cro/C1-type_HTH.
DR   InterPro; IPR040363; HMBOX1.
DR   InterPro; IPR006899; HNF-1_N.
DR   InterPro; IPR044869; HNF-1_POU.
DR   InterPro; IPR044866; HNF_P1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR   PANTHER; PTHR14618:SF0; HOMEOBOX-CONTAINING PROTEIN 1; 1.
DR   PANTHER; PTHR14618; HOMEODOX-CONTAINING PROTEIN 1 HMBOX1; 1.
DR   Pfam; PF04814; HNF-1_N; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 1.
DR   PROSITE; PS51937; HNF_P1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
DR   PROSITE; PS51936; POU_4; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000005447};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   DOMAIN          18..49
FT                   /note="HNF-p1"
FT                   /evidence="ECO:0000259|PROSITE:PS51937"
FT   DOMAIN          145..241
FT                   /note="POU-specific atypical"
FT                   /evidence="ECO:0000259|PROSITE:PS51936"
FT   DOMAIN          265..340
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        267..341
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          56..136
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          353..385
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        64..136
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   420 AA;  47194 MW;  D141FEFFAC30508C CRC64;
     MLSSFPVALL ETMSHYTDEP RFTIEQIDLL QRLRRTGMTK HEILHALETL DRLDQEHSDK
     FGRRSSYGGS SYGNNTHNVP ASSSTATAST QTQHSGMSPS PSNSYDTSPQ PCTTNQNGRE
     NSERLSTSNG KMSPTRYHAN SMGQRTYSFE ASEEDLDVDD KVEELMRRDS SVIKEEIKAF
     LANRRISQAV VAQVTGISQS RISHWLLQLG SDLSEQKKRA FYRWYQLEKT NPGATLSMRP
     APIPIEDLEW RQTPPPVSAT PGTLRLRRGS RFTWRKECLA VMESYFNGNQ YPDEAKREEI
     ANACNAVIQK PGKKLSDLER VTSLKVYNWF ANRRKEIKRR ANIEAAILES HGIDVQSPGG
     HSNSDDVDGN DYSEQDDSTS HSDHQDPISL AVEMAAVNHT ILALAQQGTN EIKTEALDDD
//
DBGET integrated database retrieval system