ID W5NPL4_SHEEP Unreviewed; 458 AA.
AC W5NPL4;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000000104.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000000104.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000000104.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000000104.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the even-skipped homeobox family.
CC {ECO:0000256|ARBA:ARBA00038449}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01052547; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01052548; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 9940.ENSOARP00000000104; -.
DR PaxDb; 9940-ENSOARP00000000104; -.
DR Ensembl; ENSOART00000000129.1; ENSOARP00000000104.1; ENSOARG00000000124.1.
DR eggNOG; KOG0844; Eukaryota.
DR HOGENOM; CLU_045075_0_0_1; -.
DR Proteomes; UP000002356; Chromosome 2.
DR Bgee; ENSOARG00000000124; Expressed in rectum and 3 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR46294:SF1; HOMEOBOX EVEN-SKIPPED HOMOLOG PROTEIN 2; 1.
DR PANTHER; PTHR46294; SEGMENTATION PROTEIN EVEN-SKIPPED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356}.
FT DOMAIN 188..236
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 190..237
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 82..112
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 133..185
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 393..431
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 439..458
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 146..162
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 397..413
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 458 AA; 47484 MW; 8603F983E55DA200 CRC64;
MMERIRKEMI LMERGLHSPT TGKRFSNLSD SAGNAVLEAL ENSQHPARLS PRLPSAPLHS
ALGDLPAKGK FEIDTLFNLQ HPSSESTVSS EIASAAEGRK KPGHYSEAAA EADMSSDVEV
GCSALRSPGG LGAVPLKENN GKGFAESGSA AGTTTSATGS GLGSLHGGGG GGGSGGGAAL
GGSGSGADQV RRYRTAFTRE QIARLEKEFY RENYVSRPRR CELAAALNLP ETTIKVLIPA
RACSSFPRVF AVSEGLAGLA DLGRSPNGAA HAANARLRPA RAQVPLTLLH GSNFPSAAKP
AGLCCSEPDP CPSGQFLTAQ RTQDLTRRLF TKAAAAASLQ RSPRILAQIL GLRDGCSARP
GNRWVARKRE APGAAAVGHR SVPTALYTAA HTVSAGREDS ARSEGRRPGH SKVWEGRGGP
GRGLGAPGXS GFLPYSAAVL SKTAVSPPDQ RDEAPLTR
//