ID W5QAH7_SHEEP Unreviewed; 612 AA.
AC W5QAH7;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN Name=HMGXB4 {ECO:0000313|Ensembl:ENSOARP00000019722.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000019722.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000019722.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000019722.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000019722.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01082199; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5QAH7; -.
DR STRING; 9940.ENSOARP00000019722; -.
DR PaxDb; 9940-ENSOARP00000019722; -.
DR Ensembl; ENSOART00000019995.1; ENSOARP00000019722.1; ENSOARG00000018373.1.
DR eggNOG; ENOG502QSH9; Eukaryota.
DR HOGENOM; CLU_032486_0_0_1; -.
DR OMA; MKPLFVN; -.
DR Proteomes; UP000002356; Chromosome 3.
DR Bgee; ENSOARG00000018373; Expressed in epididymis and 55 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21982; HMG-box_HMGXB4; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR025228; DUF4171.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR042477; HMGXB4.
DR InterPro; IPR048016; HMGXB4_HMG-box.
DR PANTHER; PTHR46584; HMG DOMAIN-CONTAINING PROTEIN 4; 1.
DR PANTHER; PTHR46584:SF1; HMG DOMAIN-CONTAINING PROTEIN 4; 1.
DR Pfam; PF13775; DUF4171; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356}.
FT DOMAIN 418..486
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 418..486
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 61..421
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 479..524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 61..93
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..166
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 217..232
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 254..268
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 328..354
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 364..392
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 394..420
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 479..498
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 612 AA; 67206 MW; DFDD118944FE61A4 CRC64;
PHSQRPCRTT MAYDDSVKKE DCFDGDHSFE DIGLAAGRSQ REKKRSYKDF LREEEEIAAQ
VRNSSKKKLK DSELYFLGTD THRKKRKHSS DDYYYGDISS LEPSQKKKKK SSPQSADTAM
DLLKAITSPL ATGAKPSKKI GEKSSSSSSH LESKKEHHRK KVSGSSGELS LEDGSSHKSK
KMKPLYVNTE TLTLREPDGL KMKLILSPKE KGSSSVDEES FQYPSQQATV KKSSKKSARD
EQGALLLGHE LQSFLKTARK KHKSSSDPRS SPGPEGCGSE ASQFPESHSS NLDLSGLEPI
LVESDSSSGG ELEAGELVID DSYREIKKKK KSKKSKKKKD KEKHKEKRHS KSKRSSGLPP
AAVAGEVPVP PVPAPSLPYT GAATPPPPLP SLHTDGHSEK KKKREEKDRE RDRGEKPKKK
NMSAYQVFCK EYRVTIVADH PGIDFGELSK KLAEVWKQLP EKDKLVWKQK AQYLQHKQNK
AEATTVKRKA SSSEGSMKVK AASMGVLSPQ KKSPPTAMLL PASPAKAPET EPIDVAAHLQ
LLGESLSLIG HRLQETEGMV AVSGSLSVLL DSIICALGPL ACLTTQLPEL NGCPKQVLSN
TLDNIAYIMP GL
//