ID H0VTX8_CAVPO Unreviewed; 763 AA.
AC H0VTX8;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2012, sequence version 1.
DT 27-MAR-2024, entry version 73.
DE SubName: Full=SRY-box transcription factor 5 {ECO:0000313|Ensembl:ENSCPOP00000014145.2};
GN Name=SOX5 {ECO:0000313|Ensembl:ENSCPOP00000014145.2};
OS Cavia porcellus (Guinea pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Caviidae;
OC Cavia.
OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000014145.2, ECO:0000313|Proteomes:UP000005447};
RN [1] {ECO:0000313|Proteomes:UP000005447}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2N {ECO:0000313|Proteomes:UP000005447};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSCPOP00000014145.2}
RP IDENTIFICATION.
RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000014145.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKN02031144; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031145; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031146; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031147; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031148; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031149; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031150; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031151; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031152; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02031153; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_013008654.1; XM_013153200.1.
DR AlphaFoldDB; H0VTX8; -.
DR STRING; 10141.ENSCPOP00000014145; -.
DR Ensembl; ENSCPOT00000015841.3; ENSCPOP00000014145.2; ENSCPOG00000015686.4.
DR VEuPathDB; HostDB:ENSCPOG00000015686; -.
DR eggNOG; KOG0528; Eukaryota.
DR GeneTree; ENSGT00940000156122; -.
DR HOGENOM; CLU_018522_0_0_1; -.
DR InParanoid; H0VTX8; -.
DR OMA; PPPKSKX; -.
DR TreeFam; TF320471; -.
DR Proteomes; UP000005447; Unassembled WGS sequence.
DR Bgee; ENSCPOG00000015686; Expressed in testis and 10 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000976; F:transcription cis-regulatory region binding; IEA:Ensembl.
DR GO; GO:0055059; P:asymmetric neuroblast division; IEA:Ensembl.
DR GO; GO:0071560; P:cellular response to transforming growth factor beta stimulus; IEA:Ensembl.
DR GO; GO:0032332; P:positive regulation of chondrocyte differentiation; IEA:Ensembl.
DR GO; GO:2000741; P:positive regulation of mesenchymal stem cell differentiation; IEA:Ensembl.
DR CDD; cd22042; HMG-box_EGL13-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR45789; FI18025P1; 1.
DR PANTHER; PTHR45789:SF3; TRANSCRIPTION FACTOR SOX-5; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000005447}.
FT DOMAIN 556..624
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 556..624
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..29
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 77..141
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 359..433
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 714..763
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 195..268
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 453..514
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 77..132
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..387
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 397..418
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 731..750
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 763 AA; 84090 MW; 9F8149F84B167EFA CRC64;
MLTDPDLPQE FERMSSKRPA SPYGEADGEV AMVTSRQKVE EEESDGLPAF HLPLHVSFPN
KPHSEEFQPV SLLTQETCGH RTPTSQHNTM EVDGNKVMSS FAPHNSSTSP QKAEEGGRQS
GESLSSTALG TPERRKGSLA DVVDTLKQRK MEELIKNEPE ETPSIEKLLS KDWKDKLLAM
GSGNFGEIKG TPESLAEKER QLMGMINQLT SLREQLLAAH DEQKKLAASQ IEKQRQQMEL
AKQQQEQIAR QQQQLLQQQH KINLLQQQIQ VQGQLPPLMI PVFPPDQRTL AAAAQQGFLL
PPGFSYKAGC SDPYPVQLIP TTMAAAAAAT PGLGPLQLQQ LYAAQLAAMQ VSPGGKLPGI
PQGNLGAAVS PTSIHTDKST NSPPPKSKDE VAQPLNLSAK PKTSDGKSPT SPTSPHMPAL
RINSGAGPLK ASVPATLASP STRVSTIGYL NDHDAVSKAI QEARQMKEQL RREQQVLDGK
VAVVNSLGLN NCRTEKEKTT LESLTQQLAV KQNEEGKFSH AMMDFNMSGD SDGSAGVSES
RIYRESRGRG SNEPHIKRPM NAFMVWAKDE RRKILQAFPD MHNSNISKIL GSRWKAMTNL
EKQPYYEEQA RLSKQHLEKY PDYKYKPRPK RTCLVDGKKL RIGEYKAIMR NRRQEMRQYF
NVGQQAQIPI ATAGVVYPGA IAMAGMPSPH LPSEHSSVSS SPEPGMPVIQ STYGVKGEEP
HIKEEIQAED INGEIYDEYD EEEDDPDVDY GSDSENHIAG QAN
//