ID A0A226NK20_CALSU Unreviewed; 418 AA.
AC A0A226NK20;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=HMG box domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=ASZ78_005536 {ECO:0000313|EMBL:OXB67854.1};
OS Callipepla squamata (Scaled quail).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Odontophoridae;
OC Callipepla.
OX NCBI_TaxID=9009 {ECO:0000313|EMBL:OXB67854.1, ECO:0000313|Proteomes:UP000198323};
RN [1] {ECO:0000313|EMBL:OXB67854.1, ECO:0000313|Proteomes:UP000198323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texas {ECO:0000313|EMBL:OXB67854.1,
RC ECO:0000313|Proteomes:UP000198323};
RC TISSUE=Leg muscle {ECO:0000313|EMBL:OXB67854.1};
RA Oldeschulte D.L., Halley Y.A., Bhattarai E.K., Brashear W.A., Hill J.,
RA Metz R.P., Johnson C.D., Rollins D., Peterson M.J., Bickhart D.M.,
RA Decker J.E., Seabury C.M.;
RT "Disparate Historic Effective Population Sizes Predicted by Modern Levels
RT of Genome Diversity for the Scaled Quail (Callipepla squamata) and the
RT Northern Bobwhite (Colinus virginianus): Inferences from First and Second
RT Generation Draft Genome Assemblies for Sympatric New World Quail.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXB67854.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFN01000030; OXB67854.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A226NK20; -.
DR STRING; 9009.A0A226NK20; -.
DR Proteomes; UP000198323; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd22048; HMG-box_SoxF_SOX18; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR033392; Sox7/17/18_central.
DR InterPro; IPR021934; Sox_C.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR10270:SF204; TRANSCRIPTION FACTOR SOX-18; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12067; Sox17_18_mid; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS51516; SOX_C; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000198323}.
FT DOMAIN 100..168
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 298..417
FT /note="Sox C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51516"
FT DNA_BIND 100..168
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 30..100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 198..219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 33..47
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 418 AA; 46157 MW; C9FBDDE23869252D CRC64;
MNISESNYCR EEISQPRGDC SWVTGAVPAA EPGLAFPRPP GAASPSSRTP SPEPGFAFGP
AAPGAAPGAA PSRTPSPEPG YGYSPPAGRA EGKAGEDSRI RRPMNAFMVW AKDERKRLAQ
QNPDLHNAVL SKMLGQSWKA LSASDKRPFV EEAERLRIQH LQDHPNYKYR PRRKKQAKKI
KRMEPNILLH NLSQPCSDNF SMSHHGGSQP GHPQPPPLNH FRELHSMGSD IENYGLPTPE
MSPLDVLEQT EPAFFPPHMQ EDCGMMPFRG YHHHHPQMEF PQEKCLGRDV AVPYAQPPAH
LADAMRTPHP SGLYYNQMCS GSQSGLSAHL GQLSPPPEAH HMESVDHLNQ TDLWTDVDRN
EFDQYLNMSR TRPEASGLPY HVSLSKVTPR SISCEESSLI SALSDASSAV YYSPCITG
//