GenomeNet

Database: UniProt
Entry: A0A226NK20_CALSU
LinkDB: A0A226NK20_CALSU
Original site: A0A226NK20_CALSU 
ID   A0A226NK20_CALSU        Unreviewed;       418 AA.
AC   A0A226NK20;
DT   25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT   25-OCT-2017, sequence version 1.
DT   24-JAN-2024, entry version 20.
DE   RecName: Full=HMG box domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=ASZ78_005536 {ECO:0000313|EMBL:OXB67854.1};
OS   Callipepla squamata (Scaled quail).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Odontophoridae;
OC   Callipepla.
OX   NCBI_TaxID=9009 {ECO:0000313|EMBL:OXB67854.1, ECO:0000313|Proteomes:UP000198323};
RN   [1] {ECO:0000313|EMBL:OXB67854.1, ECO:0000313|Proteomes:UP000198323}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Texas {ECO:0000313|EMBL:OXB67854.1,
RC   ECO:0000313|Proteomes:UP000198323};
RC   TISSUE=Leg muscle {ECO:0000313|EMBL:OXB67854.1};
RA   Oldeschulte D.L., Halley Y.A., Bhattarai E.K., Brashear W.A., Hill J.,
RA   Metz R.P., Johnson C.D., Rollins D., Peterson M.J., Bickhart D.M.,
RA   Decker J.E., Seabury C.M.;
RT   "Disparate Historic Effective Population Sizes Predicted by Modern Levels
RT   of Genome Diversity for the Scaled Quail (Callipepla squamata) and the
RT   Northern Bobwhite (Colinus virginianus): Inferences from First and Second
RT   Generation Draft Genome Assemblies for Sympatric New World Quail.";
RL   Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OXB67854.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; MCFN01000030; OXB67854.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A226NK20; -.
DR   STRING; 9009.A0A226NK20; -.
DR   Proteomes; UP000198323; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd22048; HMG-box_SoxF_SOX18; 1.
DR   Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   InterPro; IPR033392; Sox7/17/18_central.
DR   InterPro; IPR021934; Sox_C.
DR   PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR   PANTHER; PTHR10270:SF204; TRANSCRIPTION FACTOR SOX-18; 1.
DR   Pfam; PF00505; HMG_box; 1.
DR   Pfam; PF12067; Sox17_18_mid; 1.
DR   SMART; SM00398; HMG; 1.
DR   SUPFAM; SSF47095; HMG-box; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
DR   PROSITE; PS51516; SOX_C; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Reference proteome {ECO:0000313|Proteomes:UP000198323}.
FT   DOMAIN          100..168
FT                   /note="HMG box"
FT                   /evidence="ECO:0000259|PROSITE:PS50118"
FT   DOMAIN          298..417
FT                   /note="Sox C-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51516"
FT   DNA_BIND        100..168
FT                   /note="HMG box"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT   REGION          30..100
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          198..219
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        33..47
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   418 AA;  46157 MW;  C9FBDDE23869252D CRC64;
     MNISESNYCR EEISQPRGDC SWVTGAVPAA EPGLAFPRPP GAASPSSRTP SPEPGFAFGP
     AAPGAAPGAA PSRTPSPEPG YGYSPPAGRA EGKAGEDSRI RRPMNAFMVW AKDERKRLAQ
     QNPDLHNAVL SKMLGQSWKA LSASDKRPFV EEAERLRIQH LQDHPNYKYR PRRKKQAKKI
     KRMEPNILLH NLSQPCSDNF SMSHHGGSQP GHPQPPPLNH FRELHSMGSD IENYGLPTPE
     MSPLDVLEQT EPAFFPPHMQ EDCGMMPFRG YHHHHPQMEF PQEKCLGRDV AVPYAQPPAH
     LADAMRTPHP SGLYYNQMCS GSQSGLSAHL GQLSPPPEAH HMESVDHLNQ TDLWTDVDRN
     EFDQYLNMSR TRPEASGLPY HVSLSKVTPR SISCEESSLI SALSDASSAV YYSPCITG
//
DBGET integrated database retrieval system