ID A0A3M0JH91_HIRRU Unreviewed; 323 AA.
AC A0A3M0JH91;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN ORFNames=DUI87_25279 {ECO:0000313|EMBL:RMB98373.1};
OS Hirundo rustica rustica.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Sylvioidea; Hirundinidae;
OC Hirundo.
OX NCBI_TaxID=333673 {ECO:0000313|EMBL:RMB98373.1, ECO:0000313|Proteomes:UP000269221};
RN [1] {ECO:0000313|EMBL:RMB98373.1, ECO:0000313|Proteomes:UP000269221}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Chelidonia {ECO:0000313|EMBL:RMB98373.1};
RC TISSUE=Blood {ECO:0000313|EMBL:RMB98373.1};
RA Formenti G., Chiara M., Poveda L., Francoijs K.-J., Bonisoli-Alquati A.,
RA Canova L., Gianfranceschi L., Horner D.S., Saino N.;
RT "A high quality draft genome assembly of the barn swallow (H. rustica
RT rustica).";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC -!- SIMILARITY: Belongs to the HMGB family.
CC {ECO:0000256|ARBA:ARBA00008774}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RMB98373.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRBI01000153; RMB98373.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3M0JH91; -.
DR STRING; 333673.A0A3M0JH91; -.
DR Proteomes; UP000269221; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21978; HMG-box_HMGB_rpt1; 1.
DR CDD; cd21979; HMG-box_HMGB_rpt2; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 2.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR017967; HMG_boxA_CS.
DR PANTHER; PTHR48112:SF14; HIGH MOBILITY GROUP PROTEIN B3; 1.
DR PANTHER; PTHR48112; HIGH MOBILITY GROUP PROTEIN DSP1; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF09011; HMG_box_2; 1.
DR PRINTS; PR00886; HIGHMOBLTY12.
DR SMART; SM00398; HMG; 2.
DR SUPFAM; SSF47095; HMG-box; 2.
DR PROSITE; PS00353; HMG_BOX_1; 1.
DR PROSITE; PS50118; HMG_BOX_2; 2.
PE 3: Inferred from homology;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000269221}.
FT DOMAIN 130..200
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 214..282
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 130..200
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT DNA_BIND 214..282
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 197..218
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 282..323
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 197..214
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 282..303
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 304..323
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 323 AA; 36132 MW; 5103A1032F556068 CRC64;
MLGAGAVPRR EPESRTRSAG QGREGVVQPR WVCDGKDRPE FTLDGGTFWS RNAKTPGHRA
EGWLLGHCGQ WDWKRSIICL LDGLPALVCD STSSEGYGTL AEMVSSCPVF LLLAFGSVGV
KMAKGDPKKP KGKMSAYAFF VQTCREEHKK KNPEVPVNFA EFSKKCSERW KTMSSKEKAK
FDEMAKADKV RYDREMKDYG PAKGGKKKKD PNAPKRPPSG FFLFCSEFRP KIKSTNPGIS
IGDVAKKLGE MWNNLSDGEK QPYNNKAAKL KEKYEKDVAD YKSKGKFDGA KGAATKAARK
KVEEEDEEEE EDEEEEDEDD DDE
//