GenomeNet

Database: UniProt
Entry: A0A643BNS3_BALPH
LinkDB: A0A643BNS3_BALPH
Original site: A0A643BNS3_BALPH 
ID   A0A643BNS3_BALPH        Unreviewed;       817 AA.
AC   A0A643BNS3;
DT   22-APR-2020, integrated into UniProtKB/TrEMBL.
DT   22-APR-2020, sequence version 1.
DT   24-JAN-2024, entry version 12.
DE   RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN   ORFNames=E2I00_004040 {ECO:0000313|EMBL:KAB0389622.1};
OS   Balaenoptera physalus (Fin whale) (Balaena physalus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Mysticeti;
OC   Balaenopteridae; Balaenoptera.
OX   NCBI_TaxID=9770 {ECO:0000313|EMBL:KAB0389622.1, ECO:0000313|Proteomes:UP000437017};
RN   [1] {ECO:0000313|EMBL:KAB0389622.1, ECO:0000313|Proteomes:UP000437017}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=FinWhale-01 {ECO:0000313|EMBL:KAB0389622.1};
RX   PubMed=31553763;
RA   Westbury M.V., Petersen B., Lorenzen E.D.;
RT   "Genomic analyses reveal an absence of contemporary introgressive admixture
RT   between fin whales and blue whales, despite known hybrids.";
RL   PLoS ONE 14:0-e0222004(2019).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAB0389622.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; SGJD01006867; KAB0389622.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A643BNS3; -.
DR   Proteomes; UP000437017; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd21981; HMG-box_HMGXB3; 1.
DR   Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   InterPro; IPR039598; HMGXB3.
DR   PANTHER; PTHR17609; HMG DOMAIN-CONTAINING PROTEIN 3; 1.
DR   PANTHER; PTHR17609:SF2; HMG DOMAIN-CONTAINING PROTEIN 3; 1.
DR   Pfam; PF09011; HMG_box_2; 1.
DR   SMART; SM00398; HMG; 1.
DR   SUPFAM; SSF47095; HMG-box; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Reference proteome {ECO:0000313|Proteomes:UP000437017}.
FT   DOMAIN          42..100
FT                   /note="HMG box"
FT                   /evidence="ECO:0000259|PROSITE:PS50118"
FT   DNA_BIND        42..100
FT                   /note="HMG box"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT   REGION          362..391
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          451..489
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          557..585
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   817 AA;  88692 MW;  2A429F5FA8874E34 CRC64;
     MDASYDGTEV TVVMEEIEEA YCYTSPGPPK KKKKYKIHGE KAKKPRSAYL LYYYDIYLKV
     QQELPHLPQS EINKKISESW RLLSVAERSY YLEKAKLEKE GLDPNSKLSA LTAVVPDIPG
     FRKILPRSDY IIIPKSSLQE ERSCSQLELC VAQNQMSPKG PSLLSNTALE TVPSHAGMAE
     QCLAVEALAE DVGALAQPGA VQEITTSEIL GPDVLLNEAS LEVGESHQPY QTSLVIEETL
     VNGSPDLPTG SLAVPHPQVG ESMSVVTVMR DSSEISSSAP AAQFIMLPLP AYSVVENPTS
     IKLTTTYTRR GHGTCTSPGC SFTYVTRHKP PKCPTCGNFL GGKWIPKEKP AKVKVELTSG
     VSSKGSVVKR NQQPLTSEQN SAKENSSKLT LENSEAVSQL LSIAPQRDVG EENEWEEVII
     SDAHVLVKET PGNRGTAVIK KPVVKSGVQR EVSLGTAEND TPGLDMPPPA EGTSTSNSLP
     APKKPTGVDL LIPVPRASEL KGRARGKPSL LAAARPMRAI LPAPANVGRG SSMGLPRARQ
     AFPLSDKTPS VRTCGLKPST LKQLGQPIQQ PSSTGEVKLP NGPANRTSQV KVVEVKPDMF
     PPYKYSCTVT LDLGLATSRG RGKCKNPSCS YVYTNRHKPR ICPSCGFNLA KDRTEKTTKV
     LEASSPLSDV LSATEPLSAA QREIQRQSTL QLLRKVLQIP ENESELAEVF ALIHELNSSR
     LVLSNVSEET VTIEQTSWSN YYESPSTQCL LCSSPLFKGG QNSLAGPQEC WLLTASWLQV
     VTAQVKMCLN PHCLALHSFM DIYTEAMFEN SLCGSDL
//
DBGET integrated database retrieval system