ID A0A087XGD3_POEFO Unreviewed; 1234 AA.
AC A0A087XGD3;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 24-JAN-2024, entry version 36.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000004836.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000004836.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01002988; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01002989; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A087XGD3; -.
DR Ensembl; ENSPFOT00000004845.2; ENSPFOP00000004836.2; ENSPFOG00000004457.2.
DR GeneTree; ENSGT00390000006983; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR039598; HMGXB3.
DR InterPro; IPR040648; HMGXB3_CxC4.
DR PANTHER; PTHR17609; HMG DOMAIN-CONTAINING PROTEIN 3; 1.
DR PANTHER; PTHR17609:SF2; HMG DOMAIN-CONTAINING PROTEIN 3; 1.
DR Pfam; PF18717; CxC4; 1.
DR Pfam; PF09011; HMG_box_2; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760}.
FT DOMAIN 43..111
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 43..111
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 22..44
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 162..200
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 366..432
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 588..616
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 169..200
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 387..432
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 588..605
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1234 AA; 136659 MW; 776D8B5F5D27F7C6 CRC64;
MDNVEVLEVV KVTEEVEDVY SQVEMTSPKK KRGKADENSH ERPKKPRSAY LLYYFDVHQV
MQLGTPNLPQ SEVNKRISES WRRLSVAEKS YYLERAKLEK DGIDTETKPQ TSQSSSKHLP
GFRKILPRAN CFLVPNSASP NYQAVGSQSE VCIESIDFSA EPPTVPSDTA EETVVSSPQR
TSPWSSSKTL TSTDVQGSQG SVDEMLNDVS LNTKASGVAV QMMQREATQM VTIVPSQVRS
LLSVSPNYPS FKSQLEPQPL AGISSLQPVM MISVGAKSDQ IPKPSYKMVK TRVYSKINFI
SSLLQSVKTY TRRGRGRCLN PGCSFVYVTR HKPPTCPECG SHLGGKWIPA VKVQLLLAGD
IAKRTQDKAG ASRLTTDNKS DESRQATLHV SADTSKTSGG GNKEGQANRS NRKQPALSAA
APPEGSSNTP LPQICKQIKV NTKSTHQKPA RSCAVVQKRP VRPILPAVCN PGNTGQTSRG
IYFRCARCSS DVVVLVATVP QEILSSLKPS TLKQLGQTAP TTTTTQVNPV LFMENLFLCG
FFSVCVTDDL GLSTARGRGR CKNPACDYMY KNRHKPAVCP KCGSELTRKN TKPPKVREAQ
HPSETLLDPH QDLSPAQRDV QRQSTLQLIR STLQIPESDT ELQETMSLIQ ELNSVKIVLV
KNGEGSDAET ETLLETGWPQ FYESAATHCG LCSYRLLKGE RGTVAGQEDC WLLTETLMQT
ASLQLKVCPN LRCLALHSFA DLHPGLFNVG NRLLVSLDLF LKIRANIKLG QTPPLAAQSV
FDQIQNHPIH SLTAEESSHV QELFLSGYWA FECLTMRDYN DMICGICGVA PKVEIAQRHR
NDVLELKNVE FTWPECSVSD EVHVDDFWLT MEGEAIEQAA FPADIPITRV DASIIAPFAP
PLMRSPTVIN TEKDKLMPPI QQPAGDPSVL VRLIHDGQLR LDRVEDHSED ELRAILDSCG
VDLTPGSTKN ELLASLVNLY THVHSGLPTA PQPPAHLTAG KLSKMCPHKV VCSSKYLVRG
ETARDHVDLL LSSRYWPPVY VCDCPRQVAL CTDLQYPELA TQMWGRNQGC FSDPFEKPEF
VTCPELQDQP YAADLSLVAE NQLVHPITKS PSCWLAYPPG AARDAPAQEH HRMIRCRDLE
PYINLLTELD LKEPEDDVNT KPMIFNNTAY YYLYNRLVDF LSSRDIVNQQ ISQVVKACQP
GEVVIRDSLY RLGVAQINTD GDEGTRLDAQ TEEE
//