ID W5JJG1_ANODA Unreviewed; 823 AA.
AC W5JJG1;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN ORFNames=AND_004863 {ECO:0000313|EMBL:ETN63428.1};
OS Anopheles darlingi (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN63428.1};
RN [1] {ECO:0000313|EMBL:ETN63428.1, ECO:0000313|Proteomes:UP000000673}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT the genome of the newly sequenced Anopheles darlingi.";
RL BMC Genomics 11:529-529(2010).
RN [2] {ECO:0000313|EMBL:ETN63428.1}
RP NUCLEOTIDE SEQUENCE.
RA Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ETN63428.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23761445;
RA Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA Camargo E.P., de Vasconcelos A.T.;
RT "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL Nucleic Acids Res. 41:7387-7400(2013).
RN [4] {ECO:0000313|EnsemblMetazoa:ADAC004863-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADMH02001255; ETN63428.1; -; Genomic_DNA.
DR AlphaFoldDB; W5JJG1; -.
DR STRING; 43151.W5JJG1; -.
DR EnsemblMetazoa; ADAC004863-RA; ADAC004863-PA; ADAC004863.
DR VEuPathDB; VectorBase:ADAC004863; -.
DR VEuPathDB; VectorBase:ADAR2_002614; -.
DR eggNOG; KOG4715; Eukaryota.
DR HOGENOM; CLU_021772_2_1_1; -.
DR OMA; YTAKHMA; -.
DR OrthoDB; 3062313at2759; -.
DR Proteomes; UP000000673; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21983; HMG-box_SMARCE1; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR46232; SMARCE1 REGULATOR OF CHROMATIN; 1.
DR PANTHER; PTHR46232:SF1; SWI_SNF-RELATED MATRIX-ASSOCIATED ACTIN-DEPENDENT REGULATOR OF CHROMATIN SUBFAMILY E MEMBER 1; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000000673}.
FT DOMAIN 70..138
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 70..138
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..75
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 151..192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 312..660
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 681..823
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 113..140
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 17..41
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 315..337
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 345..369
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 385..423
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 424..438
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 465..485
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 493..526
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 532..555
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 563..653
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 690..709
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 726..755
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 775..789
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 809..823
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 823 AA; 86797 MW; CD8CE05FCA789E23 CRC64;
MALPVNYKHN IPGPSMTPQR IRPSGSGADR KDNSSPFMHS PHGNPAFTPQ KMGKASAASE
SKMIKPPKPP EKPLMPYMRY SRKVWDSIKA SNSDLKLWEV GKIIGQQWRD LPEAEKEEYI
AEYEQEKTEH EKNMKAYHNS PAYLAYLTAR NKAKPGDGDG HETSSRSSSK AQQQDRRIDI
QPAEDEDDPD DGYSFKHVAY ARFSRNHRLI NEIFSDAVVP DVRSVVTTQR MHVLKRQVQS
LTMHQMKLQH ELQLIEEKFE TRKRKFVESS ELFQDELKKH CKPAVDEETF QKMVERQYEM
MKRERLRALE EAQKPPAPAP APAPAPASQP PTSQAPPAAP SATKAEEQTP EQPAGATAAA
ASTPTNGNEP AAAAAAPGAT AAVAPATQET GSDGSADRSS PMAMATSDES QGSQDSSTTG
PVSEKKELAE ELKPEAKDDA ATPVGAPVSN ASPEEAKAVA APPVSQPSTA PPSHPIGPPP
VTSAAGPPTA ATEAPPSVAP PASIPSQPPA LPPPGHMPPV AAPPAAPHQH PGVVGTQQSE
PKPEIPPTSI VHHTSSHPPP VAAAHGGHPP GGPVMPPVHG GYAPPPVSAA GAPVPPHQVP
SVPTPPPPSH TPESQIPPPN VTVPHQHQPP HPAPPHHMPP HMQPHPGMPT HGSPFPGYPH
AGSPRAPYYL PGYGGHPQPY GQYGHYPYHQ QYGPPPPPGG YPVPGGPPGS RPMGHYGEVH
HAEPPHHGYG PPPPPGATGH GMPPGAVPAP GQPHAAGPPT MVTATATVTP PAVHATPAAA
PPPVAGPEPG EIEPEKNPAA TPNKKRKKGG AASKQDDTDK KDD
//