ID A0A182MYW0_9DIPT Unreviewed; 875 AA.
AC A0A182MYW0;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
OS Anopheles dirus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7168 {ECO:0000313|EnsemblMetazoa:ADIR000565-PA, ECO:0000313|Proteomes:UP000075884};
RN [1] {ECO:0000313|Proteomes:UP000075884}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WRAIR2 {ECO:0000313|Proteomes:UP000075884};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles dirus WRAIR2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ADIR000565-PA}
RP IDENTIFICATION.
RC STRAIN=WRAIR2 {ECO:0000313|EnsemblMetazoa:ADIR000565-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the TCF/LEF family.
CC {ECO:0000256|ARBA:ARBA00006569}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182MYW0; -.
DR STRING; 7168.A0A182MYW0; -.
DR EnsemblMetazoa; ADIR000565-RA; ADIR000565-PA; ADIR000565.
DR VEuPathDB; VectorBase:ADIR000565; -.
DR OrthoDB; 5351131at2759; -.
DR Proteomes; UP000075884; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0016055; P:Wnt signaling pathway; IEA:UniProtKB-KW.
DR CDD; cd21996; HMG-box_TCF7-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 4.10.900.10; TCF3-CBD (Catenin binding domain); 1.
DR InterPro; IPR027397; Catenin-bd_sf.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR024940; TCF/LEF.
DR PANTHER; PTHR10373:SF38; PROTEIN PANGOLIN, ISOFORM J; 1.
DR PANTHER; PTHR10373; TRANSCRIPTION FACTOR 7 FAMILY MEMBER; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM01366; c-clamp; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Wnt signaling pathway {ECO:0000256|ARBA:ARBA00022687}.
FT DOMAIN 386..454
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 386..454
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..99
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 242..383
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 461..486
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 512..571
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 587..766
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 840..875
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 14..57
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 280..294
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 313..338
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 339..353
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 369..383
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 512..536
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..634
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 642..656
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 657..703
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 711..753
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 875 AA; 90963 MW; B75A645C2F51A20B CRC64;
MPHNSSTGDE LGSTDEVKVF KDEGDRDDEK LSENLLEEKS SLIDLTESEE KTVKNGTTRH
EAANTLFGGT GGGSGGGGGG SGGGGGGGSG GVGTGIKLPH GAPHPGFNMG YIVPPYSYPN
GTAGGLSVSM ANKMSLPPFF CHNGDHLSSP PPAHCGILPY QLDPKTMSLT RPPLYSFPTS
QYPYPMLSPD MLPVGPSWHT PSLYSPAAGF RSPYPSSLQI YTSLPSDFYR YSPTLLPPSI
HPSHPVLNAS HPAIITPGPK QDISGGGGSH RGGDRQGGAS IKLESSSSSA VNDHEPSSGH
HHHHHRSTSS SYHHHRSSGG GSLGDQHGTS NNNNSNHHTN HNHHNHHHHN HHNNNNGNGG
GSGGAGHKHQ SLAEREAALE KKRSHVKKPL NAFMLYMKEM RAKVVAECTL KESAAINQIL
GRKWHSLSRE EQSVYYDKAR QERQLHMELY PGWTARDNYG YGAKKKKRKK DRSPADPGGN
SMKKCRARYG LDQQNQWCKP CRRKKKCIRY KEAGGGDRGD GSSDREGGRD RGDGSDDAIG
SCGSMEDDSS KSPGEEDEDR ESINQSLSSP RCLSVLSSLQ SPYSNNFSPY NNHHTMRTNS
ASSSASNGTG GGGGGPGLNN STSSTSNVTS GGGGSLGKSP PGMLLPPTPS TPTLAPPTPS
SSSSSSSSSS SSSSSSLSSS SSSSTPSLQI PPHLQQLNLK TSPPLLPNTP PSSGGSSVSS
TTALAIKEEL PDSSTGNGSI GGSPSSGALH PASSTSPPAG VGGFLLHPAH HHHHLHHQHH
LASLHHSSNA LQLGHTQLHA TSKTNGDLSP TTTAAAAATA AALAAVATGG ALATAPQAAA
VAPTSTNDKS VPVNNGSSSR SSSNSSSSSQ IASDR
//