ID A0A182VUN6_9DIPT Unreviewed; 943 AA.
AC A0A182VUN6;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 13-SEP-2023, entry version 33.
DE RecName: Full=ETS domain-containing protein {ECO:0008006|Google:ProtNLM};
OS Anopheles minimus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=112268 {ECO:0000313|EnsemblMetazoa:AMIN001779-PA, ECO:0000313|Proteomes:UP000075920};
RN [1] {ECO:0000313|Proteomes:UP000075920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MINIMUS1 {ECO:0000313|Proteomes:UP000075920};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles minimus MINIMUS1.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AMIN001779-PA}
RP IDENTIFICATION.
RC STRAIN=MINIMUS1 {ECO:0000313|EnsemblMetazoa:AMIN001779-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU004019}.
CC -!- SIMILARITY: Belongs to the ETS family. {ECO:0000256|ARBA:ARBA00005562,
CC ECO:0000256|RuleBase:RU004019}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182VUN6; -.
DR STRING; 112268.A0A182VUN6; -.
DR EnsemblMetazoa; AMIN001779-RA; AMIN001779-PA; AMIN001779.
DR VEuPathDB; VectorBase:AMIN001779; -.
DR OrthoDB; 2950528at2759; -.
DR Proteomes; UP000075920; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR000418; Ets_dom.
DR InterPro; IPR046328; ETS_fam.
DR InterPro; IPR003118; Pointed_dom.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR11849; ETS; 1.
DR PANTHER; PTHR11849:SF201; ETS DNA-BINDING PROTEIN POKKURI; 1.
DR Pfam; PF00178; Ets; 1.
DR Pfam; PF02198; SAM_PNT; 1.
DR PRINTS; PR00454; ETSDOMAIN.
DR SMART; SM00413; ETS; 1.
DR SMART; SM00251; SAM_PNT; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS00345; ETS_DOMAIN_1; 1.
DR PROSITE; PS00346; ETS_DOMAIN_2; 1.
DR PROSITE; PS50061; ETS_DOMAIN_3; 1.
DR PROSITE; PS51433; PNT; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU004019};
KW Nucleus {ECO:0000256|RuleBase:RU004019}.
FT DOMAIN 79..163
FT /note="PNT"
FT /evidence="ECO:0000259|PROSITE:PS51433"
FT DOMAIN 499..582
FT /note="ETS"
FT /evidence="ECO:0000259|PROSITE:PS50061"
FT REGION 170..195
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 222..384
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 406..437
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 458..488
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 613..634
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 662..753
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 769..794
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 821..943
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 222..354
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 458..477
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 620..634
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 662..701
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 712..729
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 730..753
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 842..856
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 867..885
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 886..917
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 918..932
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 943 AA; 99904 MW; 496889D427872336 CRC64;
MDEVDTIAAI IREQQASSAA VSGQDVLTSA SQSPSTAAAA IFAAAAMKLL PIPLSPLNAP
PPLGLWNSEL LWRYPPAPPS PLADLKTQLP PQLNTDPRIW GRDEVVIFLR FCEREFDLPK
FDLDLFQMNG KALCVLTKND LAERSPGAGD VLHNVLQMLI RDAQILHRHL PSSPVTPTGR
YPLSPHSHPP TPNWSALAPP DSLFFHSSHL QQFMAVSNSV TLSPAPSIDS QAGSPPSQSH
AEQNVFQQQQ QQQQQQSKSS GAGSSSSSTS SSSSTNGSAG SGGSANGNGT GPSTNGASAA
NVSNSLISSS SSSSAASSTS SSVSNQSDSD EDNATVGNGS GATSTTNGSN GNGFHPLPLS
HPDSKLPLIA THPPQQSPPL TPISKETQHL GQLQLLDTKT LQYPVSSLLN GSSTNGGSSS
RDRDSSSSVG GSSGSASSNA AAAAAAMAAA VLGLKSAASN GTGSSGSSGS NSAPSTPGAF
LPVKREFFPD SPEPNTNGRL LWDFLQQLLN DPSQRYSSYI AWKCRDTGVF KIVDPAGLAK
LWGKQKNHLS MNYDKMSRAL RYYYRVNILR KVQGERHCYQ FLRNPTELKS IKNISLLRQT
MAASQAAAAA AAAQSGGGND RNGDRVSASN GSSGANPLMS LLPNGANIHH LAAAAAAAAA
SQNGPLSPSS SSSSSSPSSL HHHHHLQQQQ QQQQALHSHH LAVQNGSGPH HRNIKMETDL
DDLKPTDLST NDRKFSYGST VNGTDCYPSS SSSRLIRNAN GLTTIQLLRQ QQQQQQQSEA
GSPPSSPTPL SINSQSLHTI SSNLHHLHQQ QLQAQAAAAA AIAAAAEARH QQDDQPDHPA
HHHLQHPHHQ SHHHSARNGS RSSSTSGASE HDHREPHDRE RDTPSPANSE SSSHPAAASS
QQQQQQQQQQ QQQQQQHHHM HVKSDDDSMM PTDLSKSESM YYK
//