ID A0A3F2YSC8_ANOAR Unreviewed; 983 AA.
AC A0A3F2YSC8;
DT 16-JAN-2019, integrated into UniProtKB/TrEMBL.
DT 16-JAN-2019, sequence version 1.
DT 24-JAN-2024, entry version 22.
DE RecName: Full=Signal transducer and activator of transcription {ECO:0000256|RuleBase:RU046415};
OS Anopheles arabiensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7173 {ECO:0000313|EnsemblMetazoa:AARA016345-PB.2, ECO:0000313|Proteomes:UP000075840};
RN [1] {ECO:0000313|Proteomes:UP000075840}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Dongola {ECO:0000313|Proteomes:UP000075840};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles arabiensis DONG5_A.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AARA016345-PB.2}
RP IDENTIFICATION.
RC STRAIN=Dongola {ECO:0000313|EnsemblMetazoa:AARA016345-PB.2};
RG EnsemblMetazoa;
RL Submitted (AUG-2022) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496,
CC ECO:0000256|RuleBase:RU046415}. Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU046415}.
CC -!- SIMILARITY: Belongs to the transcription factor STAT family.
CC {ECO:0000256|ARBA:ARBA00005586, ECO:0000256|RuleBase:RU046415}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APCN01003999; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; APCN01004000; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; APCN01004001; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A3F2YSC8; -.
DR SMR; A0A3F2YSC8; -.
DR EnsemblMetazoa; AARA016345-RB; AARA016345-PB; AARA016345.
DR VEuPathDB; VectorBase:AARA016345; -.
DR VEuPathDB; VectorBase:AARA21_009024; -.
DR OrthoDB; 7823at2759; -.
DR Proteomes; UP000075840; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:UniProt.
DR GO; GO:0007165; P:signal transduction; IEA:InterPro.
DR CDD; cd09919; SH2_STAT_family; 1.
DR CDD; cd16855; STAT5_CCD; 1.
DR Gene3D; 1.10.238.10; EF-hand; 1.
DR Gene3D; 3.30.505.10; SH2 domain; 1.
DR Gene3D; 1.20.1050.20; STAT transcription factor, all-alpha domain; 1.
DR Gene3D; 2.60.40.630; STAT transcription factor, DNA-binding domain; 1.
DR Gene3D; 1.10.532.10; STAT transcription factor, N-terminal domain; 1.
DR InterPro; IPR008967; p53-like_TF_DNA-bd_sf.
DR InterPro; IPR000980; SH2.
DR InterPro; IPR036860; SH2_dom_sf.
DR InterPro; IPR001217; STAT.
DR InterPro; IPR046994; STAT5_CCD.
DR InterPro; IPR048988; STAT_linker.
DR InterPro; IPR036535; STAT_N_sf.
DR InterPro; IPR013800; STAT_TF_alpha.
DR InterPro; IPR015988; STAT_TF_coiled-coil.
DR InterPro; IPR013801; STAT_TF_DNA-bd.
DR InterPro; IPR012345; STAT_TF_DNA-bd_N.
DR InterPro; IPR013799; STAT_TF_prot_interaction.
DR PANTHER; PTHR11801; SIGNAL TRANSDUCER AND ACTIVATOR OF TRANSCRIPTION; 1.
DR PANTHER; PTHR11801:SF43; SIGNAL TRANSDUCER AND TRANSCRIPTION ACTIVATOR; 1.
DR Pfam; PF00017; SH2; 1.
DR Pfam; PF01017; STAT_alpha; 1.
DR Pfam; PF02864; STAT_bind; 1.
DR Pfam; PF02865; STAT_int; 1.
DR Pfam; PF21354; STAT_linker; 1.
DR SMART; SM00252; SH2; 1.
DR SMART; SM00964; STAT_int; 1.
DR SUPFAM; SSF49417; p53-like transcription factors; 1.
DR SUPFAM; SSF55550; SH2 domain; 1.
DR SUPFAM; SSF47655; STAT; 1.
DR SUPFAM; SSF48092; Transcription factor STAT-4 N-domain; 1.
DR PROSITE; PS50001; SH2; 1.
PE 3: Inferred from homology;
KW Activator {ECO:0000256|ARBA:ARBA00023159, ECO:0000256|RuleBase:RU046415};
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490, ECO:0000256|RuleBase:RU046415};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU046415};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU046415};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553,
KW ECO:0000256|RuleBase:RU046415};
KW SH2 domain {ECO:0000256|ARBA:ARBA00022999, ECO:0000256|RuleBase:RU046415};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|RuleBase:RU046415};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW ECO:0000256|RuleBase:RU046415}.
SQ SEQUENCE 983 AA; 110244 MW; 890F47B67DE484F8 CRC64;
MSLWARVNQL PQPILEQIRF IYGSNFPIEV RHYLADWIEE RLLNAPVYTN DQEAVYEQDA
ANFLNQLIME LERTAINLPE SNFTIKIRLN ESARNFRQLF SHNPAQLYQH LMNCLHRERQ
CVAYPDECVN VQDPEVTEVF NAVQQLQIMV RTNENDNRNL MKEYEHLLLE VHELQKNRAQ
LETIENAEMR AHAHNQLAQH QKMVNDRLQL CTGKRLALVD GFRKTILITD EVQNKVLNKY
LSQWKINQGF AGNGASMMSA SNLDTIQAWC ESLAEIIWST KDQIRLAIKN KSKLHVEQED
VPDLLPQAMV DVTNLLKMLI TNTFIIEKQP PQVMKTNTRF AATVRLLVGN TLNIKMVNPQ
VKVSIISEAQ AQQTQQTNKA SEQSCGEIMN NIGNLEYNET TKQLSVSFRN MQLKKIKRAE
KKGTECVMDE KFALLFQSSF AVGHGDLVFS VWTISLPVVV IVHGNQEPQS WATITWDNAF
ADINRIPFQV PDKVIWNQLA EALNMKFRAS TGRSLTAENM HFLCEKAFKT NLPFPVPNDL
TIMWSQFCKE PIPDRSFTFW DWFYAAMKVT REHLRGPWMD GSIIGFIHKS KAEDYLLKCP
RGTFLLRFSD SELGGITIAW VNEGNDGQPQ ILHIQPFTAK DFSTRSLSDR IRDFDDLFYL
YPNKPKHEAF DRYTTPAGPP RNKNYIASEV RAVLMPGPTN NQMNSFPNTP SYNIQSPDAS
RDTPSTGYSV SGRASNASAS CGLQHRYVSS FEPYVQSDGF AQSVLAAGTG TTLSGLVRPP
PALSVLSTSS NCSSSTTTTA GGGHYQQHQG LMEQQQHHHH LHNHITNQHH SPATGSQNHA
QLPQQQFDDA TMSLSDDGTT ELRSLPSMPG AFSFTGRPFA DSGNGSERPN LPVEHQPNGP
NVQLPQLSAG HTNSTVAAAV PAAITTSISS SSSRRVDSGG DGISCASSRT TSLSSASTAM
EQQLSGLEDI SSWLDLNNNT SWS
//