ID A0A499FTG8_ANOAR Unreviewed; 562 AA.
AC A0A499FTG8;
DT 03-JUL-2019, integrated into UniProtKB/TrEMBL.
DT 03-JUL-2019, sequence version 1.
DT 13-SEP-2023, entry version 21.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:AARA018231-PA.1};
OS Anopheles arabiensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7173 {ECO:0000313|EnsemblMetazoa:AARA018231-PA.1, ECO:0000313|Proteomes:UP000075840};
RN [1] {ECO:0000313|Proteomes:UP000075840}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Dongola {ECO:0000313|Proteomes:UP000075840};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles arabiensis DONG5_A.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AARA018231-PA.1}
RP IDENTIFICATION.
RC STRAIN=Dongola {ECO:0000313|EnsemblMetazoa:AARA018231-PA.1};
RG EnsemblMetazoa;
RL Submitted (AUG-2022) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00201}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00201}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APCN01000723; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; APCN01000724; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A499FTG8; -.
DR EnsemblMetazoa; AARA018231-RA; AARA018231-PA; AARA018231.
DR VEuPathDB; VectorBase:AARA018231; -.
DR VEuPathDB; VectorBase:AARA21_007718; -.
DR OrthoDB; 5323209at2759; -.
DR Proteomes; UP000075840; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:UniProt.
DR Gene3D; 2.60.40.820; Transcription factor, T-box; 1.
DR InterPro; IPR008967; p53-like_TF_DNA-bd_sf.
DR InterPro; IPR046360; T-box_DNA-bd.
DR InterPro; IPR036960; T-box_sf.
DR InterPro; IPR002070; TF_Brachyury.
DR InterPro; IPR001699; TF_T-box.
DR InterPro; IPR018186; TF_T-box_CS.
DR PANTHER; PTHR11267:SF129; LP04777P-RELATED; 1.
DR PANTHER; PTHR11267; T-BOX PROTEIN-RELATED; 1.
DR Pfam; PF00907; T-box; 1.
DR PRINTS; PR00938; BRACHYURY.
DR PRINTS; PR00937; TBOX.
DR SMART; SM00425; TBOX; 1.
DR SUPFAM; SSF49417; p53-like transcription factors; 1.
DR PROSITE; PS01283; TBOX_1; 1.
DR PROSITE; PS50252; TBOX_3; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00201};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00201}; Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
SQ SEQUENCE 562 AA; 59976 MW; 072D1988FFB1770D CRC64;
MLLDTQGTAG DGPAVGPPGM GSGTVPSTAA AAAAAAAAAA ALSAAKGATD FSIAAIMALR
EGSSRSPLEA VFVSGSQNSS VDRLSEVDDE VDVDVEECSD SEEPKSSDSG RVTHSRTTET
PNSVSVSDEE RLSPEPAQKR PTIVGSCNSD DLRPVQCHLE TKELWDKFNE LGTEMIITKT
GRRMFPTVRV SFSGPLRQVT PADRYVVLLD VVPVDNRRYR YAYHRSAWLV AGKADPPPPA
RLYAHPDTPL GADALRKQVI SFEKVKLTNN EMDKNGQIVL NSMHRYQPRI HLARLGPGQN
IPITPKELAE VDHKTYVFPE TIFTAVTAYQ NQLITKLKID SNPFAKGFRD SSRLTDFDRD
PMDALLFEQH MRSPLRLFPD PLMAQFTSGP SPADFQDASS AALLEKARQH LQMWGRSPYS
ELLLPQMYQR PQALGALNLG VWQNSASWPA SPQLPGGFLP NAAAAAAAVA AAASASGGPR
HTTPPPPPPP PATALGPSGG TILSPSTPLT PTSSSGTPSP DPRAKHFTRF TPYQIPQAQH
RSSPAPPPSS LPPAGSPGSS SN
//