ID W5JIA7_ANODA Unreviewed; 639 AA.
AC W5JIA7;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 24-JAN-2024, entry version 54.
DE SubName: Full=T-box transcription factor tbx20 {ECO:0000313|EMBL:ETN63856.1};
GN ORFNames=AND_004417 {ECO:0000313|EMBL:ETN63856.1};
OS Anopheles darlingi (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN63856.1};
RN [1] {ECO:0000313|EMBL:ETN63856.1, ECO:0000313|Proteomes:UP000000673}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT the genome of the newly sequenced Anopheles darlingi.";
RL BMC Genomics 11:529-529(2010).
RN [2] {ECO:0000313|EMBL:ETN63856.1}
RP NUCLEOTIDE SEQUENCE.
RA Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ETN63856.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23761445;
RA Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA Camargo E.P., de Vasconcelos A.T.;
RT "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL Nucleic Acids Res. 41:7387-7400(2013).
RN [4] {ECO:0000313|EnsemblMetazoa:ADAC004417-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00201}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00201}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADMH02001163; ETN63856.1; -; Genomic_DNA.
DR AlphaFoldDB; W5JIA7; -.
DR STRING; 43151.W5JIA7; -.
DR EnsemblMetazoa; ADAC004417-RA; ADAC004417-PA; ADAC004417.
DR VEuPathDB; VectorBase:ADAC004417; -.
DR VEuPathDB; VectorBase:ADAR2_008967; -.
DR eggNOG; KOG3586; Eukaryota.
DR HOGENOM; CLU_014430_9_2_1; -.
DR OMA; MRHQTAA; -.
DR OrthoDB; 5323209at2759; -.
DR Proteomes; UP000000673; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:UniProt.
DR CDD; cd20193; T-box_TBX20-like; 1.
DR Gene3D; 2.60.40.820; Transcription factor, T-box; 1.
DR InterPro; IPR008967; p53-like_TF_DNA-bd_sf.
DR InterPro; IPR046360; T-box_DNA-bd.
DR InterPro; IPR036960; T-box_sf.
DR InterPro; IPR002070; TF_Brachyury.
DR InterPro; IPR001699; TF_T-box.
DR InterPro; IPR018186; TF_T-box_CS.
DR PANTHER; PTHR11267:SF129; LP04777P-RELATED; 1.
DR PANTHER; PTHR11267; T-BOX PROTEIN-RELATED; 1.
DR Pfam; PF00907; T-box; 1.
DR PRINTS; PR00938; BRACHYURY.
DR PRINTS; PR00937; TBOX.
DR SMART; SM00425; TBOX; 1.
DR SUPFAM; SSF49417; p53-like transcription factors; 1.
DR PROSITE; PS01283; TBOX_1; 1.
DR PROSITE; PS50252; TBOX_3; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00201};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00201}; Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 217..408
FT /note="T-box"
FT /evidence="ECO:0000259|PROSITE:PS50252"
FT REGION 1..44
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 83..104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 133..191
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 551..639
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 27..44
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 133..151
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 551..583
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 616..639
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 639 AA; 68643 MW; B54E0A740A518A2D CRC64;
MLLEAASHNG NNHNPPHHVP LQPQPPHHQL HHHHHLHHPH HQYPHHQAAM VAAVQQAAAA
LTAVKNATNF SIAAIMAQGT NNNNNNNSSS ISNSPPNGTS NTETSAAAAA AAAAAAAAAA
AAVVANATRI EESNYRPRSR TPERSSEVAE EEINVNVEDC SDSEESTEQV RETHSRTAST
PASVSDEDRL SPEIAQKAPK IVGSCNSDDL RPVQCHLETK ELWDKFNELG TEMIITKTGR
RMFPTVRVSF SGPMRHQTAA DRYAVLLDII PLDNRRYRYA YHRSAWLVAG KADPPPPARL
YSHPDTPLGP DALRRQVISF EKIKLTNNEM DKTGQIVLNS MHRYQPRIHL VRIGPNQSIP
TSPAELQEMD HKTFVFPETI FTAVTAYQNQ LITKLKIDSN PFAKGFRDSS RLNDFDRDPM
ESFLLEQHLR SPLRLFPDQV MAQLGAGAGA GGGMPPGMQS NDPSVLLEKA RQQLHLWGNP
SAAYSELMLQ QLYQRPNFGL NFGPLWQSQW PPTQLPPGFL SNVGPPPPPA HNAATSTMQG
VAAALAAASG AGSNSASASP TAPSASSSSS SSSSASSAGS QPGPVSPEVR PKSFARFTPY
QIPQHPPAAA ATASGPGHSP GQQAAGRSPG SPASRSPSH
//