ID   F9WIZ4_TRYCI            Unreviewed;       453 AA.
AC   F9WIZ4;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   19-OCT-2011, sequence version 1.
DT   22-FEB-2023, entry version 25.
DE   SubName: Full=WGS project CAEQ00000000 data, annotated contig 85 {ECO:0000313|EMBL:CCD17295.1};
GN   ORFNames=TCIL3000_0_02370 {ECO:0000313|EMBL:CCD17295.1};
OS   Trypanosoma congolense (strain IL3000).
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma; Nannomonas.
OX   NCBI_TaxID=1068625 {ECO:0000313|EMBL:CCD17295.1, ECO:0000313|Proteomes:UP000000702};
RN   [1] {ECO:0000313|Proteomes:UP000000702}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=IL3000 {ECO:0000313|Proteomes:UP000000702};
RA   Jackson A.P., Berry A., Allison H.C., Burton P., Anderson J., Aslett M.,
RA   Brown R., Corton N., Harris D., Hauser H., Gamble J., Gilderthorp R.,
RA   McQuillan J., Quail M.A., Sanders M., Van Tonder A., Ginger M.L.,
RA   Donelson J.E., Field M.C., Barry J.D., Berriman M., Hertz-Fowler C.;
RT   "Divergent evolution of antigenic variation in African trypanosomes.";
RL   Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:CCD17295.1, ECO:0000313|Proteomes:UP000000702}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=IL3000 {ECO:0000313|EMBL:CCD17295.1,
RC   ECO:0000313|Proteomes:UP000000702};
RX   PubMed=22331916; DOI=10.1073/pnas.1117313109;
RA   Jackson A.P., Berry A., Aslett M., Allison H.C., Burton P.,
RA   Vavrova-Anderson J., Brown R., Browne H., Corton N., Hauser H., Gamble J.,
RA   Gilderthorp R., Marcello L., McQuillan J., Otto T.D., Quail M.A.,
RA   Sanders M.J., van Tonder A., Ginger M.L., Field M.C., Barry J.D.,
RA   Hertz-Fowler C., Berriman M.;
RT   "Antigenic diversity is generated by distinct evolutionary mechanisms in
RT   African trypanosome species.";
RL   Proc. Natl. Acad. Sci. U.S.A. 109:3416-3421(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CCD17295.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAEQ01002665; CCD17295.1; -; Genomic_DNA.
DR   AlphaFoldDB; F9WIZ4; -.
DR   VEuPathDB; TriTrypDB:TcIL3000_0_02370; -.
DR   OMA; VARDCND; -.
DR   OrthoDB; 5622406at2759; -.
DR   Proteomes; UP000000702; Unassembled WGS sequence.
DR   Gene3D; 1.20.1260.80; -; 1.
DR   InterPro; IPR031987; GARP.
DR   Pfam; PF16731; GARP; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000000702};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..453
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003395092"
FT   DOMAIN          35..225
FT                   /note="Trypanosoma glutamic acid/alanine-rich protein"
FT                   /evidence="ECO:0000259|Pfam:PF16731"
FT   REGION          54..137
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          163..196
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          210..428
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        66..125
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        210..274
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        275..302
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        303..323
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        335..355
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        367..418
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   453 AA;  48734 MW;  C06F2E76468F1849 CRC64;
     MAIRFPYYLA LLFYVTGIPS VADPNDDENE KLTVEAVQSV CELTKIMRGV GESTKELQSQ
     AEAHASEASK AKEESESAVG RAEEANEKSP EAADALERAR EALSEAEEAS EEAALASKEA
     EKHAENATSL ATGQQEDLEK FLREIAEDIE SDEDIKAVAR DCNDTNSSVT SEGLDSARRN
     AKKHFTSEAA NTLENITNAT VEVFSNLEKE VQRAKESQDT AEKARDAATE AADDAEEKAK
     KGGGNTSDDD DNSSDEKEES PGKDEEKSPG EDEEKSPGED EEESPGEDEE KSPGEDEEES
     PGKDEEKSPG KDEEESPGKD EEKSPGEDEE ESPGKDEEKS PGKDEEESPG KDEEKSPGED
     EEESPGKEEE KSPGKDEEES PGKDEEKSPG GDEEKSPGGD EEKSPGKGEG KTPGKDEESS
     DEDEYEAVGS ARFRGTGVGL FLSLALLVHA TVL
//