ID Q4E2F7_TRYCC Unreviewed; 215 AA.
AC Q4E2F7;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 08-NOV-2023, entry version 42.
DE SubName: Full=Mucin TcMUCII, putative {ECO:0000313|EMBL:EAN98970.1};
GN ORFNames=Tc00.1047053509525.320 {ECO:0000313|EMBL:EAN98970.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN98970.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN98970.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN98970.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN98970.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000034; EAN98970.1; -; Genomic_DNA.
DR RefSeq; XP_820821.1; XM_815728.1.
DR AlphaFoldDB; Q4E2F7; -.
DR PaxDb; 353153-Q4E2F7; -.
DR EnsemblProtists; EAN98970; EAN98970; Tc00.1047053509525.320.
DR GeneID; 3553605; -.
DR KEGG; tcr:509525.320; -.
DR InParanoid; Q4E2F7; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR InterPro; IPR000458; Tryp_mucin.
DR Pfam; PF01456; Mucin; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..215
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004237992"
FT REGION 30..183
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 30..99
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 115..183
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 215 AA; 21921 MW; 90EE03BC48D57CEF CRC64;
MMMTCRLLCA LLVLALCCCP SVCVSEIQPA ASSQTNTTPV PTKPESPEAV NGQSTPATEG
APQGTGQPGM QSNGSAEQPR NNAGYGGSPT VTVPQVNAEP IEVQTDDKDS KTETTRSTST
SSGKIVKEAE DTSGTKPPTT TTTTTKPPTT TKAPTTTTTT TAPEAPSTTT TEAPAVSTTR
TPSRLREIDG SLSSSAWVCA PLVLAASALA YTTLG
//