ID Q4CQF1_TRYCC Unreviewed; 219 AA.
AC Q4CQF1;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 24-JAN-2024, entry version 49.
DE SubName: Full=Mucin TcMUCII, putative {ECO:0000313|EMBL:EAN82503.1};
GN ORFNames=Tc00.1047053503889.10 {ECO:0000313|EMBL:EAN82503.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN82503.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN82503.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN82503.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN82503.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01002465; EAN82503.1; -; Genomic_DNA.
DR RefSeq; XP_804354.1; XM_799261.1.
DR AlphaFoldDB; Q4CQF1; -.
DR PaxDb; 353153-Q4CQF1; -.
DR EnsemblProtists; EAN82503; EAN82503; Tc00.1047053503889.10.
DR GeneID; 3533765; -.
DR KEGG; tcr:503889.10; -.
DR InParanoid; Q4CQF1; -.
DR OrthoDB; 140785at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR InterPro; IPR000458; Tryp_mucin.
DR Pfam; PF01456; Mucin; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..219
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004235773"
FT REGION 26..179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 26..86
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..179
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 219 AA; 21858 MW; F6A345DD9AC882B5 CRC64;
MMTCRLLCAL LVLTLCCCPS VCATENSGDP NKDSPSVSPA SQPGAGVLGQ ANTSTTPSST
RLEVTLPGTE NQNSRGEASD TRSNTGRGKG PAPPGNSVPT IPAFDTEQGQ GGSGGSGSSG
TTADTVQKNG DNDTTDSTSG NNSSTDQTNT NAGDPAEKST ATTTTTTTTT TTTTTQAPTT
TTIRAPSLLR ESDGSLSSSA WVCAPLLLAV SALAYTTLG
//