ID Q4CVU9_TRYCC Unreviewed; 375 AA.
AC Q4CVU9;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 52.
DE SubName: Full=Mucin TcMUCII, putative {ECO:0000313|EMBL:EAN84403.1};
GN ORFNames=Tc00.1047053508747.50 {ECO:0000313|EMBL:EAN84403.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN84403.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN84403.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN84403.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN84403.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01001703; EAN84403.1; -; Genomic_DNA.
DR RefSeq; XP_806254.1; XM_801161.1.
DR AlphaFoldDB; Q4CVU9; -.
DR PaxDb; 353153-Q4CVU9; -.
DR EnsemblProtists; EAN84403; EAN84403; Tc00.1047053508747.50.
DR GeneID; 3536213; -.
DR KEGG; tcr:508747.50; -.
DR VEuPathDB; TriTrypDB:TcCLB.508747.50; -.
DR InParanoid; Q4CVU9; -.
DR OrthoDB; 138398at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR InterPro; IPR000458; Tryp_mucin.
DR Pfam; PF01456; Mucin; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..375
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004236275"
FT REGION 54..344
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 57..71
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 72..89
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 106..198
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..240
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 242..256
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..344
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 375 AA; 37919 MW; 23AF9B843EEDE79E CRC64;
MMTGRVLCVL LVSSLMCCFL CVCAAVAATE ARVRPAGDGA VTAGHMWAVM AAESEPVVDG
SERKVNVNSE ENDNQEDEEN EENEEDEEGG GSKVGDVPTP TPPETLTPTT RKDQSSTDQL
RSITSNPSDA GDVSDSQTHA SGQESLSGGP AAGTTSPLPN SSQQAAGGVH AGGGSSSQGS
QASGQTVTGQ SLVSETAKAT PQGGGGGGPG AHHEADHTTK DGGDNPLGKN GNEKEEPPEG
SQTAPRPSSG GSANVAPNPA VSIPLPPEPK STGEATGTGE SPHPTDNAQT QSMRNDTTPS
TAQTQNSPNT KAQEPEAEIT TTEAPTTTTT ETPTTTTTTR APSRLREFDG SLSSSAWVCA
PLLLAVYALA CTTVG
//