GenomeNet

Database: UniProt
Entry: Q4CSG4_TRYCC
LinkDB: Q4CSG4_TRYCC
Original site: Q4CSG4_TRYCC 
ID   Q4CSG4_TRYCC            Unreviewed;       217 AA.
AC   Q4CSG4;
DT   13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2005, sequence version 1.
DT   08-NOV-2023, entry version 51.
DE   SubName: Full=Mucin TcMUCII, putative {ECO:0000313|EMBL:EAN83216.1};
GN   ORFNames=Tc00.1047053505959.10 {ECO:0000313|EMBL:EAN83216.1};
OS   Trypanosoma cruzi (strain CL Brener).
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX   NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN83216.1, ECO:0000313|Proteomes:UP000002296};
RN   [1] {ECO:0000313|EMBL:EAN83216.1, ECO:0000313|Proteomes:UP000002296}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CL Brener {ECO:0000313|EMBL:EAN83216.1,
RC   ECO:0000313|Proteomes:UP000002296};
RX   PubMed=16020725; DOI=10.1126/science.1112631;
RA   El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA   Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA   Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA   Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA   Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA   da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA   Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA   Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA   Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA   Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA   Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA   Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA   Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA   Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA   Andersson B.;
RT   "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT   disease.";
RL   Science 309:409-415(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN83216.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAHK01002135; EAN83216.1; -; Genomic_DNA.
DR   RefSeq; XP_805067.1; XM_799974.1.
DR   AlphaFoldDB; Q4CSG4; -.
DR   PaxDb; 353153-Q4CSG4; -.
DR   EnsemblProtists; EAN83216; EAN83216; Tc00.1047053505959.10.
DR   GeneID; 3534688; -.
DR   KEGG; tcr:505959.10; -.
DR   VEuPathDB; TriTrypDB:TcCLB.505959.10; -.
DR   InParanoid; Q4CSG4; -.
DR   Proteomes; UP000002296; Unassembled WGS sequence.
DR   InterPro; IPR000458; Tryp_mucin.
DR   Pfam; PF01456; Mucin; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..217
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004235836"
FT   REGION          62..181
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        62..84
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        93..131
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        143..180
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   217 AA;  22004 MW;  920D3214CEBA3C4A CRC64;
     MMTTCRLLCA LLVLALCCCP SVCALVEPEP KFDVQASQTT TTNTTQPPTA KPVEAIVGAP
     SQSTLATGGA SQETGHSDPS TTGPVASPGD DGREGSGSTS TAVLEPKVST EPNKSLTEGT
     NGMTGQQSRT ETSAEEKNKE SEASAQTTTT TTTTQAPTTT TTTAPEAPST TTTEAPAVST
     TCAPSRLREI DGSLSSSAWV CAPLVLAVSA LAYTTLD
//
DBGET integrated database retrieval system