GenomeNet

Database: UniProt
Entry: Q4CVN0_TRYCC
LinkDB: Q4CVN0_TRYCC
Original site: Q4CVN0_TRYCC 
ID   Q4CVN0_TRYCC            Unreviewed;       276 AA.
AC   Q4CVN0;
DT   13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2005, sequence version 1.
DT   08-NOV-2023, entry version 49.
DE   SubName: Full=Mucin TcMUCII, putative {ECO:0000313|EMBL:EAN84333.1};
GN   ORFNames=Tc00.1047053509479.20 {ECO:0000313|EMBL:EAN84333.1};
OS   Trypanosoma cruzi (strain CL Brener).
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX   NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN84333.1, ECO:0000313|Proteomes:UP000002296};
RN   [1] {ECO:0000313|EMBL:EAN84333.1, ECO:0000313|Proteomes:UP000002296}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CL Brener {ECO:0000313|EMBL:EAN84333.1,
RC   ECO:0000313|Proteomes:UP000002296};
RX   PubMed=16020725; DOI=10.1126/science.1112631;
RA   El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA   Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA   Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA   Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA   Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA   da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA   Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA   Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA   Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA   Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA   Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA   Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA   Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA   Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA   Andersson B.;
RT   "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT   disease.";
RL   Science 309:409-415(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN84333.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAHK01001727; EAN84333.1; -; Genomic_DNA.
DR   RefSeq; XP_806184.1; XM_801091.1.
DR   AlphaFoldDB; Q4CVN0; -.
DR   PaxDb; 353153-Q4CVN0; -.
DR   EnsemblProtists; EAN84333; EAN84333; Tc00.1047053509479.20.
DR   GeneID; 3536123; -.
DR   KEGG; tcr:509479.20; -.
DR   VEuPathDB; TriTrypDB:TcCLB.509479.20; -.
DR   InParanoid; Q4CVN0; -.
DR   Proteomes; UP000002296; Unassembled WGS sequence.
DR   InterPro; IPR000458; Tryp_mucin.
DR   Pfam; PF01456; Mucin; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..276
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004236077"
FT   REGION          61..243
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        73..88
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        106..123
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        132..243
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   276 AA;  27278 MW;  FEBA71839309CA1C CRC64;
     MTTCRLLCAL LVLALCCCPS VCVTATGKEQ DGVNGSTAAQ PPGAGGLGAK NTFAASISTV
     PADALPSKTP GTGTVEIPGN NAEQPHGTSG LQGAGDAASL DEQEPITESS RSDSAEPTEK
     VQEREPISGT GGTEPLKQSS TGQTQSGDPT SQASSAGTGV PGSPDTPSLE SIPKQEQAPG
     GSDGSSGGSV NPTTSSSSVT ASVPAQSQKE HAPTTTTTTT TKAPTTTTTT TTETPSTTTT
     RAPSRLREID GCLSSSAWVC APLLLAVSAL AYTALG
//
DBGET integrated database retrieval system