ID Q4CVS2_TRYCC Unreviewed; 2510 AA.
AC Q4CVS2;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN84374.1};
GN ORFNames=Tc00.1047053511561.20 {ECO:0000313|EMBL:EAN84374.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN84374.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN84374.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN84374.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN84374.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01001713; EAN84374.1; -; Genomic_DNA.
DR RefSeq; XP_806225.1; XM_801132.1.
DR AlphaFoldDB; Q4CVS2; -.
DR STRING; 353153.Q4CVS2; -.
DR PaxDb; 353153-Q4CVS2; -.
DR EnsemblProtists; EAN84374; EAN84374; Tc00.1047053511561.20.
DR GeneID; 3536175; -.
DR KEGG; tcr:511561.20; -.
DR eggNOG; ENOG502SEI3; Eukaryota.
DR InParanoid; Q4CVS2; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR Gene3D; 2.160.20.10; Single-stranded right-handed beta-helix, Pectin lyase-like; 1.
DR InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR012334; Pectin_lyas_fold.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR Pfam; PF11024; DGF-1_4; 1.
DR Pfam; PF11038; DGF-1_5; 1.
DR Pfam; PF11040; DGF-1_C; 1.
DR SMART; SM00710; PbH1; 8.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
PE 4: Predicted;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2108..2134
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2154..2173
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2179..2196
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2216..2241
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2247..2268
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2318..2339
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2345..2367
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2379..2397
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2409..2430
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1893..1967
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11024"
FT DOMAIN 1980..2256
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11038"
FT DOMAIN 2428..2510
FT /note="Dispersed gene family protein 1 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11040"
FT REGION 1907..1941
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2449..2492
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2449..2464
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2465..2486
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EAN84374.1"
SQ SEQUENCE 2510 AA; 261210 MW; DF6EB125C5205241 CRC64;
FGVNSQILVV GSTLVTTSSH AIAFLDFVFG ANSTLLLLDN RIEGNRHALN FPFIVVVDGG
GVIVKGNILR TAGNSGAELA ICVHAVDVKN GGYFGVENNL MRAVVGVWFV KVATVSSAGL
LRVADCTFIG KEYLFDPALV YLSNSLILQG GAQWRVEGNN VSAASVLSMP RSQQKIHLSG
SGTTVVLAKN RQVEGSAVFG NLLPPNTIVA SPARFVVGCN LRGGEEVSYD DVFPEGVVVF
GCGTCNDDAA CYMPGTESVD RGPCSCSCKD GWHGASCLPF EVPDTVVPPL PERAVDGDAS
CVVNQTLTNL TLNMWKTRHC YAGVTFSGVG AVLTFSLNSM PLHLPINVTL TGCTFLEGAV
LRFVGGAEAV ESAGVLIRVS QTVIRSSVVV FALALPQHCD IAVTEVDAVQ FSVVDIPDIT
SKTLSVLLLK NVVLSASSLL VSNVKAYGSD YRGFGLHSVG TLTLVGGSSL YARYFSFDGY
KHLFYVYRLS VSDHSVFALL NNTMSSGAGL LVQQHGFSVS DHSVLRVVGN RGSVSYAIYA
NKLWTVEQSS WLDWRDNDVG LGTVFYDSGS PFVDMDSSSG VTLTGCKMGS TGLSVSLLKR
VGAGYRFVAG CLIVAGREVT AVAELELNGI TNVTTVAGCG ECTKDGNCFA PLTTAVIDCK
CQCAAGGHGD VCVPAPVPAG PPPPPPLPAP ATPPPPPVGE CISDMVYPEV AQAVGGGLSW
LCYRNVTFSG GGMSLTVLVG AMTGDVANVM FDGCTWRDGA VLLLLGNAYA AVGSLNIVVT
GNTFRDALLS PEGVFPPRTN ITISGNRFTV TRLVPRSGLD LRRPSCVAMN GLVISNNSAV
VLSGNVFHAV AASSSAIYVV RSSLRVSWHS VFAVMGNTFH MDGGDSNLIC LEGSRHSSSL
SVLNNSAVVI RGNLVTRPVR YFMYFFLALF VESQSAVVFQ GNDMQGSWFV FFPSHSSNIY
YNSWLQLSDN LCRESPSEAF LFINPKVNLR GSTVSVSGNR FMSNTVTPTV LRISSASGDL
TNGAIVAACN TVNGEEEVQY VIPSVYNATI LTCSDPCALA ASCFPAYTTT ASSDGCACNC
AEGGHGDACL PVAVPEPPST DGVDLCVRDV SVDVEVNAGL GTSVVCYVGV TFAADVVVDV
ESMSGSVRNV TLANCAFVDG ASLYVVGWLS DPPAGERADV LISGLESRSG GGVLVANRYP
PGSRVTVVDS VLIAEKRVAY RGAYFLGDAS ACLLVHSMNL TGSVLTIART HVAAVFRDAA
GVLFVGGVAL SSRGALYVDG LSVQTALGLC VSVEGGVAAS GGSVVAFVDS DFLLCRHAVS
VRGAVSVSGS AVALVRSGFV LTEDYAVAFY STVSLDDGSM LLVKGNVHDG VSREMLYAAG
AVTAAGSTLS FVRNRALLPR MLSLSLSLAD GAHLRVACND AGGRVLSTAE EYAAAGFGDA
GSIDVVGCDD CDRETYCYAP GTASASMTDG VCVCVCGSGG YGEACVPVGV PVLPPAAVTA
SSVFFREGVT VRSVFVVPAG AGEVTLRHVV LDGVSPVLYV PWMARDGVRI VVQNVSLLNG
AVLYVMGGGV LRGAVAAGSD ESGPVELSVC DVEALNGALV LTGTFPAGSV LTVTDSLLVA
ARPTPLVYLP GSQSSPYAPV LVLSGLRLVR SVLVVSGVAL VTVMTGGRTV AVDGAVLELV
GGGVALDSAV FGGEYALYAS ARVVASGGAV LRVSGSQVYA AHGLVFGSGV EANASAVVVN
DNVGALTDGA LLVLRGSASF ASGSWLSVRG NSISGRLLSV PSYPRRADLV QSTLTLHGNA
GSGPVVMDGT VALGGAGGRF VVGCLTLNGQ ALQPMDYRSA GIIGEFRPVA CGVCDADVFC
FAAATRAMSA SCRCRCAEGG YGRDCLPVYL PHVDGCNRTL FRPPLSHTAT LTETRSPTPT
PTPSLSTTHY SPTQYSQTET PRVTETVALS PTRTPTASVS STLWWSDVAC PTLAVTTTAA
GGSLTQNDIR GGGSAVPTRL MVALPPPFQW ARDPQLGTHL SFVPVATAQP RGFGGPWGAM
LRNATWVRNA TNPSTVLELA VPVHRGYFIA ADETIVIRCD AVAVFGGCRG VLLGSFTIRS
DTLPAASALS AITGVVAGAA AVAVLVTGGL GSILEMQALG VFARMPCASA QERASTVALP
YFLSVFAALD PLWMVVGNAL LAAVFGCVHC GVTAAFQRWR GVDAATAWAA MRFPSLTYVV
AHAMHLGIFF GSVLALAMPG ARVQHRVVGV VGVLYGAAFP AGVCYLIARH VGASFERYWQ
FSRKPLHERL LYPVGYWHPA AQQRMYGGML TNMRDSRVYW CVFQLSVLCV VCLIAAVHSP
VGGCHVQYFC MAAVLLAGAG VVAFTNMMRS AFLTVMRTAG FALLAALCVI SAANHLAPSD
GGARAYADIV LLLTTVLLAV AVYSVVVWYA EDRYWQELRE PRRGGLEALL RDDEESDEET
QKLHDMTSSS YASGTTTASS YRPPAPPLQS VTGDIRSDAL SLFDGASIAS
//