ID Q4DYW0_TRYCC Unreviewed; 3432 AA.
AC Q4DYW0;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN97716.1};
GN ORFNames=Tc00.1047053507167.169 {ECO:0000313|EMBL:EAN97716.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN97716.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN97716.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN97716.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN97716.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000087; EAN97716.1; -; Genomic_DNA.
DR RefSeq; XP_819567.1; XM_814474.1.
DR STRING; 353153.Q4DYW0; -.
DR PaxDb; 353153-Q4DYW0; -.
DR EnsemblProtists; EAN97716; EAN97716; Tc00.1047053507167.169.
DR GeneID; 3552040; -.
DR KEGG; tcr:507167.169; -.
DR eggNOG; ENOG502SEI3; Eukaryota.
DR InParanoid; Q4DYW0; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR Gene3D; 2.160.20.10; Single-stranded right-handed beta-helix, Pectin lyase-like; 1.
DR InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR012334; Pectin_lyas_fold.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF11024; DGF-1_4; 1.
DR Pfam; PF11038; DGF-1_5; 1.
DR Pfam; PF11040; DGF-1_C; 1.
DR SMART; SM00710; PbH1; 8.
DR SUPFAM; SSF101447; Formin homology 2 domain (FH2 domain); 1.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 21..40
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3053..3079
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3099..3118
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3124..3141
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3161..3180
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3192..3213
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3263..3285
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3291..3310
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3317..3338
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3350..3373
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 2833..2911
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11024"
FT DOMAIN 2924..3201
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11038"
FT DOMAIN 3373..3431
FT /note="Dispersed gene family protein 1 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11040"
FT REGION 3391..3432
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3391..3410
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3432 AA; 356164 MW; D885A7CC2582CADE CRC64;
MWCAVGACPH SRHRVRCRAI AAVRVALLVV LSLVAAAAWM PAVHAVVLRL RGGTVDRAIT
VGRAVDTVLM DGVSITNGVA VVFDVPAMLP GTLRIELRHC VCDGGVQIYV RGYSGEPASD
RSLEVSVSGL SGSYCSLVFV HNLPAHTNVT VRDSTIVTAG PMRYSQLSGL TDAVASPLVL
HATSLLQSQL RVSNTVLRSL QAGGSAVYFG GGVDLLSSAV VLDGVLLEAL GGPTASALHV
ASSSRLSLRS HSVFSVTSVS VVSSGGGGIV LGERLAVLDS VLRFVRVEGF AASSMVHCDG
GTIGAGGWLE LHDVWAVGEA SSVASLSGVM LSGGTVSIVR CAATGATLVS GLTITSGVVS
VQCNRAGGRV LQSSGDYRMA GLPSVSVVPC DGCAAALACF DALTASFSDC VCSCRAGGVG
EACLPFDVPP SRAGGGGDTA PGCVSGVTLT ESVTVGGGRA TTCFDSVVFS GPIIVAVDLR
LMDAFADALN VTLRHCVLAG GAQLRIGGLS EITARLMPHA LLNMTNVTSV EGTIVLHGAM
PLHSSVLLAN STLRATVDWS QYVPTTPGFA GSRYGPALVL DGVRLLSTRF VMTRSTLVCG
GESCAAILVE RGLGVHLSSV FYMDNCAVLS RTHVMYALAS NLRMSGSSVF SIHNSLWSAP
SNEYYKGACM FGDVVVDGGS VLQVVSSNFR LGFAMFIANT LTVTGGSWLV HRDNEFHAAY
VVYVANENAV VFRDRSVWSI LDNNFTYGSY SSTTAYVTNN WSPPSDTRPT IYGMCNEAIG
SPVTDYQGEL NIGTPVTVLD CGACTVDAVC FAARTSSISG CECVCAAGGY GDTCLPAAVP
DGLGPLPLPD AKDTEVRCVH GGSIGSVYVP DPGVRGLCLV KVTFTAPMLL DLSYFDAPQQ
TLNITLLQCV LVGLLIRGNG ARVHVSVTSS MLASGALVFE GDFGVSSQIL VVGSTLVTTS
SHAISFLLFI CVNSTLLLLD NYIEGNIHAV YFTNAAVDGG GIIVKGNTLR ATRDDDGVES
SVYVNAIALR NGGYFDVENT KMSAVSGFYF HGDTTVSSAG LLRVADCYFV ESTEEFESAL
VFLSGSLSLE GGAQWRVEGN SVGAASILSI AHAEVKIQLS GSGTTVALAH NRQVDSRSFL
ARFLPSSIVV KLSARFVVGC NLQGDEEVSY DDVFPEDVVV FRCGTCNDDA ACYMPGTESV
DRGSCSCSCK DGWHGASCLP FEVPDTVVPP LPERAVDGDT SCVVNQTLTD LTLNMWKTHH
CYVGVTFSGV GAALTFFLNR MPLHLPINIT LTGCTFREGA ALQFVGGASA AESAGVLIRV
SRTVMRSSVV VFAAALPQHC DIAVTEVDAV QSSEVQLPHI RTNMLSVVLL QIVVLSASSL
LVSNIKAHSL RYGALGLYST GTLTLLDGSS LYVQYCSFAG YTHMFYVNIL SVNDHSVFAL
LNNTMSSGTS LLCQQQELSV SEHSVLRVVG NSGPVSNAIY SLSFFTVHYS SWLDWRYNDV
GVGAMFHESE STFLIIDGSS VVTLTGCTMG STGLSVSLLK RADAGYRFVA GCLTVAGREV
TTAAELALNG ITNVTTAAAC GECTKEGDCF APLTTAVIGC RCRCAAGGHG DVCLPAPVPA
GPPPPPPPPP PPPPPPPVGE CISDMVYPEV AQTVGGGLSC LCYRNVTFSG GGMSLTVLVG
AMTGDVVNVT FDGCTWRDGA VLLLLGNAYA AVGSLNIFVT RNTFSDALLS PEGVFPPHTN
ITISGNRFTL TWLVPRSGLD IRRPSCVAMN GLVISNDSAV VLSGNVFQSV TTSSSAIYVV
GSALRVSWHS LFAVVGNTFH MAGGDGTLIY LEGSSQSLSL SVLNNSAVVI RGNAVTRPVK
CFVLLIWTLD VESHSAVVFQ GNEMQRSSVV FISGDSSNIY YNSWLQLSGN LCRESPSEAF
AFLYPKVNLR DSTVSVSGNQ FVSSTVSPTM LRIPQSSRDL TNGAIVAACN TVNGEEEAKY
AIPSVYNATI MTCSDPCTLA ASCFPAYTTT ASSDGCACAC AEGGYGDACL PVAVPEPPST
DGADLCVRDV RVGVGVNAGF GTSVACYVGV TFATDVVVDV ESMSGSMRNV TLVNCLFVGG
ASLYVVGWRS DPPAGERADV LISGLESCSG GGVVVASRFS PGSRVTVVDS VLIAEARVAY
RGAYGLGDAS ACLVLHNVNL TGSVLTIART HVAAVFRDAV GVLFVGGVAL SSRGALYVDG
LSVQTALGLC VSVEGGVAAS GGSVVAFVDS DFLLCKHAVS VRGAVSVSGS AVALVRSEFL
STEDYAVAFY STVSLDGGSM LLAKGNVHDG ASREMLYAAG AVTAAGSTLS FVRNRALLPR
MVSLSLSLAS GAHLRVACNY AGGRILSTAE EYAAAGFGDA GSIDVAGCDA CDRDTHCYTP
GTASASMRNG VCVCLCGSSG HGEACVPVGA PALPPAVGTA SSVFFREGVT LRSVFVVPAG
ASEVTLRHVV LDGVSPVLYV PWMARDGVRI VVQDVSLLNG AVLYVMGGGA LRGAVAAGSD
ESGPVELSVC EVEALNGALV LSGTFPAGSV LTVTDSLLVA ARPTPLVYLP GSQSSPYAPV
LVLSGLRLMR SVLVLFGVAL VTVMTGGRTV VVDGAVLVLV GGGVTLDAAV FGGDYALYAS
ARVVASEGAV LRVSGSQVYA AHGLVFGSGV EANASAVVVN DNTGALTDGA LLELRGSASF
ASGSWLSVRG NSISGRLLSV PSYPRRAELV KSTLTLHGNA GSGPVVMDGT VALGGAGRKF
VVGCLTLNGQ ALQPMDYRSA GIIGEFRPVA CGVCDADVRC FAAATRTMSG SCRCRCAEGG
YGRDCLPVYL PHVDGCNRTP APPPLSHTAT LTETRSLTPT WTATQTPSLS TAHYSPPWYG
PTETLQGTET VALSPTRTPT ASVSSTPWWS DVACPTLAVT TTSAGGSLTQ NDIRGGGSAV
PTRLMVALPP PFRWARDPQL GTHLSFVPVS TAQPRGFGGP WGAMLSNATW VRNATNPSTV
LELAVPVHRG YFIAADETIV IRCDAAAVSG GCRGVLLGSF TIRSATLPAA ASALSAITGV
VAGAAAVAVV VTGGLGSILE MQALGVFARM SCASAQERAS TVALPCFLSV FAALDPLWMV
VGNALLAAVF GCVHCGVTAA FQRWRGVDAA SAWAAMRFPS LTYVVAHAMH LGIFFGSVLA
LAMPDARGHH RVIGVVGVLY GVAFPAGVCY LIARHAGASF TRYWQFLRKP LHERLLYPVG
YWHPAAQQRM YGGMLTNVRG SRVYWCVFQL SVLCVVCLIA AVQSPVGGCD VQYFCMAAVL
VAGAGVVAFT NMMRSSFLTV MHTASFVLLA ALCVVSAANH LAPSDGGARA YAAIVLLLTT
LLLAVTVYSM VVWHAEDRHW QELREPRRGG LEALLRDDEE SDEETQKPHD RTSFSYAPGT
TVASSYRPPA PP
//