GenomeNet

Database: UniProt
Entry: Q4DYW0_TRYCC
LinkDB: Q4DYW0_TRYCC
Original site: Q4DYW0_TRYCC 
ID   Q4DYW0_TRYCC            Unreviewed;      3432 AA.
AC   Q4DYW0;
DT   13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2005, sequence version 1.
DT   27-MAR-2024, entry version 65.
DE   SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN97716.1};
GN   ORFNames=Tc00.1047053507167.169 {ECO:0000313|EMBL:EAN97716.1};
OS   Trypanosoma cruzi (strain CL Brener).
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX   NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN97716.1, ECO:0000313|Proteomes:UP000002296};
RN   [1] {ECO:0000313|EMBL:EAN97716.1, ECO:0000313|Proteomes:UP000002296}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CL Brener {ECO:0000313|EMBL:EAN97716.1,
RC   ECO:0000313|Proteomes:UP000002296};
RX   PubMed=16020725; DOI=10.1126/science.1112631;
RA   El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA   Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA   Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA   Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA   Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA   da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA   Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA   Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA   Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA   Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA   Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA   Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA   Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA   Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA   Andersson B.;
RT   "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT   disease.";
RL   Science 309:409-415(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN97716.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAHK01000087; EAN97716.1; -; Genomic_DNA.
DR   RefSeq; XP_819567.1; XM_814474.1.
DR   STRING; 353153.Q4DYW0; -.
DR   PaxDb; 353153-Q4DYW0; -.
DR   EnsemblProtists; EAN97716; EAN97716; Tc00.1047053507167.169.
DR   GeneID; 3552040; -.
DR   KEGG; tcr:507167.169; -.
DR   eggNOG; ENOG502SEI3; Eukaryota.
DR   InParanoid; Q4DYW0; -.
DR   Proteomes; UP000002296; Unassembled WGS sequence.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   Gene3D; 2.160.20.10; Single-stranded right-handed beta-helix, Pectin lyase-like; 1.
DR   InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR   InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR   InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR   InterPro; IPR006626; PbH1.
DR   InterPro; IPR012334; Pectin_lyas_fold.
DR   InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR   PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF11024; DGF-1_4; 1.
DR   Pfam; PF11038; DGF-1_5; 1.
DR   Pfam; PF11040; DGF-1_C; 1.
DR   SMART; SM00710; PbH1; 8.
DR   SUPFAM; SSF101447; Formin homology 2 domain (FH2 domain); 1.
DR   SUPFAM; SSF51126; Pectin lyase-like; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        21..40
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3053..3079
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3099..3118
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3124..3141
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3161..3180
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3192..3213
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3263..3285
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3291..3310
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3317..3338
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3350..3373
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          2833..2911
FT                   /note="Dispersed gene family protein 1"
FT                   /evidence="ECO:0000259|Pfam:PF11024"
FT   DOMAIN          2924..3201
FT                   /note="Dispersed gene family protein 1"
FT                   /evidence="ECO:0000259|Pfam:PF11038"
FT   DOMAIN          3373..3431
FT                   /note="Dispersed gene family protein 1 C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11040"
FT   REGION          3391..3432
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3391..3410
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3432 AA;  356164 MW;  D885A7CC2582CADE CRC64;
     MWCAVGACPH SRHRVRCRAI AAVRVALLVV LSLVAAAAWM PAVHAVVLRL RGGTVDRAIT
     VGRAVDTVLM DGVSITNGVA VVFDVPAMLP GTLRIELRHC VCDGGVQIYV RGYSGEPASD
     RSLEVSVSGL SGSYCSLVFV HNLPAHTNVT VRDSTIVTAG PMRYSQLSGL TDAVASPLVL
     HATSLLQSQL RVSNTVLRSL QAGGSAVYFG GGVDLLSSAV VLDGVLLEAL GGPTASALHV
     ASSSRLSLRS HSVFSVTSVS VVSSGGGGIV LGERLAVLDS VLRFVRVEGF AASSMVHCDG
     GTIGAGGWLE LHDVWAVGEA SSVASLSGVM LSGGTVSIVR CAATGATLVS GLTITSGVVS
     VQCNRAGGRV LQSSGDYRMA GLPSVSVVPC DGCAAALACF DALTASFSDC VCSCRAGGVG
     EACLPFDVPP SRAGGGGDTA PGCVSGVTLT ESVTVGGGRA TTCFDSVVFS GPIIVAVDLR
     LMDAFADALN VTLRHCVLAG GAQLRIGGLS EITARLMPHA LLNMTNVTSV EGTIVLHGAM
     PLHSSVLLAN STLRATVDWS QYVPTTPGFA GSRYGPALVL DGVRLLSTRF VMTRSTLVCG
     GESCAAILVE RGLGVHLSSV FYMDNCAVLS RTHVMYALAS NLRMSGSSVF SIHNSLWSAP
     SNEYYKGACM FGDVVVDGGS VLQVVSSNFR LGFAMFIANT LTVTGGSWLV HRDNEFHAAY
     VVYVANENAV VFRDRSVWSI LDNNFTYGSY SSTTAYVTNN WSPPSDTRPT IYGMCNEAIG
     SPVTDYQGEL NIGTPVTVLD CGACTVDAVC FAARTSSISG CECVCAAGGY GDTCLPAAVP
     DGLGPLPLPD AKDTEVRCVH GGSIGSVYVP DPGVRGLCLV KVTFTAPMLL DLSYFDAPQQ
     TLNITLLQCV LVGLLIRGNG ARVHVSVTSS MLASGALVFE GDFGVSSQIL VVGSTLVTTS
     SHAISFLLFI CVNSTLLLLD NYIEGNIHAV YFTNAAVDGG GIIVKGNTLR ATRDDDGVES
     SVYVNAIALR NGGYFDVENT KMSAVSGFYF HGDTTVSSAG LLRVADCYFV ESTEEFESAL
     VFLSGSLSLE GGAQWRVEGN SVGAASILSI AHAEVKIQLS GSGTTVALAH NRQVDSRSFL
     ARFLPSSIVV KLSARFVVGC NLQGDEEVSY DDVFPEDVVV FRCGTCNDDA ACYMPGTESV
     DRGSCSCSCK DGWHGASCLP FEVPDTVVPP LPERAVDGDT SCVVNQTLTD LTLNMWKTHH
     CYVGVTFSGV GAALTFFLNR MPLHLPINIT LTGCTFREGA ALQFVGGASA AESAGVLIRV
     SRTVMRSSVV VFAAALPQHC DIAVTEVDAV QSSEVQLPHI RTNMLSVVLL QIVVLSASSL
     LVSNIKAHSL RYGALGLYST GTLTLLDGSS LYVQYCSFAG YTHMFYVNIL SVNDHSVFAL
     LNNTMSSGTS LLCQQQELSV SEHSVLRVVG NSGPVSNAIY SLSFFTVHYS SWLDWRYNDV
     GVGAMFHESE STFLIIDGSS VVTLTGCTMG STGLSVSLLK RADAGYRFVA GCLTVAGREV
     TTAAELALNG ITNVTTAAAC GECTKEGDCF APLTTAVIGC RCRCAAGGHG DVCLPAPVPA
     GPPPPPPPPP PPPPPPPVGE CISDMVYPEV AQTVGGGLSC LCYRNVTFSG GGMSLTVLVG
     AMTGDVVNVT FDGCTWRDGA VLLLLGNAYA AVGSLNIFVT RNTFSDALLS PEGVFPPHTN
     ITISGNRFTL TWLVPRSGLD IRRPSCVAMN GLVISNDSAV VLSGNVFQSV TTSSSAIYVV
     GSALRVSWHS LFAVVGNTFH MAGGDGTLIY LEGSSQSLSL SVLNNSAVVI RGNAVTRPVK
     CFVLLIWTLD VESHSAVVFQ GNEMQRSSVV FISGDSSNIY YNSWLQLSGN LCRESPSEAF
     AFLYPKVNLR DSTVSVSGNQ FVSSTVSPTM LRIPQSSRDL TNGAIVAACN TVNGEEEAKY
     AIPSVYNATI MTCSDPCTLA ASCFPAYTTT ASSDGCACAC AEGGYGDACL PVAVPEPPST
     DGADLCVRDV RVGVGVNAGF GTSVACYVGV TFATDVVVDV ESMSGSMRNV TLVNCLFVGG
     ASLYVVGWRS DPPAGERADV LISGLESCSG GGVVVASRFS PGSRVTVVDS VLIAEARVAY
     RGAYGLGDAS ACLVLHNVNL TGSVLTIART HVAAVFRDAV GVLFVGGVAL SSRGALYVDG
     LSVQTALGLC VSVEGGVAAS GGSVVAFVDS DFLLCKHAVS VRGAVSVSGS AVALVRSEFL
     STEDYAVAFY STVSLDGGSM LLAKGNVHDG ASREMLYAAG AVTAAGSTLS FVRNRALLPR
     MVSLSLSLAS GAHLRVACNY AGGRILSTAE EYAAAGFGDA GSIDVAGCDA CDRDTHCYTP
     GTASASMRNG VCVCLCGSSG HGEACVPVGA PALPPAVGTA SSVFFREGVT LRSVFVVPAG
     ASEVTLRHVV LDGVSPVLYV PWMARDGVRI VVQDVSLLNG AVLYVMGGGA LRGAVAAGSD
     ESGPVELSVC EVEALNGALV LSGTFPAGSV LTVTDSLLVA ARPTPLVYLP GSQSSPYAPV
     LVLSGLRLMR SVLVLFGVAL VTVMTGGRTV VVDGAVLVLV GGGVTLDAAV FGGDYALYAS
     ARVVASEGAV LRVSGSQVYA AHGLVFGSGV EANASAVVVN DNTGALTDGA LLELRGSASF
     ASGSWLSVRG NSISGRLLSV PSYPRRAELV KSTLTLHGNA GSGPVVMDGT VALGGAGRKF
     VVGCLTLNGQ ALQPMDYRSA GIIGEFRPVA CGVCDADVRC FAAATRTMSG SCRCRCAEGG
     YGRDCLPVYL PHVDGCNRTP APPPLSHTAT LTETRSLTPT WTATQTPSLS TAHYSPPWYG
     PTETLQGTET VALSPTRTPT ASVSSTPWWS DVACPTLAVT TTSAGGSLTQ NDIRGGGSAV
     PTRLMVALPP PFRWARDPQL GTHLSFVPVS TAQPRGFGGP WGAMLSNATW VRNATNPSTV
     LELAVPVHRG YFIAADETIV IRCDAAAVSG GCRGVLLGSF TIRSATLPAA ASALSAITGV
     VAGAAAVAVV VTGGLGSILE MQALGVFARM SCASAQERAS TVALPCFLSV FAALDPLWMV
     VGNALLAAVF GCVHCGVTAA FQRWRGVDAA SAWAAMRFPS LTYVVAHAMH LGIFFGSVLA
     LAMPDARGHH RVIGVVGVLY GVAFPAGVCY LIARHAGASF TRYWQFLRKP LHERLLYPVG
     YWHPAAQQRM YGGMLTNVRG SRVYWCVFQL SVLCVVCLIA AVQSPVGGCD VQYFCMAAVL
     VAGAGVVAFT NMMRSSFLTV MHTASFVLLA ALCVVSAANH LAPSDGGARA YAAIVLLLTT
     LLLAVTVYSM VVWHAEDRHW QELREPRRGG LEALLRDDEE SDEETQKPHD RTSFSYAPGT
     TVASSYRPPA PP
//
DBGET integrated database retrieval system