ID Q4DKL6_TRYCC Unreviewed; 3427 AA.
AC Q4DKL6;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 62.
DE SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN93067.1};
GN ORFNames=Tc00.1047053508607.30 {ECO:0000313|EMBL:EAN93067.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN93067.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN93067.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN93067.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN93067.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000379; EAN93067.1; -; Genomic_DNA.
DR RefSeq; XP_814918.1; XM_809825.1.
DR STRING; 353153.Q4DKL6; -.
DR PaxDb; 353153-Q4DKL6; -.
DR EnsemblProtists; EAN93067; EAN93067; Tc00.1047053508607.30.
DR GeneID; 3546541; -.
DR KEGG; tcr:508607.30; -.
DR eggNOG; ENOG502SEI3; Eukaryota.
DR InParanoid; Q4DKL6; -.
DR OrthoDB; 176640at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR Gene3D; 2.160.20.10; Single-stranded right-handed beta-helix, Pectin lyase-like; 1.
DR InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR012334; Pectin_lyas_fold.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24033:SF151; PROTEIN EYES SHUT; 1.
DR Pfam; PF11024; DGF-1_4; 1.
DR Pfam; PF11038; DGF-1_5; 1.
DR Pfam; PF11040; DGF-1_C; 1.
DR SMART; SM00710; PbH1; 14.
DR SUPFAM; SSF51126; Pectin lyase-like; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 3048..3074
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3112..3136
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3156..3177
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3183..3208
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3258..3279
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3285..3304
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3311..3333
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3345..3370
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 2831..2906
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11024"
FT DOMAIN 2919..3196
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11038"
FT DOMAIN 3368..3426
FT /note="Dispersed gene family protein 1 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11040"
FT REGION 2847..2869
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3427 AA; 355634 MW; DA0B8E7C80EE7A84 CRC64;
MRSAVGACPR SRHRLQRSAI AAVRVALLVA LVLAAAAAWM PAVHAVVLRL RGGTVDRAIT
VGRAVDTVLM DGVSITNGVA VVFDVPAMLP GALRIELRNC VCDGGAQLYV RGYSGEPASD
RSLEVSVSGL SGGYCSLVFV HSLPAHTNVT VRDSTIVTAG PMRYSQLSML TDAVASPLVL
HSTSLLQTQL RVSNTLLRSL HTGGTAVYVG GGMDLLSSAV VFDGVLLEAS GGPTASAMHV
ASSSRLSLRS HSVFSVTNVS AVSSGGGIVL GERLAVVDSV LRFVGVEGSV ASSLVRCDGG
TVGGGGWLDL HDVWAVGEAS SVASLSGVTL SGGTVSIARC TATGATLVSG PTITSGVVSV
QCNRAGGRVL RSSGDYRLAG LPSVSVVPCD GCAAALACFD ALTASFSDCV CSCRAGGVGE
ACLPFDVPPA RAGGGGDTAE GCVSGVTLTE SVTVGGGRAT ACFDSVVFSG PITVAVDLRS
MDAFADALDV TLRHCVLAGG AQLRIGGLSE STARLMPHAL VNMTNVTSLE GTIVLHGAMP
PHSSVLLANS TLRATVGGSQ YMPTTPGFEG FRYGPALVLD GVRLLSTRFV MTRSTLLCGG
GSCSAIVVER GLGVKLSSVF YMDNCAVRSQ THVMYAFGSD LRVVGGSVFS IQNSLWSAPS
INMYEGACVF RDVAVDGGSV LQMVSSNFRL GFAMLMATTL TVTDGSWLVH RDNEFRTAYV
VYVTNYYGVA FHDRSVWSIL HNNFTYGSYS STTAYMTSRW SPPSDTRPII YGMCNEARGS
LVTDYGEDLN IGAPVTVLDC GACTVDAVCF AARTSSISGC ECVCAADGYG DTCLPAAVPD
GLGPLPLPDA RDTEVRCVHG GSISSVDDPD PGVRGLCFVN VTFTAAIVLE LSHFDAPQQT
LNITLLQCVL VGLSIKGNGA RVHVNVTSSM LDAGDLEFRG DFGASSQILV AGSVLFSTSV
YAIAFLDFVL GANSSLLLLY NNVEGNDYVV CFAVAAVVDG GGIILKGNTL RATEVKGMES
SVYVYAVEVK NGGHFDVENN TMRAVNGIYF SDENTVSSAG LLRVADCDFV GSMELFDSAL
LYVSGSVTLE GGAQWRVEGN SVSAASVLSM TYSQSRIQLS GSGTTVVLAH NRQLDDIYPF
ADLSLPQTIV ELPARIVVGC NLQGGEEASY DDVFPEGVVV FRCGTCNDDA ACYMPGTESV
DRSSCSCSCK DGWHGASCLP FEVPDTVVPP VAERAVDGDT SCVVNQTLKI LTLNMWKTHH
CYVGVTFSGV GAALTFFLDS MPLHLPINIT ITGCTFRDGA ALQFVGGAAA AESSGVLIRV
SQTVMRSSVV VFSLALPKHC DITVTEVDAV QSSIVFWPDT VNKRLSVVAL DDVELTASSL
LVSNVRAHAL RYGGYGLYSI GKLMLVAGSS LYVRYCSLDG YTHLFDINIL IVRDHSVFVL
LKNTMASGKT LLYQCLEFSV SDHSVLRVVG NSGPVSYAIF AEDSWTVQES SWLDLRENDV
GVGAMFHESK STFLIIDDSS VVTLTGCKMG STGLSVSLLS QADAGYRFVA GCLTVAGRVV
TTAAELALHG ITNVTAVAAC GECTKYGDCF APLTTAVIDC KCQCAAGGHG DVCVPAPVPA
GPPPPPVPLT PPHPPVGECI SDMVYPEVAQ AVGSGLSWLC YRNVTFSGGG MSLTVLIGAM
AGDVANVTFD GCTWRDGAVL LLLGNVHAAV GSLNIVVTGN RFIDALLIPE GVFPQHTNIT
ISGNRFTVTR LIPRSGLDIR RPSCVAMNGL AISNDSAVVL SGNVFQTVTA SSGVIHVFRS
ALRVSWYSVF AVVGNTFHMA GVNGTLIHLE GSMHSSSLSV LNNSAVVIRG NVVTRPVQCV
IFFRWALSVE SQSSFLFQGN DMQGSSVVFY SNFLSCIYYS SWLQLSGNLC RESPSEAFLF
LNPEVNLRDS TVSVSGNQFM SSTVTPTVLR ISEVSSVLTN GAIVAACNTV NGEEEANYVV
PSVYNATILD CSDPCALAAS CFPAYTTTVS SDGCACTCAE GGHGDACLPV AVPEPPSTDG
ADLCVRDVRV DVEVNAGLGT SVACYVGVTF AADVVVDVES MSGSVRNVTL SNCTFVSGAS
LYVVGWLSDP PAGERADVLI SGLESRFGGG VLVANRFPPG SRVTVVDSVL IAERRVAYRG
AYGLGDASAC LVVHNVNLTG SVLTIARTHV AAVFRDAVGV LVVGGVALSS RGALYVDGLL
VQTALGLCVS VEDGVAASGG SVVAFVDSDF LLCKRAVSVR GAVSVSGSAV ALVRSDFSLT
EDYAVAFYST VRLAGGSMLL AKGNVHDGVS REMLYAAGAV TAAGSTLSFV RNRALLPRML
SLSLSLSAGA HLRVACNDAG GRVLSTAEEY AAAGFGDAGS IDVAGCDACD RDTHCYAPGT
ASASMKNGVC VCACGSGGYG EACVPVGAPA LPPAAGTASS VFFREGVTVQ SVFVVPAGAS
EVTLRHVVLD GVSPVLYVPW MARDGVRIVV QNVSLRNGAV LYVMGGGALR GADAAGSDGS
GPVELSVCDV EALNGALVLT GTFPAGSVLT VTDTLLVAAR PTPLVYLLGS QSSPYAPVLV
LSGLRLVRSV LVVSGVALVT VMTGGRTVVV DGAVLELVGG GVALDAAVFG GEYALYASAR
VVASWGAVLR VSGSQVYAAH GLVFDSGVVA NASAVVVNDN AGAVTDGALL ELRGSASFAS
GSWLSVRGNS ISGRLLSVPP YPRSAELVQS TLTLYGNAGS GLVAMDGTVA LGGAGRKFVV
GCLTLNGQAL QPMDYRSAGI IGEFRPVACG VCDADVHCFA AATRAMSGSC GCRCAEGGYG
RDCLPVYLPH VDGCNRTPAM PPLSRTATLT ETRSPTPSST PSMSTTQYGP TETLQVTETL
QVTEAVALSP TRTPTASVSS TLWWSDVACP TLAVTTTAAG GSLTQNDIRG GGSAVPTRLM
VALPPPFRWA RDPQLGTHLS FVPVSTAQPR GIGGPWGAML SNATWVRNAT NPSTVLELAV
PVHRGYFIAA DETIVIRCDA AAVFGGCKGV LLGSFTISSN TLPAASSALS AITGVVAGAA
AVAVVVTGGL GSILEMQALG VFARMSCTSA QERASTVALP YFLSVFAARD PLWMVVGNAL
LAAVFGCVHC GVTAAFQRWR GVDAASAWAA MRFPSLTYVV AHAMHLGIFF GSVLALAMPG
ARVQHYVIGV VGVLYGVAFP AGVCYLIARH TGASFTKYWQ FSRNPLHERL LYPVGYWHPA
AQQRMYGSML TNMRGSRVYW CVFQLSVLCV VCLIAAAHPP VGGCHVQYFC MAAVLLAGAG
VVAFTNMMRS AFLTVMHTAS FVLLAALCLV SAANHFAPSD GGARAYAAIV LLLTAVLLAV
AVYSVVVWYA EDHHWQELRE PRRGGLEALL RNDEESDEET QKLHEMTLSS YASGATAASS
YRPPAPP
//