ID Q4E0A4_TRYCC Unreviewed; 3463 AA.
AC Q4E0A4;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN98217.1};
GN ORFNames=Tc00.1047053509287.240 {ECO:0000313|EMBL:EAN98217.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN98217.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN98217.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN98217.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN98217.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000066; EAN98217.1; -; Genomic_DNA.
DR RefSeq; XP_820068.1; XM_814975.1.
DR STRING; 353153.Q4E0A4; -.
DR PaxDb; 353153-Q4E0A4; -.
DR EnsemblProtists; EAN98217; EAN98217; Tc00.1047053509287.240.
DR GeneID; 3552625; -.
DR KEGG; tcr:509287.240; -.
DR eggNOG; ENOG502SEI3; Eukaryota.
DR InParanoid; Q4E0A4; -.
DR OrthoDB; 131534at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR Gene3D; 2.160.20.10; Single-stranded right-handed beta-helix, Pectin lyase-like; 1.
DR InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR012334; Pectin_lyas_fold.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24033:SF151; PROTEIN EYES SHUT; 1.
DR Pfam; PF11024; DGF-1_4; 1.
DR Pfam; PF11038; DGF-1_5; 1.
DR Pfam; PF11040; DGF-1_C; 1.
DR SMART; SM00710; PbH1; 15.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 3052..3078
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3123..3140
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3160..3179
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3191..3212
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3262..3283
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3289..3308
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3315..3337
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3349..3374
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 2837..2910
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11024"
FT DOMAIN 2923..3200
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11038"
FT DOMAIN 3372..3457
FT /note="Dispersed gene family protein 1 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11040"
FT REGION 3396..3436
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3407..3424
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3463 AA; 359559 MW; C8CE7F271671ED55 CRC64;
MWSVVGACPR SRHRVRCRAI AAMRVALLFL LVLSAAAAFM PAVHAVVLRL RGGTVDRAIT
VGRAVDTVLM DGVYITNGVA VVFDVPAMLP GALRIELRNC VCDGGAQIYV RGDSGEPASD
RSLEVSVSGL SGGYCSLVFV HNLPAHTNVT VRDSTIVMPG PMRYSQLSGL ADAVASPLVL
HATSLLQTQL RVSNTVLRSL QAGGSAVYVG GGVDLLSSAV VLDGVSLEAS GGPTASAMRV
ASSSRLSLRS HSVFSVTNVS VVSSGGGIVL GERLAVFDSV LRFVRVEGSV ASSLVRCGGG
TVGSGGWLEL HDVWAVGEAS SVASLSGVTL GGGAVSIARC VATGATLVSG LTITSGVVSV
QCNRAGGRVL QSSGDYRMAG LPSVSVVPCD GCAAALTCFD ALTASFSDCV CGCRAGGVGD
ACLPFDVPPA RAGGGGGGAQ DCVSGVTLTE SVTVGGGRAT ACFDSVVFSG PITVAVDLRS
MDAFADSLNV TLRHCVLAGG AQLRIGGLSE STARLMPHAL VNMTNVTSLE GTIVLHGAMP
LHSSVLLANS TLRATVGGSQ YVPTTPGHAG SRYGPALVLD GVRLLSTRFV MTRSTLFCGG
DLCAAILVER GLDVNLSSVF YMDNCAVRSR EHVMYALASD LRVSGGSVFS IQNSSWIASS
IEIYEGACVF RDVVLDGGSV LQVVSSNFRL GFAMLIAETL TVTGGSWLVH RDNEFRAAYV
LYVIKENGVA FCDRSVWSIL YNKLTYGLYS SFTHMTSNWS PPSDSRPIIY GVCNEARGSP
VTDYGDDLNI WVPVTVLDCG ACTVDAVCFA ARTSSIRGCE CECAAGGHGD TCVPAAVPDG
LGPLPLPDAK DTEVRCVHGG SISSVDYPDP GVRGLCFVNV TFTAAIVLDL WSFNAPQQTL
NITLLQCVLM GLSIGGSGAR VHVNVTSSML DSGALEFEGD FGASSQILVA GSTLVAKSDH
AILFMEFTFG ANMTLLLIDN YIEANRYAIY FFISVVVFDG GGIIVKGNTL STTEEDDGVE
SSVCVNAVGL KNGGYIDVEN NMMRSVNGVN LLGDTTVGSA GLLRVADCTF VGGTEFFDPA
LVYVSGSVTL EGGAQWRVEG NSVSAASVLS IPYSWQNIQL LGSDTTVVLA HNRQVDDSFP
VADLSLPQTI VELSARFLVG CNLQDDEEVS YDDVLPEGMD VLLFGCGTCN DDAACYMPGT
ESVDRGSCSC SCKDGWHGAS CLPFEVPDTV VPPVAERAVD GDTSCVVNQT LTSLTLNMWK
THHCYVGVTF SGVGAVLTFS FNSMPLHLPI NITLTGCTFL DGAVLQFVGD VEVAESAGVL
IRVSRTVMRS SVVAFALALP QHCDIAVTEV DAVQSSAVEL PGTVSRTLSV LFLKNVVLTA
SSLLVSNVNA RASKRDAFGL YSIGTLTLVG GSSLYARYCS FDGYTHLFYL DILSVSDRSV
FALLNNTVFF GTSLLYQYRG FSVSDHSVLR VVGNSGSVSC AIYSLNSWTV QRSSWLDWRD
NDVGVGAMLH DSESVFVGID GSSVVTLTGC KMGSTGLSRP LLSQSDAGYQ FVAGCLTVAG
REVTTAAELE LHCITNVTTV AVCAECTKDG DCFAPLTTAV IDCRCQCAAG GHGDVCVPVP
VPAGPPPLPP PPVPPTPPPP PVGECISDMV YPEVAHAVGD GLSWLCYRNV TFSGGGMSLT
VLVGAMTGDV ANVTFDGCTW RDGAVLLLLG NAHAAVGSLN IVVTGNTFRD ALLSPEGVFP
PRTNITISGN RFTVTRLIPR SGLGLRKPSC VAMNELAISN DSAVVLSGNV FQSVTASSSA
IHVVESSLRV SWHSVFAVVG NTFHMAGGDG TLIHLEGSSE SSSLSVLNNS AVVIRGNLVS
RPVKHFILFF WASRVESLSA VVFQGNDMQG SLVVFYSAFS SHIYYNSWLQ LSGNLCRESP
SEAFAVFNPT VNLRDSTLTV SGNQFMSNNG VAKALWIFSE SADPTNGAIV AACNSVNGGE
GANYAIPSVY NATILTCIDP CTLATSCFPA YTTTVSSDGC ACRCAEGGHG DACLPVAVPE
PPSTDGADLC VRDVRVGVEV NAGFGTSVVC YVGVTFAVDV VVDMDLMTGT VRNVTLANCT
LVGGASLYVL GWRSDPPAGL CAEVLISGLE SRSGGGVVVA NRFPPGSRVT VVDSVLIAEN
RVAYRGAYDL GDASACLVVH NVNLTGSVLT IARTQVKAVF RDAVGVLVVG GVALSSRGAL
YVDGLLVQTA LGLCVSVEGG VAASGGSVVA FVDSVFLLCK HAVSVRGAVS VSGSAVAFLR
SDFSLTEDYA VAFYSTVSLA GGSMLLAKGN VHDGVSREML HAAGAVTAAG STLSFVRNRA
LLPRMLSLSL SLAAGAHLRV ACDDAGGRVL LTAEEYAAAG FGDAGSIDVA GCDACDRNTH
CYAPGTASAS MKDGVCVCAC GSGGYGEACM PVGAPALPPA VGTASRVFVR EGVTVQSVFV
VPAGASEVTL RHVVLDGVSP VLYVPWMARD GVRIVVQNVS LLNGAVLYVM GGGVLRGAGT
AGSDESGPVE LSVCDVEALN GAIVLTGTFP AGSVLTVADS LLVAARPTPL VYLPGSQSSP
YAPVLVLSGL RLVRSVLVVS GVGLVTVMTG GRTVVVDGAV LELVVGGVAL DAAVLGGEYA
LYASARVVAS GGAVLRVSGS QVYAAHGLVF GSGVVANASA VVVNDNAGAL TDGALLELRG
SASFLSGSWL SVRGNSISGR LLSLPSYPRS ADLVQSTLTL YGNAGSGPVV MDGTVVLVGA
GRRFVVGCLT LNGQALQPME YRSAGIIGEF RPVACSVCDA DVRCFAAATM SMSGSCGCRC
AEGGYGRDCL PVHLPHVDGC NRTPAMPLLS RTETLTETCS LTPTWTATPK PNLSRTQYGP
TETLQVTETV APPPTRTPTA SVSSTLWWSD VACPTLAVTT TAAGGGLTQN DIRGGGSAVP
TLLMVALPPP FRWARDPQLG THLSFFPVST AQPRGFGGPW GAMLRNATWV RNATNPSTVL
ELAVPVHRGY FIAADETIVI RCDAVAVFGG CKGVLLGSFT IRSDMLPAAA SVLSAITGVV
AGAAAVAVVV TGGLGSILEM QALGVFARMS CTSAQERAST VALPYFLSVF AALDPLWMVV
GNALLAAVFG CVHCGVTAAF QRWRGVDAAS AWAAMRFPSL TYVVAHAMHL GIFFGSVLTL
AMPDARVQHR VIGVVGVLYG AAFPAGVCYF IARHVGASFT RYWQFSRKPL HERLLYPVGY
WHPTAQQRMY GGMLTNMRGS RVYWCVFQLL VLCVVCLIAA VHPPVGGCHV QYFCMAAVLV
AGAGAVAFTN MMRSAFLTVM HTAGFALLAL LCLVSAANHL APSDGGARAY AAIVLLLTTV
LLAVAVYSVV VWYAEDRHWQ ELREPRRGGL EALLRDDEES GDETQKPHEM TSSSYASGTT
VASSYRPPAP PRPVAGDIRS DALSLFDRAS SASCSINYVT LDR
//