ID Q4D6V4_TRYCC Unreviewed; 3467 AA.
AC Q4D6V4;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN88256.1};
GN ORFNames=Tc00.1047053510367.10 {ECO:0000313|EMBL:EAN88256.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN88256.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN88256.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN88256.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN88256.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000910; EAN88256.1; -; Genomic_DNA.
DR RefSeq; XP_810107.1; XM_805014.1.
DR STRING; 353153.Q4D6V4; -.
DR PaxDb; 353153-Q4D6V4; -.
DR EnsemblProtists; EAN88256; EAN88256; Tc00.1047053510367.10.
DR GeneID; 3540819; -.
DR KEGG; tcr:510367.10; -.
DR eggNOG; ENOG502SEI3; Eukaryota.
DR InParanoid; Q4D6V4; -.
DR OrthoDB; 176640at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF11024; DGF-1_4; 1.
DR Pfam; PF11038; DGF-1_5; 1.
DR Pfam; PF11040; DGF-1_C; 1.
DR SMART; SM00710; PbH1; 9.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 3055..3081
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3126..3143
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3163..3183
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3195..3215
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3265..3286
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3292..3311
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3318..3340
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3352..3377
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 2839..2913
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11024"
FT DOMAIN 2926..3203
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11038"
FT DOMAIN 3375..3461
FT /note="Dispersed gene family protein 1 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11040"
FT REGION 1623..1643
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3467 AA; 360094 MW; 52C6DA9EA35CE7D5 CRC64;
MRSAEGACPR SRHRLGRRAI AAVRVALLVV LALVAAAAWM PAVHAVVLRL RSGTVDRAIT
VGRAVDTVLM DGVSITNGVA VVFDVAAMLP GALRIELRSC VCDGGAQIYV RGYSGEPASD
RSLEVSVTGL SGSYCSLVFV HNLPAHTNVT VRDSTIVTPG PMRYSQLSGL TDAVASPLVL
YATSLLQTQL RVSSTVLRSL HVGGSAVYVG GGVDLLSSAV VLDGVLLEAS GGPTASAMHV
ASSLPFSLRS HSVFSVTNVS VVSSGGGFVL GECLAAIDSV LRFVGVEGSV ASSLVRCDGG
TVGAGGWLEL HDVWAVGEAS SVASLSGVTL SGGAVSIARC AATGATLVSG PTITSGVVSV
QCNRAGGRVL QSSGDYRMAG LPSVSVVPCD GCAAALACFD ALTASFSDCV CSCRAGGVGN
ACLPFDVPPA RSGGGGGGAQ DCVSGVTLTE SVTVGGGQAT ACFDSVVFSG PITVAVDLRS
MDVLADALNV TLRHCVLAGG AQLRIGGLSE STARLVPHAL VNMTNVTSVE GTIVLHGAMP
QHSSVLLANS TLRATVGGSQ YVPTTRGHAG FWHGPALVLD GVRLLSTRFV MTRSTLVCGG
GSCAAILVER SLGVNQFSVF YMDNCAVISR THVMYALASD LRVSGGSVFS IQNSLWSAPS
TEYYKGGCVF GDVAVDGGSV LQIVSSEFRL GFAMLMANRL TVNGGSWLVH RDSEFRTAYV
VYVVHENGVA FHDRSVWSIL HNKFTYGSYS STIAHMTSNW LPPSDSRPII YGVCNELRGS
PVTDYQDDLN IGTPVTLLDC GACTMDAVCF AARTSSISGC ECVCAAGGHG DTCLPAAVPD
GLGPLPLPLP DAEDTEVRCV HGGSISSVDV PDPGVRGLCF VNVTFTAAIV LDLLYFDAPE
QTLNITLLQC ILMGLLIKGS GARVHVNVTS SMLDSGALEF RGDFGTSSQI LVAGSTLVTT
SSHAIAFLDF SFGANSTLLL LDNNIEANIF AVCFPVAVVV EGGGIILKGS TLRTKKKDYR
STSAVYYNGV YVRNGGYIDV ENNTMNAVNG VYFFGDTYVS SAGLLRVADC TFVGSTRVLN
CALAYFDGSV ILQGGAQWRV EGNNVGAASV LTVVYSWQKI RLLGSGTTVV LAHNRQVDRG
MAFASAVSSN TIVALPARFV VGCNLQGDEE VSYDGVFPED VMVFRCGTCN DDAACYIPGT
ESVDRSSCSC SCKDGWHGAS CLPFEVPHTV VPPVAERAVD GDTSCVVNQT LTSLTLNMWK
THHCYAGVTF SGVGAALTFF LDSMPLHLPI NITFTGCTFR EGVALQFVGG IVAVESSGVL
IRVSQTVMRS SVVVFALALP QYCDIAVTEV DAVQSFEVQL PHTRRNKLSV FLLKNTVLSA
SSLLASNVKA YGSGYRGLGL YTTGTLTLVS GSSLYVRYCS FDGYLHLLYV YILSVSDHSV
FALLNNTMSS GESLLYLRHG LSVSEHSVFR VVGNSGIVAC AIYAEDLWTV QRSSWLDWRD
NDVGLGAFFH KPESASFSID SSSVLTLTGC KMGFTGLSVS LLSQADAGYR FVAGCLTVAG
REVTTAAELE LHGITNVTTA AACGECTKDG DCFAPLTTAV IDCKCRCAAG GHGDVCAPAP
VPAGPPSPPP PPPPSSLPPP PPPVGECISD MVYPEVAQAV GSGLSWLCYR NVTFSGGGMS
LTVLIGAMTG DVANVTFDGC TWRDGAVLLL AGNAHAAVGS LSIVVTGSTF DDALLSPEGV
FPPHTNITIS GNRFTVTRLI PRSGLDLRRP ACVAMNELVI TNDSAVVLSG NVFQTVRALS
SAICVLRYAL SVSWHSVFAV VGNTFRMAGA NSTLIHLEGS SQTSSLDVLN NSAVVIRGNI
VTRPVRDFIS LLWVLRVESQ SAVVFQGNDM QGGLAVFYSS FASHIYYNSW LQLSGNLCRV
SPVIAFAVFN SRVNLRDSTV SVSGNQFISS TVSPTLIKIP KRPRDLTNGV VVAACNTVNG
GEGAKYAIPS VYNATILTCS DPCALATSCF PAYTTTASSD GCACTCAEGG HGDACLPVAV
PESSSTDGAD LCVRDVRVDV ELKAGFGTLV ACYVGVTFAA DVVVDVESMT GSVRNVTLAN
CTFVDGASLY VVGWLSDPPA GERADVLISG LESRSGGGVL VANRFPPGSR VTVVDSVLIA
ETRVAYRGAH DLGDASACLV LHKVNLTGSV LTIARTQVVA VSRDAVGVLF VGGVALQSRG
ALYMDGLSVQ TALGLCVSVE GGVMASGGSV VAFVDSEFLL CKHAVSMRGA VSVLGSAVAL
VRSEFSSTED YAVAFYSTVS LAGGSMLLVK GNVHDGVLRE MLYATGAVTA AGSTLSFVRN
RALLPRILSL SLLLSAGAHL RVACNDAGGR VLLTAEEYAA AGFGDAGSID VAGCDACDRD
THCYAPGTAS ASMRNGVCVG ACGGGGYGEA CVPVGAPALP PAVSTASSVF VREGVTVRSV
FVVPAGASEV TLRHVVLDGV SPVLYVPWMA RDGVRIVVQN VSLLNGAVLY VMGVGALRGA
GAAGSDEGGP VELSVCDVEA LNGALVLTGT FPAGSVLTVT DSLLVAARST PLVYLPSSQF
SPYAPLLVLS GLRLVRSVLV VSGVALVTVM TGGRTAVVDG AVLELVGGGV ALDAAVFGGD
YALYTSARVV ASAGAVLRVS GSQVYAAHGL VFDSGVVANA SAVVVNDNAG ALTDGALLVL
RGSASFVSRS WLSVRGNSIS GRLLSVPSYP RSVEFVQSTL TLHGNAGSGP VVMDGTVALV
GAGRKFFVGC LTLNGQPLQP MDYRSAGIFG EFRPVACGVC DADVHCFAAA TRAMSVSCGC
CCAEGGYGRD CLPVYLPHVD GCNRTSGMPL LSHTATLTET RSLTPTPTPS LSAAHYSPAQ
YGPTETLQVT ETVALSPTRT PTASVSSTLW WSDVACPTLA VTTTAAGGSL TQNDIRGGGS
AVPTRLMVAL PPPFRWASDP QLGTHLSFVP VSTAQPSGFG GPWGAMLSNA TWVRSATNPS
TVLELAVPVH RGYFIAVDET IVIRCDAVAV FGGCKGVLLG SFTIRSNTLP AAASAFSAIT
GVVAGAVAVA VVVTGGLGSI LEMQALGVFA RMSCASAQER ASTVALPYFL SVFAALDPLW
MVVGNALLAA VFGCVHCGVT AAFQRWRGVD AASAWVAMRF PSLTYVVAHA MHLGIFFGSV
LALAMPDARV QHRVVGVFGV LYGAAFPAGV CYFIARHTGA SFTRYWQFSR KPLHERLLYP
VGYWHPAAQQ RMYGGMLTNM RGSHVYWCVF QLSVLCVVCL IAAVHPPVGG CHVQYFCMAA
MLLAGAGVVA FTNMMRSAFL TVMRAASFVL LALLCVISAA NHLAPSDGGA RAYVAIVLLL
TAVLLAVTVY SVVVWYAEDR HWQELREPRR GGLEALLRDD EESDEEARKP HDMMSSSYAI
GTTVASSYRP PAPPLQLMAA DTRSDALSLL DRVSSASCSI NYALLDR
//