GenomeNet

Database: UniProt
Entry: Q4D6V4_TRYCC
LinkDB: Q4D6V4_TRYCC
Original site: Q4D6V4_TRYCC 
ID   Q4D6V4_TRYCC            Unreviewed;      3467 AA.
AC   Q4D6V4;
DT   13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2005, sequence version 1.
DT   27-MAR-2024, entry version 61.
DE   SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN88256.1};
GN   ORFNames=Tc00.1047053510367.10 {ECO:0000313|EMBL:EAN88256.1};
OS   Trypanosoma cruzi (strain CL Brener).
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX   NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN88256.1, ECO:0000313|Proteomes:UP000002296};
RN   [1] {ECO:0000313|EMBL:EAN88256.1, ECO:0000313|Proteomes:UP000002296}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CL Brener {ECO:0000313|EMBL:EAN88256.1,
RC   ECO:0000313|Proteomes:UP000002296};
RX   PubMed=16020725; DOI=10.1126/science.1112631;
RA   El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA   Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA   Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA   Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA   Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA   da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA   Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA   Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA   Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA   Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA   Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA   Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA   Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA   Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA   Andersson B.;
RT   "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT   disease.";
RL   Science 309:409-415(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN88256.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAHK01000910; EAN88256.1; -; Genomic_DNA.
DR   RefSeq; XP_810107.1; XM_805014.1.
DR   STRING; 353153.Q4D6V4; -.
DR   PaxDb; 353153-Q4D6V4; -.
DR   EnsemblProtists; EAN88256; EAN88256; Tc00.1047053510367.10.
DR   GeneID; 3540819; -.
DR   KEGG; tcr:510367.10; -.
DR   eggNOG; ENOG502SEI3; Eukaryota.
DR   InParanoid; Q4D6V4; -.
DR   OrthoDB; 176640at2759; -.
DR   Proteomes; UP000002296; Unassembled WGS sequence.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR   InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR   InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR   InterPro; IPR006626; PbH1.
DR   InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR   PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF11024; DGF-1_4; 1.
DR   Pfam; PF11038; DGF-1_5; 1.
DR   Pfam; PF11040; DGF-1_C; 1.
DR   SMART; SM00710; PbH1; 9.
DR   SUPFAM; SSF51126; Pectin lyase-like; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        3055..3081
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3126..3143
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3163..3183
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3195..3215
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3265..3286
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3292..3311
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3318..3340
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3352..3377
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          2839..2913
FT                   /note="Dispersed gene family protein 1"
FT                   /evidence="ECO:0000259|Pfam:PF11024"
FT   DOMAIN          2926..3203
FT                   /note="Dispersed gene family protein 1"
FT                   /evidence="ECO:0000259|Pfam:PF11038"
FT   DOMAIN          3375..3461
FT                   /note="Dispersed gene family protein 1 C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11040"
FT   REGION          1623..1643
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3467 AA;  360094 MW;  52C6DA9EA35CE7D5 CRC64;
     MRSAEGACPR SRHRLGRRAI AAVRVALLVV LALVAAAAWM PAVHAVVLRL RSGTVDRAIT
     VGRAVDTVLM DGVSITNGVA VVFDVAAMLP GALRIELRSC VCDGGAQIYV RGYSGEPASD
     RSLEVSVTGL SGSYCSLVFV HNLPAHTNVT VRDSTIVTPG PMRYSQLSGL TDAVASPLVL
     YATSLLQTQL RVSSTVLRSL HVGGSAVYVG GGVDLLSSAV VLDGVLLEAS GGPTASAMHV
     ASSLPFSLRS HSVFSVTNVS VVSSGGGFVL GECLAAIDSV LRFVGVEGSV ASSLVRCDGG
     TVGAGGWLEL HDVWAVGEAS SVASLSGVTL SGGAVSIARC AATGATLVSG PTITSGVVSV
     QCNRAGGRVL QSSGDYRMAG LPSVSVVPCD GCAAALACFD ALTASFSDCV CSCRAGGVGN
     ACLPFDVPPA RSGGGGGGAQ DCVSGVTLTE SVTVGGGQAT ACFDSVVFSG PITVAVDLRS
     MDVLADALNV TLRHCVLAGG AQLRIGGLSE STARLVPHAL VNMTNVTSVE GTIVLHGAMP
     QHSSVLLANS TLRATVGGSQ YVPTTRGHAG FWHGPALVLD GVRLLSTRFV MTRSTLVCGG
     GSCAAILVER SLGVNQFSVF YMDNCAVISR THVMYALASD LRVSGGSVFS IQNSLWSAPS
     TEYYKGGCVF GDVAVDGGSV LQIVSSEFRL GFAMLMANRL TVNGGSWLVH RDSEFRTAYV
     VYVVHENGVA FHDRSVWSIL HNKFTYGSYS STIAHMTSNW LPPSDSRPII YGVCNELRGS
     PVTDYQDDLN IGTPVTLLDC GACTMDAVCF AARTSSISGC ECVCAAGGHG DTCLPAAVPD
     GLGPLPLPLP DAEDTEVRCV HGGSISSVDV PDPGVRGLCF VNVTFTAAIV LDLLYFDAPE
     QTLNITLLQC ILMGLLIKGS GARVHVNVTS SMLDSGALEF RGDFGTSSQI LVAGSTLVTT
     SSHAIAFLDF SFGANSTLLL LDNNIEANIF AVCFPVAVVV EGGGIILKGS TLRTKKKDYR
     STSAVYYNGV YVRNGGYIDV ENNTMNAVNG VYFFGDTYVS SAGLLRVADC TFVGSTRVLN
     CALAYFDGSV ILQGGAQWRV EGNNVGAASV LTVVYSWQKI RLLGSGTTVV LAHNRQVDRG
     MAFASAVSSN TIVALPARFV VGCNLQGDEE VSYDGVFPED VMVFRCGTCN DDAACYIPGT
     ESVDRSSCSC SCKDGWHGAS CLPFEVPHTV VPPVAERAVD GDTSCVVNQT LTSLTLNMWK
     THHCYAGVTF SGVGAALTFF LDSMPLHLPI NITFTGCTFR EGVALQFVGG IVAVESSGVL
     IRVSQTVMRS SVVVFALALP QYCDIAVTEV DAVQSFEVQL PHTRRNKLSV FLLKNTVLSA
     SSLLASNVKA YGSGYRGLGL YTTGTLTLVS GSSLYVRYCS FDGYLHLLYV YILSVSDHSV
     FALLNNTMSS GESLLYLRHG LSVSEHSVFR VVGNSGIVAC AIYAEDLWTV QRSSWLDWRD
     NDVGLGAFFH KPESASFSID SSSVLTLTGC KMGFTGLSVS LLSQADAGYR FVAGCLTVAG
     REVTTAAELE LHGITNVTTA AACGECTKDG DCFAPLTTAV IDCKCRCAAG GHGDVCAPAP
     VPAGPPSPPP PPPPSSLPPP PPPVGECISD MVYPEVAQAV GSGLSWLCYR NVTFSGGGMS
     LTVLIGAMTG DVANVTFDGC TWRDGAVLLL AGNAHAAVGS LSIVVTGSTF DDALLSPEGV
     FPPHTNITIS GNRFTVTRLI PRSGLDLRRP ACVAMNELVI TNDSAVVLSG NVFQTVRALS
     SAICVLRYAL SVSWHSVFAV VGNTFRMAGA NSTLIHLEGS SQTSSLDVLN NSAVVIRGNI
     VTRPVRDFIS LLWVLRVESQ SAVVFQGNDM QGGLAVFYSS FASHIYYNSW LQLSGNLCRV
     SPVIAFAVFN SRVNLRDSTV SVSGNQFISS TVSPTLIKIP KRPRDLTNGV VVAACNTVNG
     GEGAKYAIPS VYNATILTCS DPCALATSCF PAYTTTASSD GCACTCAEGG HGDACLPVAV
     PESSSTDGAD LCVRDVRVDV ELKAGFGTLV ACYVGVTFAA DVVVDVESMT GSVRNVTLAN
     CTFVDGASLY VVGWLSDPPA GERADVLISG LESRSGGGVL VANRFPPGSR VTVVDSVLIA
     ETRVAYRGAH DLGDASACLV LHKVNLTGSV LTIARTQVVA VSRDAVGVLF VGGVALQSRG
     ALYMDGLSVQ TALGLCVSVE GGVMASGGSV VAFVDSEFLL CKHAVSMRGA VSVLGSAVAL
     VRSEFSSTED YAVAFYSTVS LAGGSMLLVK GNVHDGVLRE MLYATGAVTA AGSTLSFVRN
     RALLPRILSL SLLLSAGAHL RVACNDAGGR VLLTAEEYAA AGFGDAGSID VAGCDACDRD
     THCYAPGTAS ASMRNGVCVG ACGGGGYGEA CVPVGAPALP PAVSTASSVF VREGVTVRSV
     FVVPAGASEV TLRHVVLDGV SPVLYVPWMA RDGVRIVVQN VSLLNGAVLY VMGVGALRGA
     GAAGSDEGGP VELSVCDVEA LNGALVLTGT FPAGSVLTVT DSLLVAARST PLVYLPSSQF
     SPYAPLLVLS GLRLVRSVLV VSGVALVTVM TGGRTAVVDG AVLELVGGGV ALDAAVFGGD
     YALYTSARVV ASAGAVLRVS GSQVYAAHGL VFDSGVVANA SAVVVNDNAG ALTDGALLVL
     RGSASFVSRS WLSVRGNSIS GRLLSVPSYP RSVEFVQSTL TLHGNAGSGP VVMDGTVALV
     GAGRKFFVGC LTLNGQPLQP MDYRSAGIFG EFRPVACGVC DADVHCFAAA TRAMSVSCGC
     CCAEGGYGRD CLPVYLPHVD GCNRTSGMPL LSHTATLTET RSLTPTPTPS LSAAHYSPAQ
     YGPTETLQVT ETVALSPTRT PTASVSSTLW WSDVACPTLA VTTTAAGGSL TQNDIRGGGS
     AVPTRLMVAL PPPFRWASDP QLGTHLSFVP VSTAQPSGFG GPWGAMLSNA TWVRSATNPS
     TVLELAVPVH RGYFIAVDET IVIRCDAVAV FGGCKGVLLG SFTIRSNTLP AAASAFSAIT
     GVVAGAVAVA VVVTGGLGSI LEMQALGVFA RMSCASAQER ASTVALPYFL SVFAALDPLW
     MVVGNALLAA VFGCVHCGVT AAFQRWRGVD AASAWVAMRF PSLTYVVAHA MHLGIFFGSV
     LALAMPDARV QHRVVGVFGV LYGAAFPAGV CYFIARHTGA SFTRYWQFSR KPLHERLLYP
     VGYWHPAAQQ RMYGGMLTNM RGSHVYWCVF QLSVLCVVCL IAAVHPPVGG CHVQYFCMAA
     MLLAGAGVVA FTNMMRSAFL TVMRAASFVL LALLCVISAA NHLAPSDGGA RAYVAIVLLL
     TAVLLAVTVY SVVVWYAEDR HWQELREPRR GGLEALLRDD EESDEEARKP HDMMSSSYAI
     GTTVASSYRP PAPPLQLMAA DTRSDALSLL DRVSSASCSI NYALLDR
//
DBGET integrated database retrieval system