GenomeNet

Database: UniProt
Entry: Q4DUP4_TRYCC
LinkDB: Q4DUP4_TRYCC
Original site: Q4DUP4_TRYCC 
ID   Q4DUP4_TRYCC            Unreviewed;      3466 AA.
AC   Q4DUP4;
DT   13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2005, sequence version 1.
DT   27-MAR-2024, entry version 58.
DE   SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN96239.1};
GN   ORFNames=Tc00.1047053510713.20 {ECO:0000313|EMBL:EAN96239.1};
OS   Trypanosoma cruzi (strain CL Brener).
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX   NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN96239.1, ECO:0000313|Proteomes:UP000002296};
RN   [1] {ECO:0000313|EMBL:EAN96239.1, ECO:0000313|Proteomes:UP000002296}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CL Brener {ECO:0000313|EMBL:EAN96239.1,
RC   ECO:0000313|Proteomes:UP000002296};
RX   PubMed=16020725; DOI=10.1126/science.1112631;
RA   El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA   Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA   Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA   Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA   Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA   da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA   Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA   Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA   Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA   Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA   Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA   Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA   Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA   Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA   Andersson B.;
RT   "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT   disease.";
RL   Science 309:409-415(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN96239.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAHK01000163; EAN96239.1; -; Genomic_DNA.
DR   RefSeq; XP_818090.1; XM_812997.1.
DR   PaxDb; 353153-Q4DUP4; -.
DR   EnsemblProtists; EAN96239; EAN96239; Tc00.1047053510713.20.
DR   GeneID; 3550252; -.
DR   KEGG; tcr:510713.20; -.
DR   eggNOG; ENOG502SEI3; Eukaryota.
DR   InParanoid; Q4DUP4; -.
DR   OrthoDB; 130950at2759; -.
DR   Proteomes; UP000002296; Unassembled WGS sequence.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR   InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR   InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR   InterPro; IPR006626; PbH1.
DR   InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR   PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR24033:SF151; PROTEIN EYES SHUT; 1.
DR   Pfam; PF11024; DGF-1_4; 1.
DR   Pfam; PF11038; DGF-1_5; 1.
DR   Pfam; PF11040; DGF-1_C; 1.
DR   SMART; SM00710; PbH1; 13.
DR   SUPFAM; SSF51126; Pectin lyase-like; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        3120..3144
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3164..3189
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3195..3216
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3266..3287
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3293..3312
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3319..3341
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3353..3378
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          2836..2914
FT                   /note="Dispersed gene family protein 1"
FT                   /evidence="ECO:0000259|Pfam:PF11024"
FT   DOMAIN          2927..3204
FT                   /note="Dispersed gene family protein 1"
FT                   /evidence="ECO:0000259|Pfam:PF11038"
FT   DOMAIN          3376..3462
FT                   /note="Dispersed gene family protein 1 C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11040"
FT   REGION          3400..3451
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3413..3430
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3466 AA;  359064 MW;  8840CD037A23479C CRC64;
     MRSAVGVCPR SRHRVRCRAI LTVRVALLVV LALVAAAAWV PAAHAVVLRL RGGTVDRAIT
     VGRAVDTVLM DGVCITNGVA VVFDVPAMLP GALRIELRNC VCDGGAQIYV RGYGGEPASD
     RSLEVSVSGL SGSYCSLVFV HNLPAHTNVT VRDSTIVTAG PMHYSQLNGL TDAVASPLVL
     HATSLLQTQL RVSNTVLRSL QTGGSAVYVG GGVDLLSSAV ALDGVLLEAS GGPTASAMHV
     ASSSRLSLQS HSVFSMTNVS VVSSGGGIVL GERLAVIDSV LRFVGVEGSV ASSLVRCDGG
     TIGGGGWLDL HDVWAVGEAS SVASLSGVTL SGGAVSIARC AATGATLVSG LTITSGVVSV
     QCNRAGGRVL RSSGDYRMAG LPSVSVVPCD GCVAALACFD ALTASFSDCA CSCRAGGVGD
     ACLPFDVPPA RAGGGGGDTA QDCMSGVTLT ESVTVGGVRA RACFDSVVFS GPITVAVDLR
     LMDAFADALK VTLRHCVLAG GARLRIGGLS ESTARLVPHA LVSMTNVTSL EGTIVLHGAM
     PQHSTVLLAN STLRATVGGS QYVPTTRGHE KFRYGSALVL DGVRLLSTRF VMTRSTMACG
     GGSCAAILVE RGLGANLSSV FYMDSCVVRS QTHVMYAIAS DLRVGGGSVF SIQNSSWSAP
     STEYGEGACV FKDVVVDGGS VLQIVSGIFR LGFAMLIANT LTVADGSWLV HRDNEFRTAY
     VVYVANENGV AFRDRSVWSI LGNNFTYGSY SSTIARMTSK WSPPSDSHPI IYGVCNEARG
     SPVTDYGEEL NIGTPVTVLD CGACTVDAVC FAARTSSISG CECVCAAGGY GDTCLPAAVP
     DGLGPLPLPE ADDTEVRCVH GGSIGSVDYP DPGVRGLCFV NVTFTAAILL DLWSFDAPQQ
     TLNITLLHCV LVGLLIKGSG ARVHVSVVSS MMDSGVLVFQ GDFGVSSQIL VVGSTIVTMS
     GHAILFVKFT LSANMTLLLL DNYIEGSCYA VYFSVALVVD GGGIILKGNT LSATKDDDGV
     ESSVYVYAVD VRNGGFIDVE NNTMSAVNGV ILFGDNTVSS AGLLRVADCT FVGGTETFDP
     ALLHLSGSVT LEGGAQWRVE GNSVGAASVL TILYPQYKIQ LSGSGTTVAL ANNRQVGGNT
     VFAMLLPWST ILTSPARFVV GCNLQDDEEV SYDGVFPEDV VIFRCGTCND DAACYMPGTE
     LVDRSLCSCS CKDGWHGASC LPFEVPETVV PPVAERAVDG DTSCVVNQTL TDLTINMWKT
     HHCYVGVTFS GMGAVLTFFL NSMPLHLPIN ITLTGCTFLD GAALQFVGGA GATESSGVLI
     RVSQTVLRSS AVAFLRALPL HCDIAVTEVD AVQSFEFELS GTVNNMWSVF LLGNVVLRAS
     TLLVSNVKAY ATKRDALGFS SIGTLKLVGG SSLYLQYCSF EGYKYLFYVH SLSVSDNSVF
     ALLNNTMLFG VSLLYQHQGF SVSDHSVLRV VRNSGSARYA ICNDELWNVQ RSSWLDWRDN
     DVEVGAMFYD TESAFVSIDD SSAVTLTGCK MGSTGLSVSL LKRADAGYRF VAGCLTVAGR
     EVTTAAELAL HGITNVTTVA VCGECTRDGD CFAPLTTAVS DCKCQCAAGG HGDACVPAPV
     PAGPPPPPSP PLRPTPPPPP VGECISDMVY PEVAQSVGDG LSWLCYRNVT FSGGGMSLTV
     LVGAMTGDVV KITFDGCTWR DGAVLLLLGN AHATVGSLHI VVTGSTFSDA LLSPEGGFPP
     RTNITISGNH FAVTRLIPRS GLDIWIPSCV AMNGLVIAND SAVVLSGNVF QTVTASSIAI
     YVVRSALRVW WHSVFAVVGN TFHMDGGGST LINIDGFSQS LSLSVLNNSA VVVRGNLVTR
     PVRYFLLLIL ASRVESRSAV VFQGNEMQRS SVVFFSSHSS YIYYNSWLQV SGNLCRESPS
     EAFASLYPKV NLRDSTVSVS GNRFMSSTDR PTVLRITTGS RDITNGAIVA ACNSVNGEEG
     VEYSVPSVYN ATILTCSDPC ILAASCFPAY TTTASSDGCA CNCAEGGHGD ACLPVAVPEP
     PSTDGADLCV RDVRVDVEVS AGLGTSVACY VGVTFAADVV VDVESMSGSV RNVTLANCTF
     VGGASLYVVG WLSDPPAGQC VDVLISGLES RSGGGVLVAN RYPPGSRVTV VDSVLIAEAR
     VAYRGEYDLG DASACLVVHN VNLTGSVLTI ARTHVAAVFC DAVGVLVVGS VALSSRGALY
     VDGLSVQTAL GLCVSVEGGV AASGGSVVAF VDSDFLLCKH AVSVRGAVSV SGSAVALVRG
     EFLSTEDHAV AFYSTVSVAG GSMLLAKDNV HDGVSREMLY AAGAVTAVGS TLSFVRNRAL
     LPRMLSASLS LSAGAHLRVA CNDAGGRVLS TAEEYAAAGF GDAGSIDVVG CDVCDRDTHC
     YAPGTASASM RNGVCVCACG SGGYGEACVP VGAPALPPAV GTASSVFVRE GVAVQSVFVV
     PAGASEVTLR HVVLDGVSPV LYVPWMARDG VRIVVQNVSL LSGAVLYVMG GGALRGAVAA
     GSDEGGPVEL SVCDVEALNG ALVLTGTYPA GSVLTVTDSL LVAARLTPLV YLPGSQSSPY
     APVLVLSGLR LVRSVLVVSG VALVTVMTGG RTVAVDGAVL ELVGGGVALD AAVLGGDFAL
     YASARVVASG GAVLRVSGSQ VYAAHGLVFG SGVEVNASAV VVNDNAGVLT DGALLVLRGS
     ASFLSGSWLS VRGNSISGRL LSVPSYPRSA ELVQSTLTLY ENAGSGSVVM DGTVALGGAG
     RNFVVGCLTL NGQTLRPMEY RSAGIIGKFR PVACGVCDAD VSCFAAATRT MSRSCGCRCA
     EGGYGRDCLP VYLPHVDGCN RTPAMPLLSH TVTLTETRSL TPTWTSTWTP SLSATHDSPT
     WYGPTETLQG TETVALSPTR TPTASVSGTL WWSDVACPTL AVTTTAAGGS LTQNDIRGGG
     SVVPTRLMVA LPPPFRWASG PQLGTHLSFV PVSTAQPSGF GGPWGAMLRN ATWVRNATNP
     STVLELAVPV HRGYFIAADE TIVIRCDAAA VSGGCKGVLL GSFTIRSDTL PAAASALSAI
     TGVAAGAAAA SVLVTGGLGS ILEMQALGVF ARMPCASAQE SESTAALPYF LSVFAALDPL
     WMVVGNALLA AVFGCVHCGV TAAFQRWRGV DAASAWAAMR FPSLTYVVAH AMHLGIFFGS
     VLALAMPGAR VQQRVIGVTG VLYGVAFPAG VCYLIARHTG ASFTKYWQFS RKPLHERLLY
     PVGYWHPAAQ RRMYGGMLTN MKGSHVYWCV FQLSVLCVVG LIAAVHPPVG VCHVLYFCIA
     AVLLAGAGVV AFTNMMRSAF LTVMHTASFV LLAALCLVSA ANHLAPSDGG TRAYAAIVLL
     LTTVLLAVTV YSVVVWYAED RHWQELREPR RGGLEALLRD DEESDEDAQK PHEMTSSSYA
     TGTTGASSYR PPAPPLQPMA GDTHSDALSP LDRASSASCM IDYAAM
//
DBGET integrated database retrieval system