ID Q4DUP4_TRYCC Unreviewed; 3466 AA.
AC Q4DUP4;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE SubName: Full=Dispersed gene family protein 1 (DGF-1), putative {ECO:0000313|EMBL:EAN96239.1};
GN ORFNames=Tc00.1047053510713.20 {ECO:0000313|EMBL:EAN96239.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN96239.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN96239.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN96239.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN96239.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000163; EAN96239.1; -; Genomic_DNA.
DR RefSeq; XP_818090.1; XM_812997.1.
DR PaxDb; 353153-Q4DUP4; -.
DR EnsemblProtists; EAN96239; EAN96239; Tc00.1047053510713.20.
DR GeneID; 3550252; -.
DR KEGG; tcr:510713.20; -.
DR eggNOG; ENOG502SEI3; Eukaryota.
DR InParanoid; Q4DUP4; -.
DR OrthoDB; 130950at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR InterPro; IPR021053; Dispersed_gene_fam_prot1_C.
DR InterPro; IPR021004; Dispersed_gene_fam_prot1_dom4.
DR InterPro; IPR021282; Dispersed_gene_fam_prot1_dom5.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24033:SF151; PROTEIN EYES SHUT; 1.
DR Pfam; PF11024; DGF-1_4; 1.
DR Pfam; PF11038; DGF-1_5; 1.
DR Pfam; PF11040; DGF-1_C; 1.
DR SMART; SM00710; PbH1; 13.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 3120..3144
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3164..3189
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3195..3216
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3266..3287
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3293..3312
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3319..3341
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3353..3378
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 2836..2914
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11024"
FT DOMAIN 2927..3204
FT /note="Dispersed gene family protein 1"
FT /evidence="ECO:0000259|Pfam:PF11038"
FT DOMAIN 3376..3462
FT /note="Dispersed gene family protein 1 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11040"
FT REGION 3400..3451
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3413..3430
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3466 AA; 359064 MW; 8840CD037A23479C CRC64;
MRSAVGVCPR SRHRVRCRAI LTVRVALLVV LALVAAAAWV PAAHAVVLRL RGGTVDRAIT
VGRAVDTVLM DGVCITNGVA VVFDVPAMLP GALRIELRNC VCDGGAQIYV RGYGGEPASD
RSLEVSVSGL SGSYCSLVFV HNLPAHTNVT VRDSTIVTAG PMHYSQLNGL TDAVASPLVL
HATSLLQTQL RVSNTVLRSL QTGGSAVYVG GGVDLLSSAV ALDGVLLEAS GGPTASAMHV
ASSSRLSLQS HSVFSMTNVS VVSSGGGIVL GERLAVIDSV LRFVGVEGSV ASSLVRCDGG
TIGGGGWLDL HDVWAVGEAS SVASLSGVTL SGGAVSIARC AATGATLVSG LTITSGVVSV
QCNRAGGRVL RSSGDYRMAG LPSVSVVPCD GCVAALACFD ALTASFSDCA CSCRAGGVGD
ACLPFDVPPA RAGGGGGDTA QDCMSGVTLT ESVTVGGVRA RACFDSVVFS GPITVAVDLR
LMDAFADALK VTLRHCVLAG GARLRIGGLS ESTARLVPHA LVSMTNVTSL EGTIVLHGAM
PQHSTVLLAN STLRATVGGS QYVPTTRGHE KFRYGSALVL DGVRLLSTRF VMTRSTMACG
GGSCAAILVE RGLGANLSSV FYMDSCVVRS QTHVMYAIAS DLRVGGGSVF SIQNSSWSAP
STEYGEGACV FKDVVVDGGS VLQIVSGIFR LGFAMLIANT LTVADGSWLV HRDNEFRTAY
VVYVANENGV AFRDRSVWSI LGNNFTYGSY SSTIARMTSK WSPPSDSHPI IYGVCNEARG
SPVTDYGEEL NIGTPVTVLD CGACTVDAVC FAARTSSISG CECVCAAGGY GDTCLPAAVP
DGLGPLPLPE ADDTEVRCVH GGSIGSVDYP DPGVRGLCFV NVTFTAAILL DLWSFDAPQQ
TLNITLLHCV LVGLLIKGSG ARVHVSVVSS MMDSGVLVFQ GDFGVSSQIL VVGSTIVTMS
GHAILFVKFT LSANMTLLLL DNYIEGSCYA VYFSVALVVD GGGIILKGNT LSATKDDDGV
ESSVYVYAVD VRNGGFIDVE NNTMSAVNGV ILFGDNTVSS AGLLRVADCT FVGGTETFDP
ALLHLSGSVT LEGGAQWRVE GNSVGAASVL TILYPQYKIQ LSGSGTTVAL ANNRQVGGNT
VFAMLLPWST ILTSPARFVV GCNLQDDEEV SYDGVFPEDV VIFRCGTCND DAACYMPGTE
LVDRSLCSCS CKDGWHGASC LPFEVPETVV PPVAERAVDG DTSCVVNQTL TDLTINMWKT
HHCYVGVTFS GMGAVLTFFL NSMPLHLPIN ITLTGCTFLD GAALQFVGGA GATESSGVLI
RVSQTVLRSS AVAFLRALPL HCDIAVTEVD AVQSFEFELS GTVNNMWSVF LLGNVVLRAS
TLLVSNVKAY ATKRDALGFS SIGTLKLVGG SSLYLQYCSF EGYKYLFYVH SLSVSDNSVF
ALLNNTMLFG VSLLYQHQGF SVSDHSVLRV VRNSGSARYA ICNDELWNVQ RSSWLDWRDN
DVEVGAMFYD TESAFVSIDD SSAVTLTGCK MGSTGLSVSL LKRADAGYRF VAGCLTVAGR
EVTTAAELAL HGITNVTTVA VCGECTRDGD CFAPLTTAVS DCKCQCAAGG HGDACVPAPV
PAGPPPPPSP PLRPTPPPPP VGECISDMVY PEVAQSVGDG LSWLCYRNVT FSGGGMSLTV
LVGAMTGDVV KITFDGCTWR DGAVLLLLGN AHATVGSLHI VVTGSTFSDA LLSPEGGFPP
RTNITISGNH FAVTRLIPRS GLDIWIPSCV AMNGLVIAND SAVVLSGNVF QTVTASSIAI
YVVRSALRVW WHSVFAVVGN TFHMDGGGST LINIDGFSQS LSLSVLNNSA VVVRGNLVTR
PVRYFLLLIL ASRVESRSAV VFQGNEMQRS SVVFFSSHSS YIYYNSWLQV SGNLCRESPS
EAFASLYPKV NLRDSTVSVS GNRFMSSTDR PTVLRITTGS RDITNGAIVA ACNSVNGEEG
VEYSVPSVYN ATILTCSDPC ILAASCFPAY TTTASSDGCA CNCAEGGHGD ACLPVAVPEP
PSTDGADLCV RDVRVDVEVS AGLGTSVACY VGVTFAADVV VDVESMSGSV RNVTLANCTF
VGGASLYVVG WLSDPPAGQC VDVLISGLES RSGGGVLVAN RYPPGSRVTV VDSVLIAEAR
VAYRGEYDLG DASACLVVHN VNLTGSVLTI ARTHVAAVFC DAVGVLVVGS VALSSRGALY
VDGLSVQTAL GLCVSVEGGV AASGGSVVAF VDSDFLLCKH AVSVRGAVSV SGSAVALVRG
EFLSTEDHAV AFYSTVSVAG GSMLLAKDNV HDGVSREMLY AAGAVTAVGS TLSFVRNRAL
LPRMLSASLS LSAGAHLRVA CNDAGGRVLS TAEEYAAAGF GDAGSIDVVG CDVCDRDTHC
YAPGTASASM RNGVCVCACG SGGYGEACVP VGAPALPPAV GTASSVFVRE GVAVQSVFVV
PAGASEVTLR HVVLDGVSPV LYVPWMARDG VRIVVQNVSL LSGAVLYVMG GGALRGAVAA
GSDEGGPVEL SVCDVEALNG ALVLTGTYPA GSVLTVTDSL LVAARLTPLV YLPGSQSSPY
APVLVLSGLR LVRSVLVVSG VALVTVMTGG RTVAVDGAVL ELVGGGVALD AAVLGGDFAL
YASARVVASG GAVLRVSGSQ VYAAHGLVFG SGVEVNASAV VVNDNAGVLT DGALLVLRGS
ASFLSGSWLS VRGNSISGRL LSVPSYPRSA ELVQSTLTLY ENAGSGSVVM DGTVALGGAG
RNFVVGCLTL NGQTLRPMEY RSAGIIGKFR PVACGVCDAD VSCFAAATRT MSRSCGCRCA
EGGYGRDCLP VYLPHVDGCN RTPAMPLLSH TVTLTETRSL TPTWTSTWTP SLSATHDSPT
WYGPTETLQG TETVALSPTR TPTASVSGTL WWSDVACPTL AVTTTAAGGS LTQNDIRGGG
SVVPTRLMVA LPPPFRWASG PQLGTHLSFV PVSTAQPSGF GGPWGAMLRN ATWVRNATNP
STVLELAVPV HRGYFIAADE TIVIRCDAAA VSGGCKGVLL GSFTIRSDTL PAAASALSAI
TGVAAGAAAA SVLVTGGLGS ILEMQALGVF ARMPCASAQE SESTAALPYF LSVFAALDPL
WMVVGNALLA AVFGCVHCGV TAAFQRWRGV DAASAWAAMR FPSLTYVVAH AMHLGIFFGS
VLALAMPGAR VQQRVIGVTG VLYGVAFPAG VCYLIARHTG ASFTKYWQFS RKPLHERLLY
PVGYWHPAAQ RRMYGGMLTN MKGSHVYWCV FQLSVLCVVG LIAAVHPPVG VCHVLYFCIA
AVLLAGAGVV AFTNMMRSAF LTVMHTASFV LLAALCLVSA ANHLAPSDGG TRAYAAIVLL
LTTVLLAVTV YSVVVWYAED RHWQELREPR RGGLEALLRD DEESDEDAQK PHEMTSSSYA
TGTTGASSYR PPAPPLQPMA GDTHSDALSP LDRASSASCM IDYAAM
//