ID Q4DST8_TRYCC Unreviewed; 3055 AA.
AC Q4DST8;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 85.
DE SubName: Full=Ankyrin repeat protein, putative {ECO:0000313|EMBL:EAN95587.1};
GN ORFNames=Tc00.1047053510769.80 {ECO:0000313|EMBL:EAN95587.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN95587.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN95587.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN95587.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN95587.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000202; EAN95587.1; -; Genomic_DNA.
DR RefSeq; XP_817438.1; XM_812345.1.
DR STRING; 353153.Q4DST8; -.
DR PaxDb; 353153-Q4DST8; -.
DR EnsemblProtists; EAN95587; EAN95587; Tc00.1047053510769.80.
DR GeneID; 3549473; -.
DR KEGG; tcr:510769.80; -.
DR eggNOG; KOG4177; Eukaryota.
DR InParanoid; Q4DST8; -.
DR OMA; EGVSIGW; -.
DR OrthoDB; 123304at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 7.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR017868; Filamin/ABP280_repeat-like.
DR PANTHER; PTHR24123:SF33; ANKYRIN 2, ISOFORM U; 1.
DR PANTHER; PTHR24123; ANKYRIN REPEAT-CONTAINING; 1.
DR Pfam; PF12796; Ank_2; 4.
DR Pfam; PF13637; Ank_4; 1.
DR SMART; SM00248; ANK; 24.
DR SUPFAM; SSF48403; Ankyrin repeat; 5.
DR PROSITE; PS50297; ANK_REP_REGION; 4.
DR PROSITE; PS50088; ANK_REPEAT; 4.
DR PROSITE; PS50194; FILAMIN_REPEAT; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|ARBA:ARBA00023043, ECO:0000256|PROSITE-
KW ProRule:PRU00023}; Reference proteome {ECO:0000313|Proteomes:UP000002296};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 249..281
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 645..677
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 677..709
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 1736..1768
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 2803..2829
FT /note="Filamin"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00087"
FT REGION 3008..3037
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3016..3037
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3055 AA; 335130 MW; 6EACA36503F54486 CRC64;
MRSAEANGTP LGAPILCQVR TGGATGSRLV GTIHIRLNSS LGDVRLLLRS LASRARGDAA
APSGLEVHGA PVNVTQGEWK KAMSYPLHPF DLSLAFSFVK CGGCLPIQQH HEEKLLLVDV
FPRLPRAWGM STVAGWMMEP TPPMVKVLAR PGEYVPCRDA LAVVFIAQGE PENLMPEDVL
VERQMLQQCS YGRPYQLKEL FTLLNQWNGN VKDCFGRTVL HECVYQGHLE AVVSLLSFSF
IRVNEQDIQG KTPLHIAVRV GNEFVVSRLL EAGADILLTD NGGDTALHVA LRLRNDRIVE
LLCKRLRATG IEAKRLCFCK NGVGISPMDM FQLHSPTFIQ LCEEGDVAAI KSLYDHYLFS
RDSIKGCDEL LHQSALHVAA ACGHVNVLKF LLDELKFGRL TDKQMLNSRS QTPMHLAAER
GQISAVRFLH ERYPWFISVR DITGATPLVA ALRRRQHSMA VVDYLISVLP RGSGAINACD
NSGMGALHLL CELGMTLAAN SLIVDHGADV TLSCSCGIVS SRYLKKTRIL PRLSKRAGEA
STRQVVGEQQ GLTPLLCTLR GGRGHVSTIE MLLSHGAGTR GDEVVELLFY LITKGHYEFA
DRVVASGVGK LTSSNDLLSR FCHLNHGVGI RWCIERGCCS LNSPEGGYPL LVSSALGDAE
AVSFLLSRGA DPNVTAGGKT PLLASINGGH HAVVERLVRA GARLVASDGS WTALRAAAER
GAESVVRALL GLHVLSPLVV SQALVHAMES VRGRKSITCE RVCVHLAGFL DLASRDIMHP
TELLHLAASR SLFAVVRVLV DKLLALPASV LRGIVNDAPP PPNRMFVLIE PTVAPLRVKA
SRLVSVKGRR GLYTPPKPFK RLFIQRREAG VLFHRLRLRD VHSYCAEANE GELLERLLLD
VGLKPWAGPD YRGWNAADYA ADKQQANALR MFLVVGLAPC RRHVVCRGTR MGALCRSIKC
ANVNVDEAGL LSSALCDLAA AQETTLVRQI LTDTCRFHNV SDLSGAGAWL DELILCCVRT
RCLDVLGILE GEFQVPLKKR LSLSVQPLLS AVACRDVKLL AYLILHGTPV DLIGSIPAMG
GHELRSLARE EREVSPLWLA ARLGDITAME LLLSVGPLSP AKCTDSTDFR RDALKALVDG
APRRPSKAQD TLIAQGVVML ARAGHVYTLP SIMRVAAGKG LVRTVESLID CYGPRTVTED
IVSSGLCSLH YLVGRPALSR LLRSSLLLVA DVGSVHVEAS QNLLATIQTT KFTVNPVDYA
LRAGCSEGAL LLLGLGLCGS GGRKEKCNPK ISRIVRLCAA RSAAQGEGGY TVLHAAMELQ
YHEIAYKILN ERVLLQHVDR SSMEEKRTDN TPSVASLMNF YLDRAHRHLL LSPGVPAPMK
ISPCDMPYLV FSQSFDDEGY REIVRLSALD GGLEQTLDLL HGGLPMAFVW HLFSRHRHSF
YFCNRFFGTT ELTPMGCAVA AGNLPWVRLL AYTGVSTSNC NSIVAGIKRG TASSLVQVPI
PPVHRKVGVQ AAARRHVCRK EDSYPVNFTD ERHHVSPLLL SMAIIVETSW AGDAEGLLRQ
SQVMQFLLSC DQCLQREELN PLAIALAKLM LWDLLEALVA AATKLIDNPT EGRLFCTIHE
AEVPPIIKRA IGSSRHVMHV AARCAPREIL MLVAKHSQRA DIEEACDAKG KTVLFYAIHH
PCRFALDVLG GLRVPTNVQC CRRTGRTLLM LACQRGQLSL VMALLKKAEM NARDRDGNTA
LLLAAAAGRA EIVEHLLANG ADPSVKNRKG MTAVMAAAFA GHDDIAVPLA ENFSTLDDFF
SPQTTLLHCA AVGGCHRVAS TLVDMVEKIN IWAEDVGGFT AVYLAYAFGN ALVLRTLLSA
ALKYGVEPAP SLHLERQIFI CSSTLPRYGW LRGVLRMGEV LLDESRQRLL CQSATRPGEM
TYGNPTSFRW FRTSLLLWCV CNNNAVGVRV LGEMNSADDC GALHEAAKRG HLGVVELLLK
LEMSDPNVLD GAGRLPFEVA AAHRHVACAS LLLARTRLDM MHLKAPAAGS AQSALHRLAS
SGGAETLFVL VEAFQGMNRP DWSVVASQLL DALDIPDSSG MTAFESAVAM GNPAGVLRVA
QVIRRLALAS GRKKALSVSN SILSHLPCMS PAVRVLLYDV FGILEVAMCV GFEQEKRVGR
LAFADVRLIG ANALRDNAFC ASEVAAAFTL EHEISVASSL LKGLPFRIRY IPRSLETRSA
SQQVQLLQWL ESSLILSNYE NWGAEKPLET IELELVPQRT DEFVEISNMY LRHSVYMDSR
RLLTPNLHAK LRFASRSEAL RLRSVTEERC RLLTLKMRQL PHPLMSRGRV MVEWGDGKDE
VTVEAVTRLM DDGLARVDAF FSGKLKDALI GVSMADVISV TAATDRTMNA VYISFRYTRE
IIRRRHYTSS EGGFVIMFND NAMQDVDRVL HLALYRQVVA DVNLVGVSYV RDAFLADVRR
LAGMQMGDDA TSFKLEVEGG TLDELPLHLL EAMMSDAAEA LASLITESLA NMRNFHSSKI
VSDFLKQSLR SVVVLFSPER RPVAEFSEGK LMLCLNLQLA PTRCEICEGL RRGALAGEIE
RLRGELPRVI ATVGQRLQMR LPTTTFVLDS ASLLEGEDDE YVVSALSLLC YNCGALVLQP
LMDGVSMGWD TRLGSVVRRH VRQLTLVLEL FGGGNCVLYD SGNFVYNCPL FFLERGVYAT
SSCYLLSSQQ IASLLLMQLS LVDPSIFGLV NTSKPLACWS QVHGVCTRIL SAGCQRNAVR
MTTRNIFNLR LGHAVSESSL TFIGCWRSIK VHEATGRVHF TAPTKAGYYE QQILMGGQPI
LKSPLRLRVR PLAVHLPSTK VLSNFNTVVV GRPFDIVLLL RDKYGNRIEA PEKIVVSPVT
AGAARIMSWK RSRVDMIEVR VVVADVCDKC AVPIRLRAAD EGDFELLFPV ESVSEQTYHW
VCLLRGGHTL DWEKRCRASK TKATKTLSMA GKASDRVKYF FRRTVRDAER RRLVRDAKLI
GGLLNRVFPP RRPNDGKNGN SKKGKKVTRD EYPSRRRVCV VMGEESEAKG EEVSK
//