ID Q4DDF1_TRYCC Unreviewed; 748 AA.
AC Q4DDF1;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 24-JAN-2024, entry version 60.
DE RecName: Full=SAP domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=Tc00.1047053510517.30 {ECO:0000313|EMBL:EAN90547.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN90547.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN90547.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN90547.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN90547.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000618; EAN90547.1; -; Genomic_DNA.
DR RefSeq; XP_812398.1; XM_807305.1.
DR AlphaFoldDB; Q4DDF1; -.
DR SMR; Q4DDF1; -.
DR STRING; 353153.Q4DDF1; -.
DR PaxDb; 353153-Q4DDF1; -.
DR EnsemblProtists; EAN90547; EAN90547; Tc00.1047053510517.30.
DR GeneID; 3543559; -.
DR KEGG; tcr:510517.30; -.
DR eggNOG; ENOG502RXCI; Eukaryota.
DR InParanoid; Q4DDF1; -.
DR OMA; RKWDSMI; -.
DR OrthoDB; 1221154at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF13812; PPR_3; 1.
DR PROSITE; PS51375; PPR; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002296}.
FT REPEAT 67..101
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 183..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 566..585
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 697..748
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 697..717
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 748 AA; 85951 MW; D914B2702F534CF5 CRC64;
MWRFLMARES RRAVALGAAT ILLPFRCVHE RPFLDLPTDV SQGEAYDMAR DALVQLNVMV
RRGIEPDALM YTSLIATMGR AKLEWQAYKL FSRMLEQGVR PLPETYVALR DATSPSRRRL
RQDLQLKIEE SLESLPSELA EEELARRQEQ DRLCVEKFED YMRGVLPTPP PCASHTQTTM
IPAVQGEGGR NDDGDFQKER NGASAAATKT DGGGQEEPRP VATMHIRNPT DAWSTARMME
EQRGIRTQRT QGSDTIALSD ELHRLHEEEL RIFLAAQRQL RHGTKDELVR RVLDNVPETS
IRDMLGRRKH YFRSVAHILE NDIHALRREN IGENKDVLQN SVQEQFSAAD RTLTAAEKET
VAPEVLHTPW GILRKPMKRE PDPSASRSME RLQRISLSLE ELQLIRCKGE TGDLDELPES
LLRRYAFEFN LKWKRRFPLS LLEAVQWHCT TFLPEQLDGK VVVRPTPALR RQQEEEGMHK
TLENYEAFRI ISQRTNNLQV VDHKEINLHL KKIQKTMLQK ERHTEETLRR ERNLLDAATL
AASAKSFTPP DGTSTTVSRV LGVNEGGDSS IAKIPPPAST SDTRGDSLVQ EECHELPPWA
IFSGAEEFDI STGRFGDPDV GRYQELSDGR FKVLPSREAQ DKWAVDRQLL PGPLQDKLQR
AELQQKLRHE AIERRYQQKL QYNRYRKWDS FLRKAQEKGQ REKSSEEEEE VDAVRPLPPK
RRLSQLLRKG RDKAPVEGSV KAKYTRSL
//