ID Q4E302_TRYCC Unreviewed; 663 AA.
AC Q4E302;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 82.
DE RecName: Full=SET domain-containing protein {ECO:0000259|PROSITE:PS50280};
GN ORFNames=Tc00.1047053506435.400 {ECO:0000313|EMBL:EAN99174.1};
OS Trypanosoma cruzi (strain CL Brener).
OC Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC Trypanosomatida; Trypanosomatidae; Trypanosoma; Schizotrypanum.
OX NCBI_TaxID=353153 {ECO:0000313|EMBL:EAN99174.1, ECO:0000313|Proteomes:UP000002296};
RN [1] {ECO:0000313|EMBL:EAN99174.1, ECO:0000313|Proteomes:UP000002296}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN99174.1,
RC ECO:0000313|Proteomes:UP000002296};
RX PubMed=16020725; DOI=10.1126/science.1112631;
RA El-Sayed N.M., Myler P.J., Bartholomeu D.C., Nilsson D., Aggarwal G.,
RA Tran A.N., Ghedin E., Worthey E.A., Delcher A.L., Blandin G.,
RA Westenberger S.J., Caler E., Cerqueira G.C., Branche C., Haas B.,
RA Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., Bringaud F.,
RA Burton P., Cadag E., Campbell D.A., Carrington M., Crabtree J., Darban H.,
RA da Silveira J.F., de Jong P., Edwards K., Englund P.T., Fazelina G.,
RA Feldblyum T., Ferella M., Frasch A.C., Gull K., Horn D., Hou L., Huang Y.,
RA Kindlund E., Klingbeil M., Kluge S., Koo H., Lacerda D., Levin M.J.,
RA Lorenzi H., Louie T., Machado C.R., McCulloch R., McKenna A., Mizuno Y.,
RA Mottram J.C., Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M.,
RA Pentony M., Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L.,
RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., Simpson A.J.,
RA Sisk E., Tammi M.T., Tarleton R., Teixeira S., Van Aken S., Vogt C.,
RA Ward P.N., Wickstead B., Wortman J., White O., Fraser C.M., Stuart K.D.,
RA Andersson B.;
RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas
RT disease.";
RL Science 309:409-415(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN99174.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHK01000028; EAN99174.1; -; Genomic_DNA.
DR RefSeq; XP_821025.1; XM_815932.1.
DR AlphaFoldDB; Q4E302; -.
DR SMR; Q4E302; -.
DR STRING; 353153.Q4E302; -.
DR PaxDb; 353153-Q4E302; -.
DR EnsemblProtists; EAN99174; EAN99174; Tc00.1047053506435.400.
DR GeneID; 3553839; -.
DR KEGG; tcr:506435.400; -.
DR eggNOG; KOG2084; Eukaryota.
DR InParanoid; Q4E302; -.
DR OrthoDB; 166337at2759; -.
DR Proteomes; UP000002296; Unassembled WGS sequence.
DR CDD; cd20071; SET_SMYD; 1.
DR Gene3D; 1.10.220.160; -; 1.
DR Gene3D; 6.10.140.2220; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR46165:SF8; RE32936P; 1.
DR PANTHER; PTHR46165; SET AND MYND DOMAIN-CONTAINING PROTEIN 4; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00028; TPR; 4.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002296}.
FT DOMAIN 215..473
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
SQ SEQUENCE 663 AA; 73804 MW; A8A0E05263BB94DC CRC64;
MNGGVPPDVA SQAQARREAA ANLGTLRGSL LALAKILPPD SGTRPVKELL AHLQREDAPR
LASLLENLKH EVQGASDLNN DGAEKWKEEG HKAYDLGKMA ESVLMYTRGV MYASQDDTLA
SLLYHRGKVF LAQARFMEAL ADTHAAFTFM PTNWEALERR GVCLQKLGFE EEGKKDIHAA
AILNVDSANA ATLISNILHA DVGRSEDEAV NLIGGGISSS HLNDINYNGK MKGVEAKRHL
EPGEIVTEAP VVHALYDDHW GIRCCYCLRV TQALYPGSAY RERGKLSRGL FCGEVCAQLS
WERYGQHETA NPFFQLCPID ALIASRMMCS DCAVSRCHSL RGDFTGELHP AAVIGGYETA
VSLCALVLDA VSLEDVDRLR LAQRQVMLCS FEVKFWTGSQ VTINSETREA FIDESRPIPV
GKALYVSASQ YRHSCDPNCF ASFVGNPLGC SLYLAIRAIR PIPAGEEITI SYHNITTYKA
VSAQFRRRAL AERCGFICNC RACVDTKEER VTVEKKGYYI QASDLYQKGC RLIREGQYDV
AVTVLSQSYT IAMEHLCPPP RPPQSMIPKT HMALARAFNR LKDNEKCVEH LLAKVELDRQ
IYGENHLEFA DDYIRLAYFA LSEEERGVFA ERAKAHLSRF YAPSRELHAQ MSRFNSFVVR
ACF
//