ID Q4N3G8_THEPA Unreviewed; 508 AA.
AC Q4N3G8;
DT 02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT 02-AUG-2005, sequence version 1.
DT 13-SEP-2023, entry version 41.
DE RecName: Full=Theileria-specific sub-telomeric protein, SVSP family {ECO:0008006|Google:ProtNLM};
GN OrderedLocusNames=TP04_0015 {ECO:0000313|EMBL:EAN31367.1};
OS Theileria parva (East coast fever infection agent).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC Theileriidae; Theileria.
OX NCBI_TaxID=5875 {ECO:0000313|EMBL:EAN31367.1, ECO:0000313|Proteomes:UP000001949};
RN [1] {ECO:0000313|EMBL:EAN31367.1, ECO:0000313|Proteomes:UP000001949}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Muguga {ECO:0000313|EMBL:EAN31367.1,
RC ECO:0000313|Proteomes:UP000001949};
RX PubMed=15994558; DOI=10.1126/science.1110439;
RA Gardner M.J., Bishop R., Shah T., de Villiers E.P., Carlton J.M., Hall N.,
RA Ren Q., Paulsen I.T., Pain A., Berriman M., Wilson R.J.M., Sato S.,
RA Ralph S.A., Mann D.J., Xiong Z., Shallom S.J., Weidman J., Jiang L.,
RA Lynn J., Weaver B., Shoaibi A., Domingo A.R., Wasawo D., Crabtree J.,
RA Wortman J.R., Haas B., Angiuoli S.V., Creasy T.H., Lu C., Suh B.,
RA Silva J.C., Utterback T.R., Feldblyum T.V., Pertea M., Allen J.,
RA Nierman W.C., Taracha E.L.N., Salzberg S.L., White O.R., Fitzhugh H.A.,
RA Morzaria S., Venter J.C., Fraser C.M., Nene V.;
RT "Genome sequence of Theileria parva, a bovine pathogen that transforms
RT lymphocytes.";
RL Science 309:134-137(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN31367.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAGK01000004; EAN31367.1; -; Genomic_DNA.
DR RefSeq; XP_763650.1; XM_758557.1.
DR AlphaFoldDB; Q4N3G8; -.
DR EnsemblProtists; EAN31367; EAN31367; TP04_0015.
DR GeneID; 3501311; -.
DR KEGG; tpv:TP04_0015; -.
DR VEuPathDB; PiroplasmaDB:TpMuguga_04g00015; -.
DR eggNOG; ENOG502SKCI; Eukaryota.
DR InParanoid; Q4N3G8; -.
DR Proteomes; UP000001949; Unassembled WGS sequence.
DR InterPro; IPR007480; DUF529.
DR InterPro; IPR011695; Tash_PEST_motif.
DR Pfam; PF04385; FAINT; 1.
DR Pfam; PF07708; Tash_PEST; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001949};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..508
FT /note="Theileria-specific sub-telomeric protein, SVSP
FT family"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004240746"
FT REGION 138..307
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 202..219
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 232..246
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 258..302
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 508 AA; 58807 MW; 83398ED462F47610 CRC64;
MHRCLAYIYI LIFIIIQCVE SSDKYPYHPE DNGNDEGSDF NLIIKGIENL LDEDDDQTAG
QTVISDNIMQ HGLGSITDQT YGTAQPSQLI PQSSQVQSGY YQQHLQPVPI PPQHIPEQIY
PLTQQIHYEY HPQRPIYRPP YLQTQPRPQP IQQAPQYGPI YHYQPPEIRQ PRQPYPRPAG
PSYYSRRETP GNKYFGQRYH PYEQSHSDSG HTPHTNTGSG LLHPDPYRLP HPSQIIQQTQ
PPQASKEPLQ PERVTVEVGS DEDDEETEEI GQEPSEPEQP EEAEEGAVGG AGDGDDEEEE
EEKPSEAVKK CKKIRFFKKN SEGNIIPMIK KDFKRIHDDD KLKKYLLYAN LEVLLCDGDV
VYKHNAGINY PTQLSYNKVK NIFIFTRRGG FLLIKYSEGE WRVEGRRDQE YLKFYSKTHW
GKYVEITCND YYTELSAAKA FKYTFREGVN CEKVTYKGET IWRRKKFKAS PESVRFTPKG
NVIIYFSSYL IVFGKRQGSF KQILARPR
//