Database: UniProt
Entry: Q4N3G8_THEPA
ID   Q4N3G8_THEPA            Unreviewed;       508 AA.
AC   Q4N3G8;
DT   02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT   02-AUG-2005, sequence version 1.
DT   13-SEP-2023, entry version 41.
DE   RecName: Full=Theileria-specific sub-telomeric protein, SVSP family {ECO:0008006|Google:ProtNLM};
GN   OrderedLocusNames=TP04_0015 {ECO:0000313|EMBL:EAN31367.1};
OS   Theileria parva (East coast fever infection agent).
OC   Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC   Theileriidae; Theileria.
OX   NCBI_TaxID=5875 {ECO:0000313|EMBL:EAN31367.1, ECO:0000313|Proteomes:UP000001949};
RN   [1] {ECO:0000313|EMBL:EAN31367.1, ECO:0000313|Proteomes:UP000001949}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Muguga {ECO:0000313|EMBL:EAN31367.1,
RC   ECO:0000313|Proteomes:UP000001949};
RX   PubMed=15994558; DOI=10.1126/science.1110439;
RA   Gardner M.J., Bishop R., Shah T., de Villiers E.P., Carlton J.M., Hall N.,
RA   Ren Q., Paulsen I.T., Pain A., Berriman M., Wilson R.J.M., Sato S.,
RA   Ralph S.A., Mann D.J., Xiong Z., Shallom S.J., Weidman J., Jiang L.,
RA   Lynn J., Weaver B., Shoaibi A., Domingo A.R., Wasawo D., Crabtree J.,
RA   Wortman J.R., Haas B., Angiuoli S.V., Creasy T.H., Lu C., Suh B.,
RA   Silva J.C., Utterback T.R., Feldblyum T.V., Pertea M., Allen J.,
RA   Nierman W.C., Taracha E.L.N., Salzberg S.L., White O.R., Fitzhugh H.A.,
RA   Morzaria S., Venter J.C., Fraser C.M., Nene V.;
RT   "Genome sequence of Theileria parva, a bovine pathogen that transforms
RT   lymphocytes.";
RL   Science 309:134-137(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN31367.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAGK01000004; EAN31367.1; -; Genomic_DNA.
DR   RefSeq; XP_763650.1; XM_758557.1.
DR   AlphaFoldDB; Q4N3G8; -.
DR   EnsemblProtists; EAN31367; EAN31367; TP04_0015.
DR   GeneID; 3501311; -.
DR   KEGG; tpv:TP04_0015; -.
DR   VEuPathDB; PiroplasmaDB:TpMuguga_04g00015; -.
DR   eggNOG; ENOG502SKCI; Eukaryota.
DR   InParanoid; Q4N3G8; -.
DR   Proteomes; UP000001949; Unassembled WGS sequence.
DR   InterPro; IPR007480; DUF529.
DR   InterPro; IPR011695; Tash_PEST_motif.
DR   Pfam; PF04385; FAINT; 1.
DR   Pfam; PF07708; Tash_PEST; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001949};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..508
FT                   /note="Theileria-specific sub-telomeric protein, SVSP
FT                   family"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004240746"
FT   REGION          138..307
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        202..219
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        232..246
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        258..302
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   508 AA;  58807 MW;  83398ED462F47610 CRC64;
     MHRCLAYIYI LIFIIIQCVE SSDKYPYHPE DNGNDEGSDF NLIIKGIENL LDEDDDQTAG
     QTVISDNIMQ HGLGSITDQT YGTAQPSQLI PQSSQVQSGY YQQHLQPVPI PPQHIPEQIY
     PLTQQIHYEY HPQRPIYRPP YLQTQPRPQP IQQAPQYGPI YHYQPPEIRQ PRQPYPRPAG
     PSYYSRRETP GNKYFGQRYH PYEQSHSDSG HTPHTNTGSG LLHPDPYRLP HPSQIIQQTQ
     PPQASKEPLQ PERVTVEVGS DEDDEETEEI GQEPSEPEQP EEAEEGAVGG AGDGDDEEEE
     EEKPSEAVKK CKKIRFFKKN SEGNIIPMIK KDFKRIHDDD KLKKYLLYAN LEVLLCDGDV
     VYKHNAGINY PTQLSYNKVK NIFIFTRRGG FLLIKYSEGE WRVEGRRDQE YLKFYSKTHW
     GKYVEITCND YYTELSAAKA FKYTFREGVN CEKVTYKGET IWRRKKFKAS PESVRFTPKG
     NVIIYFSSYL IVFGKRQGSF KQILARPR
//