GenomeNet

Database: UniProt
Entry: Q4N392_THEPA
LinkDB: Q4N392_THEPA
Original site: Q4N392_THEPA 
ID   Q4N392_THEPA            Unreviewed;       222 AA.
AC   Q4N392;
DT   02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT   02-AUG-2005, sequence version 1.
DT   28-JUN-2023, entry version 39.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAN31447.1};
GN   OrderedLocusNames=TP04_0095 {ECO:0000313|EMBL:EAN31447.1};
OS   Theileria parva (East coast fever infection agent).
OC   Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC   Theileriidae; Theileria.
OX   NCBI_TaxID=5875 {ECO:0000313|EMBL:EAN31447.1, ECO:0000313|Proteomes:UP000001949};
RN   [1] {ECO:0000313|EMBL:EAN31447.1, ECO:0000313|Proteomes:UP000001949}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Muguga {ECO:0000313|EMBL:EAN31447.1,
RC   ECO:0000313|Proteomes:UP000001949};
RX   PubMed=15994558; DOI=10.1126/science.1110439;
RA   Gardner M.J., Bishop R., Shah T., de Villiers E.P., Carlton J.M., Hall N.,
RA   Ren Q., Paulsen I.T., Pain A., Berriman M., Wilson R.J.M., Sato S.,
RA   Ralph S.A., Mann D.J., Xiong Z., Shallom S.J., Weidman J., Jiang L.,
RA   Lynn J., Weaver B., Shoaibi A., Domingo A.R., Wasawo D., Crabtree J.,
RA   Wortman J.R., Haas B., Angiuoli S.V., Creasy T.H., Lu C., Suh B.,
RA   Silva J.C., Utterback T.R., Feldblyum T.V., Pertea M., Allen J.,
RA   Nierman W.C., Taracha E.L.N., Salzberg S.L., White O.R., Fitzhugh H.A.,
RA   Morzaria S., Venter J.C., Fraser C.M., Nene V.;
RT   "Genome sequence of Theileria parva, a bovine pathogen that transforms
RT   lymphocytes.";
RL   Science 309:134-137(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN31447.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAGK01000004; EAN31447.1; -; Genomic_DNA.
DR   RefSeq; XP_763730.1; XM_758637.1.
DR   AlphaFoldDB; Q4N392; -.
DR   EnsemblProtists; EAN31447; EAN31447; TP04_0095.
DR   GeneID; 3500887; -.
DR   KEGG; tpv:TP04_0095; -.
DR   VEuPathDB; PiroplasmaDB:TpMuguga_04g00095; -.
DR   InParanoid; Q4N392; -.
DR   Proteomes; UP000001949; Unassembled WGS sequence.
DR   InterPro; IPR007480; DUF529.
DR   Pfam; PF04385; FAINT; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001949};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..222
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004241473"
FT   REGION          22..66
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          153..222
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        24..66
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        157..196
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        197..222
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   222 AA;  23714 MW;  2918BA3DF89C1724 CRC64;
     MKIVKLILLV VLLCPYKSAH GADGVDKSQS SGANSLRASP GTQASTPSSG TTTPSAKSSG
     TNLKPVTLDI TKTRSTNDCL YSDYGNYATF NPKPGHGFSK IVKRNKAVWT AKDDYATGVF
     LKDLGDKKKE LSISLGDKVI ILKKEGRKKP WIDITSKRNG ENPNANDGSA TVDPSNASST
     GVSHGINLSV HSSDRRPTTD PSGRRPAEDS SARRPPEDST RR
//
DBGET integrated database retrieval system