GenomeNet

Database: UniProt
Entry: Q4N419_THEPA
LinkDB: Q4N419_THEPA
Original site: Q4N419_THEPA 
ID   Q4N419_THEPA            Unreviewed;       360 AA.
AC   Q4N419;
DT   02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT   02-AUG-2005, sequence version 1.
DT   28-JUN-2023, entry version 41.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAN33104.1};
GN   OrderedLocusNames=TP02_0819 {ECO:0000313|EMBL:EAN33104.1};
OS   Theileria parva (East coast fever infection agent).
OC   Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC   Theileriidae; Theileria.
OX   NCBI_TaxID=5875 {ECO:0000313|EMBL:EAN33104.1, ECO:0000313|Proteomes:UP000001949};
RN   [1] {ECO:0000313|EMBL:EAN33104.1, ECO:0000313|Proteomes:UP000001949}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Muguga {ECO:0000313|EMBL:EAN33104.1,
RC   ECO:0000313|Proteomes:UP000001949};
RX   PubMed=15994558; DOI=10.1126/science.1110439;
RA   Gardner M.J., Bishop R., Shah T., de Villiers E.P., Carlton J.M., Hall N.,
RA   Ren Q., Paulsen I.T., Pain A., Berriman M., Wilson R.J.M., Sato S.,
RA   Ralph S.A., Mann D.J., Xiong Z., Shallom S.J., Weidman J., Jiang L.,
RA   Lynn J., Weaver B., Shoaibi A., Domingo A.R., Wasawo D., Crabtree J.,
RA   Wortman J.R., Haas B., Angiuoli S.V., Creasy T.H., Lu C., Suh B.,
RA   Silva J.C., Utterback T.R., Feldblyum T.V., Pertea M., Allen J.,
RA   Nierman W.C., Taracha E.L.N., Salzberg S.L., White O.R., Fitzhugh H.A.,
RA   Morzaria S., Venter J.C., Fraser C.M., Nene V.;
RT   "Genome sequence of Theileria parva, a bovine pathogen that transforms
RT   lymphocytes.";
RL   Science 309:134-137(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN33104.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAGK01000002; EAN33104.1; -; Genomic_DNA.
DR   RefSeq; XP_765387.1; XM_760294.1.
DR   AlphaFoldDB; Q4N419; -.
DR   EnsemblProtists; EAN33104; EAN33104; TP02_0819.
DR   GeneID; 3501443; -.
DR   KEGG; tpv:TP02_0819; -.
DR   VEuPathDB; PiroplasmaDB:TpMuguga_02g00819; -.
DR   InParanoid; Q4N419; -.
DR   Proteomes; UP000001949; Unassembled WGS sequence.
DR   InterPro; IPR007480; DUF529.
DR   Pfam; PF04385; FAINT; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001949};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..360
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004241076"
FT   REGION          41..124
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        41..72
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        73..90
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        91..106
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   360 AA;  41236 MW;  24F3C17C5A9B5016 CRC64;
     MRRLSVSKWV LLCLIFTLVC GAQTGIESKV SSDNEDQLLY QSDSNLESNT ENSTEGDVLN
     SDDATGSNSQ DTQKSEDSEK QPEDTVERSE QQSQDLGSTV TTFKPTPSDS DDDESEFFDA
     LDDPKTSFPQ PTIFKTVHLD LDVSRDTYDF YYTKDVPWST YTPKLGRVFD KVEISVPYVS
     PMLIWEPKLP SECSTKVKVF STDSMMLFID ISILGGGRKV FGRRPMGTWR DITFPPDLKL
     YTLDKDGKEV ELDPSNYTLK LDIGYKSEYT FDFKSDANCT EIRFKNDIVW KYNYTYCISG
     GLYKISTLHP TSIKYTIRLF LGEFNLTYSE THRFKVFFSD GSYSDFFIGD KHWDQVNDQS
//
DBGET integrated database retrieval system