GenomeNet

Database: UniProt
Entry: Q4N2C4_THEPA
LinkDB: Q4N2C4_THEPA
Original site: Q4N2C4_THEPA 
ID   Q4N2C4_THEPA            Unreviewed;       877 AA.
AC   Q4N2C4;
DT   02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT   02-AUG-2005, sequence version 1.
DT   24-JAN-2024, entry version 43.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAN31779.1};
GN   OrderedLocusNames=TP04_0427 {ECO:0000313|EMBL:EAN31779.1};
OS   Theileria parva (East coast fever infection agent).
OC   Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC   Theileriidae; Theileria.
OX   NCBI_TaxID=5875 {ECO:0000313|EMBL:EAN31779.1, ECO:0000313|Proteomes:UP000001949};
RN   [1] {ECO:0000313|EMBL:EAN31779.1, ECO:0000313|Proteomes:UP000001949}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Muguga {ECO:0000313|EMBL:EAN31779.1,
RC   ECO:0000313|Proteomes:UP000001949};
RX   PubMed=15994558; DOI=10.1126/science.1110439;
RA   Gardner M.J., Bishop R., Shah T., de Villiers E.P., Carlton J.M., Hall N.,
RA   Ren Q., Paulsen I.T., Pain A., Berriman M., Wilson R.J.M., Sato S.,
RA   Ralph S.A., Mann D.J., Xiong Z., Shallom S.J., Weidman J., Jiang L.,
RA   Lynn J., Weaver B., Shoaibi A., Domingo A.R., Wasawo D., Crabtree J.,
RA   Wortman J.R., Haas B., Angiuoli S.V., Creasy T.H., Lu C., Suh B.,
RA   Silva J.C., Utterback T.R., Feldblyum T.V., Pertea M., Allen J.,
RA   Nierman W.C., Taracha E.L.N., Salzberg S.L., White O.R., Fitzhugh H.A.,
RA   Morzaria S., Venter J.C., Fraser C.M., Nene V.;
RT   "Genome sequence of Theileria parva, a bovine pathogen that transforms
RT   lymphocytes.";
RL   Science 309:134-137(2005).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EAN31779.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAGK01000004; EAN31779.1; -; Genomic_DNA.
DR   RefSeq; XP_764062.1; XM_758969.1.
DR   AlphaFoldDB; Q4N2C4; -.
DR   STRING; 5875.Q4N2C4; -.
DR   EnsemblProtists; EAN31779; EAN31779; TP04_0427.
DR   GeneID; 3500559; -.
DR   KEGG; tpv:TP04_0427; -.
DR   VEuPathDB; PiroplasmaDB:TpMuguga_04g00427; -.
DR   eggNOG; ENOG502QXJG; Eukaryota.
DR   InParanoid; Q4N2C4; -.
DR   OMA; ESACQEP; -.
DR   Proteomes; UP000001949; Unassembled WGS sequence.
DR   InterPro; IPR007480; DUF529.
DR   Pfam; PF04385; FAINT; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001949};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..877
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004241423"
FT   REGION          609..640
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        609..632
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   877 AA;  101730 MW;  5148FAB545E271EF CRC64;
     MKRFSVFGLA VVFLSLGFVN SELEGYRLSL DDPKEDVFKV KTTQFDNVSY KHYAPLKHKY
     AVTVFESENK VWESDGFGTC VYALLAYRKQ LPYLLYFQVD SGHTLFNLYY VKEENQWKKI
     DIVDYCNLLK EVKNEVRVNG IFGFNLFNAV NNSMVSVENE SEEAPYTQYT SAVGYFASHV
     YFKNQKLWNA KKEGERCSAA FCYSHLNEYY MIRLLITNAD EESKLETYFS LKNKWVHVHT
     LEEVHDKKLY VQKPESFKYP NYTFTCKPLS VSRVETFPME VQEPDHLTEP YFDLTDEFYK
     YKVQNQDTDN TFGFHDHVHS YDYVDYVESA CQEPYYDLTN EFYKFKLQTQ DTDDTFDSRD
     LSTKLDISLT AINLFTQLRC VDNVFYKLYN PTSDTVVRTV VDCKTEVWKA KSESERCVHV
     MAPQKEGQHI ALYLLLELGN ELKSLFFEKS EGEWKRTNLE HYKNIIDNLK KNIATDDITI
     NLNSKIESSL YSVEYKLGKT PCILYLPRFG ATCHRVVDGV QTIWEPLNPF DRCVAVWAYD
     SWGKSYMVKL LVQLSDDTCD VLSYINVGNK WDLFNSFKRS FANKLPFKKP VDFYNPEYLF
     NQDIRPYPQH THSHSASETT THSVTPGETT GSKVGEKTEP DYVNTKLSEA LDNLSLHSDD
     STHSVTDSVT HSVTHSVTHS VTNSVDYPKD VYNIYLDSDD KDETNVPGGA EVKSPVPNLT
     DRAVQNAVVD KICNRKAVSQ KYKIYYRPIS ASDCVYLDLS SPMNMSSYEV VRAEFDGRLS
     LLTYRALPGK YFSAVYEGHN VIFHGDHTTH VCYEFNVYLK NKASYLADAL VRVKNEDVTL
     HLRKNKDNWE YIDQETFMNS VNVLRISQAP FKFVETR
//
DBGET integrated database retrieval system