ID Q4N2C4_THEPA Unreviewed; 877 AA.
AC Q4N2C4;
DT 02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT 02-AUG-2005, sequence version 1.
DT 24-JAN-2024, entry version 43.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAN31779.1};
GN OrderedLocusNames=TP04_0427 {ECO:0000313|EMBL:EAN31779.1};
OS Theileria parva (East coast fever infection agent).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC Theileriidae; Theileria.
OX NCBI_TaxID=5875 {ECO:0000313|EMBL:EAN31779.1, ECO:0000313|Proteomes:UP000001949};
RN [1] {ECO:0000313|EMBL:EAN31779.1, ECO:0000313|Proteomes:UP000001949}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Muguga {ECO:0000313|EMBL:EAN31779.1,
RC ECO:0000313|Proteomes:UP000001949};
RX PubMed=15994558; DOI=10.1126/science.1110439;
RA Gardner M.J., Bishop R., Shah T., de Villiers E.P., Carlton J.M., Hall N.,
RA Ren Q., Paulsen I.T., Pain A., Berriman M., Wilson R.J.M., Sato S.,
RA Ralph S.A., Mann D.J., Xiong Z., Shallom S.J., Weidman J., Jiang L.,
RA Lynn J., Weaver B., Shoaibi A., Domingo A.R., Wasawo D., Crabtree J.,
RA Wortman J.R., Haas B., Angiuoli S.V., Creasy T.H., Lu C., Suh B.,
RA Silva J.C., Utterback T.R., Feldblyum T.V., Pertea M., Allen J.,
RA Nierman W.C., Taracha E.L.N., Salzberg S.L., White O.R., Fitzhugh H.A.,
RA Morzaria S., Venter J.C., Fraser C.M., Nene V.;
RT "Genome sequence of Theileria parva, a bovine pathogen that transforms
RT lymphocytes.";
RL Science 309:134-137(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN31779.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAGK01000004; EAN31779.1; -; Genomic_DNA.
DR RefSeq; XP_764062.1; XM_758969.1.
DR AlphaFoldDB; Q4N2C4; -.
DR STRING; 5875.Q4N2C4; -.
DR EnsemblProtists; EAN31779; EAN31779; TP04_0427.
DR GeneID; 3500559; -.
DR KEGG; tpv:TP04_0427; -.
DR VEuPathDB; PiroplasmaDB:TpMuguga_04g00427; -.
DR eggNOG; ENOG502QXJG; Eukaryota.
DR InParanoid; Q4N2C4; -.
DR OMA; ESACQEP; -.
DR Proteomes; UP000001949; Unassembled WGS sequence.
DR InterPro; IPR007480; DUF529.
DR Pfam; PF04385; FAINT; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001949};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..877
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004241423"
FT REGION 609..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 609..632
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 877 AA; 101730 MW; 5148FAB545E271EF CRC64;
MKRFSVFGLA VVFLSLGFVN SELEGYRLSL DDPKEDVFKV KTTQFDNVSY KHYAPLKHKY
AVTVFESENK VWESDGFGTC VYALLAYRKQ LPYLLYFQVD SGHTLFNLYY VKEENQWKKI
DIVDYCNLLK EVKNEVRVNG IFGFNLFNAV NNSMVSVENE SEEAPYTQYT SAVGYFASHV
YFKNQKLWNA KKEGERCSAA FCYSHLNEYY MIRLLITNAD EESKLETYFS LKNKWVHVHT
LEEVHDKKLY VQKPESFKYP NYTFTCKPLS VSRVETFPME VQEPDHLTEP YFDLTDEFYK
YKVQNQDTDN TFGFHDHVHS YDYVDYVESA CQEPYYDLTN EFYKFKLQTQ DTDDTFDSRD
LSTKLDISLT AINLFTQLRC VDNVFYKLYN PTSDTVVRTV VDCKTEVWKA KSESERCVHV
MAPQKEGQHI ALYLLLELGN ELKSLFFEKS EGEWKRTNLE HYKNIIDNLK KNIATDDITI
NLNSKIESSL YSVEYKLGKT PCILYLPRFG ATCHRVVDGV QTIWEPLNPF DRCVAVWAYD
SWGKSYMVKL LVQLSDDTCD VLSYINVGNK WDLFNSFKRS FANKLPFKKP VDFYNPEYLF
NQDIRPYPQH THSHSASETT THSVTPGETT GSKVGEKTEP DYVNTKLSEA LDNLSLHSDD
STHSVTDSVT HSVTHSVTHS VTNSVDYPKD VYNIYLDSDD KDETNVPGGA EVKSPVPNLT
DRAVQNAVVD KICNRKAVSQ KYKIYYRPIS ASDCVYLDLS SPMNMSSYEV VRAEFDGRLS
LLTYRALPGK YFSAVYEGHN VIFHGDHTTH VCYEFNVYLK NKASYLADAL VRVKNEDVTL
HLRKNKDNWE YIDQETFMNS VNVLRISQAP FKFVETR
//