ID A5KAU1_PLAVS Unreviewed; 487 AA.
AC A5KAU1;
DT 10-JUL-2007, integrated into UniProtKB/TrEMBL.
DT 10-JUL-2007, sequence version 1.
DT 24-JAN-2024, entry version 71.
DE SubName: Full=Merozoite surface protein 8, putative {ECO:0000313|EMBL:EDL43458.1};
GN ORFNames=PVX_097625 {ECO:0000313|EMBL:EDL43458.1};
OS Plasmodium vivax (strain Salvador I).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Plasmodium).
OX NCBI_TaxID=126793 {ECO:0000313|EMBL:EDL43458.1, ECO:0000313|Proteomes:UP000008333};
RN [1] {ECO:0000313|EMBL:EDL43458.1, ECO:0000313|Proteomes:UP000008333}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Salvador I {ECO:0000313|EMBL:EDL43458.1,
RC ECO:0000313|Proteomes:UP000008333};
RX PubMed=18843361; DOI=10.1038/nature07327;
RA Carlton J.M., Adams J.H., Silva J.C., Bidwell S.L., Lorenzi H., Caler E.,
RA Crabtree J., Angiuoli S.V., Merino E.F., Amedeo P., Cheng Q., Coulson R.M.,
RA Crabb B.S., Del Portillo H.A., Essien K., Feldblyum T.V.,
RA Fernandez-Becerra C., Gilson P.R., Gueye A.H., Guo X., Kang'a S.,
RA Kooij T.W., Korsinczky M., Meyer E.V., Nene V., Paulsen I., White O.,
RA Ralph S.A., Ren Q., Sargeant T.J., Salzberg S.L., Stoeckert C.J.,
RA Sullivan S.A., Yamamoto M.M., Hoffman S.L., Wortman J.R., Gardner M.J.,
RA Galinski M.R., Barnwell J.W., Fraser-Liggett C.M.;
RT "Comparative genomics of the neglected human malaria parasite Plasmodium
RT vivax.";
RL Nature 455:757-763(2008).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDL43458.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKM01000016; EDL43458.1; -; Genomic_DNA.
DR RefSeq; XP_001613185.1; XM_001613135.1.
DR AlphaFoldDB; A5KAU1; -.
DR STRING; 126793.A5KAU1; -.
DR EnsemblProtists; EDL43458; EDL43458; PVX_097625.
DR GeneID; 5472441; -.
DR KEGG; pvx:PVX_097625; -.
DR VEuPathDB; PlasmoDB:PVX_097625; -.
DR InParanoid; A5KAU1; -.
DR OMA; CEHKKCP; -.
DR PhylomeDB; A5KAU1; -.
DR Proteomes; UP000008333; Chromosome 10.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR024730; MSP1_EGF_1.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF12946; EGF_MSP1_1; 1.
DR SMART; SM00181; EGF; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
PE 4: Predicted;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Merozoite {ECO:0000313|EMBL:EDL43458.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000008333};
KW Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..487
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002685322"
FT TRANSMEM 465..486
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 382..421
FT /note="EGF-like"
FT /evidence="ECO:0000259|SMART:SM00181"
FT DOMAIN 427..464
FT /note="EGF-like"
FT /evidence="ECO:0000259|SMART:SM00181"
FT REGION 30..130
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 30..72
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 73..94
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..123
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 487 AA; 54741 MW; 82AFDF89097248C2 CRC64;
MRKNAQIVIF CLFGLLSYRC GAEGNVSPPN FNDNRVNGNN GNKGNGNDND VPSFIGGNNN
NVNGNNDDNI FNKNGKDVTR NDGDAKDGEN RNNKKNENGS GSNENNSIAN ADNGSGKSDA
NANQIDEDGN KMDEASLKKI LKIVDEMENI QGLLDGDYSI LDKYSVKLVD EDDGETNKRK
IIGEYDLKML KNILLFREKI SRVCENKYNK NLPVLLKKCS NVDDPKLSKS REKIKKGLAK
NNMSIEDFVV GLLEDLFEKI NEHFIKDDSF DLSDYLADFE LINYIIMHET SELIDELLNI
IESMNFRLES GSLEKMVKSA ESGMNLNCKM KEDIIHLLKK SSAKFFKIEI DRKTKMIYPV
QATHKGANMK QLALSFLQKN NVCEHKKCPL NSNCYVINGE EVCRCLPGFS DVKIDNVMNC
VRDDTLDCSN NNGGCDVNAT CTLIDKKIVC ECKDNFEGDG IYCSYSIFNS INNFIFLILL
LLCLYLF
//