ID A0A024VQI6_PLAFA Unreviewed; 457 AA.
AC A0A024VQI6;
DT 09-JUL-2014, integrated into UniProtKB/TrEMBL.
DT 09-JUL-2014, sequence version 1.
DT 13-SEP-2023, entry version 21.
DE RecName: Full=Merozoite surface protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=PFFCH_02015 {ECO:0000313|EMBL:ETW30558.1};
OS Plasmodium falciparum FCH/4.
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Laverania).
OX NCBI_TaxID=1036724 {ECO:0000313|EMBL:ETW30558.1, ECO:0000313|Proteomes:UP000030656};
RN [1] {ECO:0000313|EMBL:ETW30558.1, ECO:0000313|Proteomes:UP000030656}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FCH/4 {ECO:0000313|EMBL:ETW30558.1,
RC ECO:0000313|Proteomes:UP000030656};
RG The Broad Institute Genome Sequencing Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Neafsey D., Hoffman S., Volkman S., Rosenthal P., Walker B., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W.,
RA Alvarado L., Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J.,
RA Goldberg J., Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A.,
RA Ireland A., Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W.,
RA Priest M., Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J.,
RA Nusbaum C., Birren B.;
RT "The Genome Annotation of Plasmodium falciparum FCH/4.";
RL Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ETW30558.1, ECO:0000313|Proteomes:UP000030656}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FCH/4 {ECO:0000313|EMBL:ETW30558.1,
RC ECO:0000313|Proteomes:UP000030656};
RG The Broad Institute Genome Sequencing Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Neafsey D., Cheeseman I., Volkman S., Adams J., Walker B., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Dewar J., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Howarth C., Imamovic A., Larimer J., McCowan C.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Plasmodium falciparum FCH/4.";
RL Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI927903; ETW30558.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A024VQI6; -.
DR EnsemblProtists; ETW30558; ETW30558; PFFCH_02015.
DR Proteomes; UP000030656; Unassembled WGS sequence.
DR InterPro; IPR010784; Merozoite_SPAM.
DR Pfam; PF07133; Merozoite_SPAM; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000030656};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..457
FT /note="Merozoite surface protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001539498"
FT REGION 216..288
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 327..385
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 216..235
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..252
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..285
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 339..378
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 457 AA; 52764 MW; A24EF183153D9840 CRC64;
MLNIFNIIFL LFLINIYICE ANGTLSENIE SAEEIDALKT NLRNGYLNNT YFNEENNNLN
IENEINNTNY NEVTEETKEE LYDINENIFP DYFFLDIFTE NKEQKNEEVP MKIEVVNDGE
EVKTEYVSEK NEEKNNNNSL DTNVTKNTVI DNSNKFQSIE DNNVYNKGIF VGTGIKLNDS
QTTSDNYKNE RYQIDDEKLK YGGSFDTIFS GFVNLLTPSS PTQNDGSTGR NVPPPSEPNV
DTPDPPTAPA PVKVPEDAKL SSSPRPEGPR ANNRNENNQN TDPYNHYFAW EIGGGAPTYK
PENNKNDNIL LEHVKITSWD KEDIIKENED TKREVQETED TDETEDTDET EETEDMEDEN
EIVEDQLQEN EDDEDNVNLE DINKNTRNDI FEEQIKLDST QDDKAQKLIS NEYKKTEEKK
SLEDHVNLLF NFLQTNNQLD PSLKDLENEL TFFLNNY
//