ID A5K7S5_PLAVS Unreviewed; 2061 AA.
AC A5K7S5;
DT 10-JUL-2007, integrated into UniProtKB/TrEMBL.
DT 10-JUL-2007, sequence version 1.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=CTRP adhesive protein (Invasive stage), putative {ECO:0000313|EMBL:EDL44834.1};
GN ORFNames=PVX_095475 {ECO:0000313|EMBL:EDL44834.1};
OS Plasmodium vivax (strain Salvador I).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Plasmodium).
OX NCBI_TaxID=126793 {ECO:0000313|EMBL:EDL44834.1, ECO:0000313|Proteomes:UP000008333};
RN [1] {ECO:0000313|EMBL:EDL44834.1, ECO:0000313|Proteomes:UP000008333}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Salvador I {ECO:0000313|EMBL:EDL44834.1,
RC ECO:0000313|Proteomes:UP000008333};
RX PubMed=18843361; DOI=10.1038/nature07327;
RA Carlton J.M., Adams J.H., Silva J.C., Bidwell S.L., Lorenzi H., Caler E.,
RA Crabtree J., Angiuoli S.V., Merino E.F., Amedeo P., Cheng Q., Coulson R.M.,
RA Crabb B.S., Del Portillo H.A., Essien K., Feldblyum T.V.,
RA Fernandez-Becerra C., Gilson P.R., Gueye A.H., Guo X., Kang'a S.,
RA Kooij T.W., Korsinczky M., Meyer E.V., Nene V., Paulsen I., White O.,
RA Ralph S.A., Ren Q., Sargeant T.J., Salzberg S.L., Stoeckert C.J.,
RA Sullivan S.A., Yamamoto M.M., Hoffman S.L., Wortman J.R., Gardner M.J.,
RA Galinski M.R., Barnwell J.W., Fraser-Liggett C.M.;
RT "Comparative genomics of the neglected human malaria parasite Plasmodium
RT vivax.";
RL Nature 455:757-763(2008).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDL44834.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKM01000008; EDL44834.1; -; Genomic_DNA.
DR RefSeq; XP_001614561.1; XM_001614511.1.
DR STRING; 126793.A5K7S5; -.
DR EnsemblProtists; EDL44834; EDL44834; PVX_095475.
DR GeneID; 5473850; -.
DR KEGG; pvx:PVX_095475; -.
DR InParanoid; A5K7S5; -.
DR OMA; DALCNEW; -.
DR Proteomes; UP000008333; Chromosome 8.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW.
DR CDD; cd01473; vWA_CTRP; 4.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 4.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 6.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00090; TSP_1; 3.
DR Pfam; PF00092; VWA; 6.
DR SMART; SM00209; TSP1; 5.
DR SMART; SM00327; VWA; 6.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 3.
DR SUPFAM; SSF53300; vWA-like; 6.
DR PROSITE; PS50092; TSP1; 4.
DR PROSITE; PS50234; VWFA; 6.
PE 4: Predicted;
KW Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Reference proteome {ECO:0000313|Proteomes:UP000008333};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..2061
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002685207"
FT DOMAIN 108..293
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 333..520
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 603..789
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 839..1027
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1066..1259
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1313..1506
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 523..587
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 797..825
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1030..1050
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1964..1990
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 525..587
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 799..825
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1974..1990
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2061 AA; 227261 MW; 384CD3082209F984 CRC64;
MNKSFLLIAS YFCLVVHLGT VIAQRSQDES SHQYVHLAGN GRSVKKATLD KDIPEGRILT
KKQSFLQHNT LAVGSLAAGS LAAGSLAAGS PPPPCVGDDD CFCQNFYDLT LIIDESGSIG
IKNWEKHVIP FTDKIIKDLH IGENEVHAGI LLFSNFIRDY VTFDEDESYK KDKLLKKVDQ
LKKKYAAGAG TKIVSALDYA LEKYTHHKKG RPNAPKVTIL FTDGNDTSSS SSTKLLDMGL
TYRKKNVKLL VLGVAAAKDV NLRAIAGCGD KNVPCPYAMK AEWDTINDIT KKLTNKICHT
EEEDEEITTT TTPPPPQQNP CQGDDCFCED YYDLTLILDE SGSITLNKWK IDVVPFAEKV
VSNLNISKDK IHVGIMRFAI KVKEDVSYGQ ETRYDKSALI NVVKELRDKY GSGQGTRLVD
ALEHSLTNFT RHPNNRPNAP KVTILFTDGN ENYRRPSDVR NIGLKYRKEN VRLIVVGVYK
ATIKSLKMLA GCGENEHCPQ VIKCDWDQLT SITEVITDKI CDIDAGELPG GEEKPGGEEK
PGSSDKPGGE EKPGSSEKPD GTEKPGDADK PNGSDKPNGS DKPNEQHPPC ADWDDCYCKH
FYDLTLILDE SASIGNFRWS HEVVPFATEI VKSLHVGYSA VHVGLLLFAD SRRDVVRFSD
ATRYDKSFLL QKIESLKGDY RNGKKTFIVQ ALVYALASYT KGSSRVNAPK VTMLFTDGND
SRESDERLYQ TGLLYRREKV KLLVLGVSMA DENKLRLLVG CTRNANCPFV IKAEWGQLPS
VSNEFVRRIC SSGPIIPPED GSSSPPLPPP EVANPTDPEP VVPPPQCQGD ECLCQSMYDL
TLILDESASI GHSNWKKQVY PFVEKIVSNL EVSESKVHVG IMLFAKHMRD FVRFSEKESY
EKDSLMRKVP ELKGTYKAGS HTYIVESLEY GLQHYTKGAS SRADVPKVTI LFTDGNNSKS
GDEILSNVNS LYKKENVKLL VVGVGAASMP KLRLLGGCHK TEGDCPFAIK TEWDSLKDIS
QGMVDKICNT DTEINPPPPP PPSSGGAEVA TPSCTGDECF CRDYFDLTFI GAPSTKKNSR
RKDELTKYVT KIVNTFNVGE KHVHVSLSLH LGAKTVNTDF DSAVARDKKE LLSALEVMST
EWANLGRKTN IAEALTVGLK QSFSTGHRED APKVALLLTD SNNDVSEEGM LQSVSKQYAD
RKVKLLVVGI GKLPEELLFI AAGCALSGSG GDPPRYGQTC PTVFISSRFS YINSVEKFLG
RNVCDGGGGG DGGAQQVEPP LPPPSLPCQD DESCTECDDD ACNNDPTCKK IFDIAIVIDQ
SRGITNDQWK KYVRPFSTYV IKSNYLAKNR THVTLVKMGR TAKVLWKLGR KASYRKNQMI
KKVNRMVKTS TSRKDIARNL KHLREEVFTK KTSSSSSSSP APDRRKKLII MLVEGTSHTD
LNELRREVAL LKVNNIDLFV YAIDNVDDEE YRILGDCENA SSGVCKNAVK VTWDKLLSSE
EIHSSYICNQ YPADAECAEW GDWSPCPDSS TSSTCNNLAV SKRERKGPPY TLKEEEYLDD
GRHMHGSSCT DLSSIEYRAC PVTEECNDVC GDFGEWSQCS ASCGDGIRTR QRAGAAGRGD
APLAASFQDD QADSCQLFNA TEVEACNVQD CDDAEICEEV GEWGEWSACS KACGYSTRSR
TFVILPEDMD EYAHCSTFEK SETEVCSVPA CEDEKCFEWE EWTEWSAPCG PRKRAHECEA
YYQNKVERDD GSNGVPCTDN PCGSWSEWSE CDRKCNVGMR MRRFISNVTG FVGDDEDLCL
EYHNEIETEP CLDLPLCDEG ECNDWETWVG CGEEEHPGLR PATSPRGAST SCHASPRRRI
LTRKPELLQH RKESTSKFCS DYKLFREEEC PVLGGATPCN DALCNEWEDW GECSPTCGTE
SYRVRRRKEP LELIPPSEDM DGKMGLTCEQ QNVRIEEREA CSVPACAPPS GGGGSGSGSG
SGNGSDSGGM GTGEKVSLAA GILGLAALGV GGMIYGYNTL NGGEAPHSSN MEFENVETDG
AEVEKSNEDF EVIDANDPMW N
//