GenomeNet

Database: Pfam
Entry: EGF_MSP1_1
LinkDB: EGF_MSP1_1
Original site: EGF_MSP1_1 
#=GF ID   EGF_MSP1_1
#=GF AC   PF12946.7
#=GF DE   MSP1 EGF domain 1
#=GF AU   Bateman A;0000-0002-6982-4660
#=GF SE   Jackhmmer:P04933
#=GF GA   27.00 27.00;
#=GF TC   27.00 27.00;
#=GF NC   26.90 26.90;
#=GF BM   hmmbuild HMM.ann SEED.ann
#=GF SM   hmmsearch -Z 45638612 -E 1000 --cpu 4 HMM pfamseq
#=GF TP   Domain
#=GF RN   [1]
#=GF RM   10339410
#=GF RT   Solution structure of an EGF module pair from the Plasmodium
#=GF RT   falciparum merozoite surface protein 1.
#=GF RA   Morgan WD, Birdsall B, Frenkiel TA, Gradwell MG, Burghaus PA,
#=GF RA   Syed SE, Uthaipibull C, Holder AA, Feeney J;
#=GF RL   J Mol Biol. 1999;289:113-122.
#=GF DR   INTERPRO; IPR024730;
#=GF DR   SO; 0000417; polypeptide_domain;
#=GF CC   This EGF-like domain is found at the C-terminus of the malaria
#=GF CC   parasite MSP1 protein. MSP1 is the merozoite surface protein 1.
#=GF CC   This domain is part of the C-terminal fragment that is
#=GF CC   proteolytically processed from the the rest of the protein and
#=GF CC   is left attached to the surface of the invading parasite.
#=GF SQ   69
#=GS W4IGJ5_PLAFA/1578-1614      AC W4IGJ5.1
#=GS A0A077TMC0_PLACH/1654-1692  AC A0A077TMC0.1
#=GS A7RF84_NEMVE/97-130         AC A7RF84.1
#=GS E4X0M8_OIKDI/324-361        AC E4X0M8.1
#=GS F0JB08_NEOCL/506-543        AC F0JB08.1
#=GS A5K723_PLAVS/1754-1789      AC A5K723.1
#=GS A0A1I8BC42_MELHA/148-179    AC A0A1I8BC42.1
#=GS A0A0B1SAM9_OESDE/43-75      AC A0A0B1SAM9.1
#=GS A0A2G8KYQ3_STIJA/39-74      AC A0A2G8KYQ3.1
#=GS D4AFN2_PLAGO/1607-1643      AC D4AFN2.1
#=GS K6UX91_9APIC/363-402        AC K6UX91.1
#=GS H0Z6C0_TAEGU/169-204        AC H0Z6C0.1
#=GS A0A1B1DXA4_9APIC/1763-1798  AC A0A1B1DXA4.1
#=GS H0Z6C6_TAEGU/172-207        AC H0Z6C6.1
#=GS Q4XPN3_PLACH/203-241        AC Q4XPN3.1
#=GS W4IYZ7_PLAFP/1547-1583      AC W4IYZ7.1
#=GS A7ST24_NEMVE/412-447        AC A7ST24.1
#=GS B9QP44_TOXGV/610-647        AC B9QP44.1
#=GS W7FG83_PLAF8/1574-1610      AC W7FG83.1
#=GS U3K2S4_FICAL/199-234        AC U3K2S4.1
#=GS A0A0Y9WA72_PLABE/1680-1717  AC A0A0Y9WA72.1
#=GS W7AGJ4_9APIC/1859-1894      AC W7AGJ4.1
#=GS A0A1Y1JCZ7_PLAGO/1917-1952  AC A0A1Y1JCZ7.1
#=GS A0A024WQJ1_PLAFA/1601-1637  AC A0A024WQJ1.1
#=GS A0A0D9QFB7_PLAFR/1727-1763  AC A0A0D9QFB7.1
#=GS A7ST24_NEMVE/125-160        AC A7ST24.1
#=GS A0A1A8VYZ0_PLAMA/1611-1647  AC A0A1A8VYZ0.1
#=GS A0A1A8YRE9_9APIC/1565-1601  AC A0A1A8YRE9.1
#=GS A0A024W7K7_PLAFA/740-776    AC A0A024W7K7.1
#=GS Q8I0U8_PLAF7/1612-1648      AC Q8I0U8.1
#=GS W7A8Y0_9APIC/1682-1718      AC W7A8Y0.1
#=GS A0A1J1H3N0_PLARL/1490-1526  AC A0A1J1H3N0.1
#=GS V8P6B4_OPHHA/164-199        AC V8P6B4.1
#=GS W7AGK7_9APIC/382-421        AC W7AGK7.1
#=GS A0A1A8VP40_9APIC/1623-1659  AC A0A1A8VP40.1
#=GS A0A1B1E161_9APIC/373-412    AC A0A1B1E161.1
#=GS A7SBC0_NEMVE/284-319        AC A7SBC0.1
#=GS E4X8F9_OIKDI/457-494        AC E4X8F9.1
#=GS E9G6S4_DAPPU/632-668        AC E9G6S4.1
#=GS A0A1A8WY67_PLAMA/302-342    AC A0A1A8WY67.1
#=GS A0A1B1DWU8_9APIC/1824-1860  AC A0A1B1DWU8.1
#=GS A0A0D9QEU6_PLAFR/1755-1790  AC A0A0D9QEU6.1
#=GS A0A2B4S7D3_STYPI/1140-1176  AC A0A2B4S7D3.1
#=GS A0A0D8Y6J0_DICVI/783-819    AC A0A0D8Y6J0.1
#=GS A0A0L0CYS7_PLAFA/1567-1603  AC A0A0L0CYS7.1
#=GS A5K724_PLAVS/1646-1682      AC A5K724.1
#=GS A0A226NXS7_COLVI/85-120     AC A0A226NXS7.1
#=GS Q4Z273_PLABA/1447-1484      AC Q4Z273.1
#=GS A7RF88_NEMVE/99-136         AC A7RF88.1
#=GS W7JYM8_PLAFA/1610-1646      AC W7JYM8.1
#=GS A7ST24_NEMVE/164-201        AC A7ST24.1
#=GS B3L6Q1_PLAKH/371-410        AC B3L6Q1.1
#=GS W7KFP9_PLAFO/1612-1648      AC W7KFP9.1
#=GS K6URU2_9APIC/1744-1779      AC K6URU2.1
#=GS B3L2X8_PLAKH/1717-1753      AC B3L2X8.1
#=GS MSP1_PLAYO/1662-1700        AC P13828.1
#=GS MSP1_PLAYO/1662-1700        DR PDB; 2MGR A; 7-45;
#=GS MSP1_PLAYO/1662-1700        DR PDB; 2MGP A; 7-45;
#=GS A0A024VR46_PLAFA/1583-1619  AC A0A024VR46.1
#=GS K6UUJ2_9APIC/1692-1728      AC K6UUJ2.1
#=GS A0A0L7KDM6_PLAFX/1519-1555  AC A0A0L7KDM6.1
#=GS A0A0D9QNW7_PLAFR/372-411    AC A0A0D9QNW7.1
#=GS T1KLJ0_TETUR/640-676        AC T1KLJ0.1
#=GS B3L2X7_PLAKH/1772-1807      AC B3L2X7.1
#=GS A0A1A8W1E4_PLAMA/371-413    AC A0A1A8W1E4.1
#=GS G3W254_SARHA/179-215        AC G3W254.1
#=GS A0A0L7LXP3_PLAF4/1589-1625  AC A0A0L7LXP3.1
#=GS A0A024V850_PLAFA/1537-1573  AC A0A024V850.1
#=GS H3ER29_PRIPA/289-325        AC H3ER29.1
#=GS A0A0L1I842_PLAFA/1577-1613  AC A0A0L1I842.1
#=GS A0A077ZBR1_TRITR/1-34       AC A0A077ZBR1.1
W4IGJ5_PLAFA/1578-1614                 .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
A0A077TMC0_PLACH/1654-1692             .............VCIG..TN..IPE.NAGCFRYDDGKEEWRCLLGFKKn.dDGTR..CE...........
A7RF84_NEMVE/97-130                    ..........sva----..-T..CPA.SADCVNND-GSYTCRCKPGYEL...NNNE..CE...........
E4X0M8_OIKDI/324-361                   .............VCAN.aNP..CDA.NAACANNADGSQTCTCNDGYKG...DGSE..C-t..........
F0JB08_NEOCL/506-543                   .......eceaep----..EK..TPA.NATCVNTD-GSFEWSCNAGYEQ...VGSQ..CQ...........
A5K723_PLAVS/1754-1789                 .............NCRN..RK..CPL.NSFCFIQT-INEECLCLLNYSM...VGEK..CI...........
A0A1I8BC42_MELHA/148-179               ............y---G..VD..CDA.NAACTDTS-GSYECHCHPGYSD...VSE-..--t..........
A0A0B1SAM9_OESDE/43-75                 ............i-CKG.kDD..CPA.NSFCFQQL-----CRCMLGYRA...NGGY..CE...........
A0A2G8KYQ3_STIJA/39-74                 .............TCLD..IK..CDE.NAQCTEQD-GVRGCYCNPGFEG...DGIT..C-t..........
D4AFN2_PLAGO/1607-1643                 .............KCID..TV..VPE.NAACYRYLDGREEWRCLLKFKL...DGGK..CV...........
K6UX91_9APIC/363-402                   .............VCEH..KK..CPL.NSNCYVID-GEEVCRCLPGFSD...----..--vkidnvmncv.
H0Z6C0_TAEGU/169-204                   ............l-CKG..VT..CEA.NSECVVRD-GTALCDCKLGYRK...QGST..CQ...........
A0A1B1DXA4_9APIC/1763-1798             .............NCRN..RV..CPS.NSFCFIQR-FSEKCLCFLNYNM...VKGK..CI...........
H0Z6C6_TAEGU/172-207                   ............l-CKG..VT..CEA.NSECVVRD-GTALCDCKLGYRK...QGST..CQ...........
Q4XPN3_PLACH/203-241                   .............VCIG..TN..IPE.NAGCFRYDDGKEEWRCLLGFKKn.dDGTR..CE...........
W4IYZ7_PLAFP/1547-1583                 .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
A7ST24_NEMVE/412-447                   .............ECLK..GP..CPQ.NAKCVNNF-GSYLCICNPGYKK...VNGK..CE...........
B9QP44_TOXGV/610-647                   .......eceaep----..ER..IPP.NATCVNTD-GSFEWSCNAGYEH...VGSQ..CQ...........
W7FG83_PLAF8/1574-1610                 .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
U3K2S4_FICAL/199-234                   ............l-CRG..VT..CEA.NSECVVRD-GAALCDCKLGYTK...QGST..CQ...........
A0A0Y9WA72_PLABE/1680-1717             .............VCIN.tRD..IPA.NAGCFRYDNGNEEWRCLLGYKK...NNNT..CI...........
W7AGJ4_9APIC/1859-1894                 .............NCGN..RK..CPP.NSFCFIHA-FNEECFCLLNYNM...VGGK..CI...........
A0A1Y1JCZ7_PLAGO/1917-1952             .............NCSN..KR..CPP.NSFCFIDE-YDEECFCFLNYTM...IGKN..CI...........
A0A024WQJ1_PLAFA/1601-1637             .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
A0A0D9QFB7_PLAFR/1727-1763             .............KCTD..TV..VPD.NAACYRYLDGTEEWRCLLTFKE...VSGK..CV...........
A7ST24_NEMVE/125-160                   .............ECES..VS..CPA.NADCINSA-GSYECRCRLGFSK...NGSE..CQ...........
A0A1A8VYZ0_PLAMA/1611-1647             ............a-CTE..TK..YPE.NAGCYRYEDGKEVWRCLLNYKL...VDGE..CV...........
A0A1A8YRE9_9APIC/1565-1601             .............KCID..IT..YPD.NAGCYRFPDGREEWRCLLNFKK...VGET..CV...........
A0A024W7K7_PLAFA/740-776               .............QCVK..KQ..CPQ.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
Q8I0U8_PLAF7/1612-1648                 .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
W7A8Y0_9APIC/1682-1718                 .............KCID..TT..VPD.NAACYRYLDGREEWRCLLNFKK...EGDK..CV...........
A0A1J1H3N0_PLARL/1490-1526             ............i-CIN..TV..CPI.NSGCIRRLNGKEECRCLLNFKK...EGMM..CV...........
V8P6B4_OPHHA/164-199                   ..........inp----..HS..CDA.NAECIKSGPGTHECVCQPGWTG...NGRD..C-s..........
W7AGK7_9APIC/382-421                   .............VCEH..KK..CPL.NSNCYVID-GEEVCRCLPGFSD...----..--vkidnvmncv.
A0A1A8VP40_9APIC/1623-1659             .............KCID..IT..YPD.NAGCYRFSDGREEWRCLLNFKK...VGET..CV...........
A0A1B1E161_9APIC/373-412               .............VCEH..KK..CPL.NSNCYVID-GEEVCRCLPGFSD...----..--vkidnvmncv.
A7SBC0_NEMVE/284-319                   .............ECQN..NP..CPL.NSICSNTI-GSYDCACQPNYVK...SGER..CI...........
E4X8F9_OIKDI/457-494                   .............VCAS.aSP..CDA.NASCTNNADGSQSCECNFDFKG...DGYT..C-t..........
E9G6S4_DAPPU/632-668                   ............p-CSR..VR..CPV.HSQCIENNQGHPECRCLAGYQE...TESQ..CQ...........
A0A1A8WY67_PLAMA/302-342               ............i-CEY..SK..CGA.NARCYIVDKDKEECRCRANYVQ...DT--..--svdyfkci...
A0A1B1DWU8_9APIC/1824-1860             .............KCID..TT..VPE.NAACYRYLDGTEEWRCLLNFKE...LEGK..CI...........
A0A0D9QEU6_PLAFR/1755-1790             .............NCRN..RN..CPP.NSFCFIQR-FNEECLCFLHYNM...VEGK..CI...........
A0A2B4S7D3_STYPI/1140-1176             ............s-CNG..VD..CDA.HAQCVQPLDGPPNCACIKGWTG...DGQT..C-a..........
A0A0D8Y6J0_DICVI/783-819               .............TCGQ..HV..CDM.NAECMPSSSGGTECVCKLGYVG...NGVT..CE...........
A0A0L0CYS7_PLAFA/1567-1603             .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
A5K724_PLAVS/1646-1682                 .............TCID..TN..VPD.NAACYRYLDGTEEWRCLLTFKE...EGGK..CV...........
A0A226NXS7_COLVI/85-120                ............l-CRG..VS..CES.NSECVVKD-GTAMCNCKLGYKK...SGSI..CI...........
Q4Z273_PLABA/1447-1484                 .............VCIN.tRD..IPA.NAGCFRYDNGNEEWRCLLGYKK...NNNT..CI...........
A7RF88_NEMVE/99-136                    .......ecalgv----..AT..CPA.SADCVNND-GSYTCRCKRGYTL...NNNT..C-t..........
W7JYM8_PLAFA/1610-1646                 .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
A7ST24_NEMVE/164-201                   .............TCGG..VA..CDA.HAQCVQHADGRRECVCNAGWVQ...DAGR.aCL...........
B3L6Q1_PLAKH/371-410                   .............VCEH..KK..CPL.NSNCYVID-GEEVCRCLPGFSD...----..--vkidnvmncv.
W7KFP9_PLAFO/1612-1648                 .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
K6URU2_9APIC/1744-1779                 ............d-CRN..RK..CPS.NSFCFIQT-FNENCLCFLNYNM...VGEK..CI...........
B3L2X8_PLAKH/1717-1753                 .............KCID..TN..VPE.NAACYRYLDGTEEWRCLLGFKE...VGGK..CV...........
MSP1_PLAYO/1662-1700                   .............VCVD.tRD..IPK.NAGCFRDDNGTEEWRCLLGYKK..gEGNT..CV...........
#=GR MSP1_PLAYO/1662-1700        SS    .............--S-.-SS..--T.TEEEEE-TTS-EEEEE-TTEEE..-SSSS..EE...........
A0A024VR46_PLAFA/1583-1619             .............QCVK..KQ..CPQ.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
K6UUJ2_9APIC/1692-1728                 ............r-CID..TN..VPE.NAACYRYLDGTEEWRCLLYFKE...DAGK..CV...........
A0A0L7KDM6_PLAFX/1519-1555             .............QCVK..KQ..CPQ.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
A0A0D9QNW7_PLAFR/372-411               .............VCEH..KK..CPL.NSNCYVID-GEEVCRCLPGFSD...----..--vkidnvmncv.
T1KLJ0_TETUR/640-676                   ............m-CGD..KF..CDP.NADCLPSTDGSKKCICLPGFTG...NGII..C-r..........
B3L2X7_PLAKH/1772-1807                 .............NCRN..RK..CPP.NSFCFIET-FNEECLCFLNYNM...VGGK..CI...........
A0A1A8W1E4_PLAMA/371-413               .............VCEH..KK..CPL.NSNCYVIN-GEETCRCLPGYSDvklDNEM.nC-vrdd.......
G3W254_SARHA/179-215                   ............a-CKD..LA..CPQ.NSQCVNTDTGAPSCKCLPGYRL...QGTK..CQ...........
A0A0L7LXP3_PLAF4/1589-1625             .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
A0A024V850_PLAFA/1537-1573             .............QCVK..KQ..CPQ.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
H3ER29_PRIPA/289-325                   ............e-CLI.hGI..CPA.FSNCTNTP-GSYECTCITGFKK...ENGK..CV...........
A0A0L1I842_PLAFA/1577-1613             .............QCVK..KQ..CPE.NSGCFRHLDEREECKCLLNYKQ...EGDK..CV...........
A0A077ZBR1_TRITR/1-34                  ...........mn----..IT..CDQ.NAHCVSSY-GKHRCVCKPGFVG...NGQT..C-r..........
#=GC SS_cons                           .............--S-.-SS..--T.TEEEEE-TTS-EEEEE-TTEEE..-SSSS..EE...........
#=GC seq_cons                          ..............Chp..pp..CPt.NusCapp..GpEEC+ClLsYcp...pGsp..C............
//
DBGET integrated database retrieval system