ID A0A2C9KAN0_BIOGL Unreviewed; 4449 AA.
AC A0A2C9KAN0;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN Name=106063406 {ECO:0000313|EnsemblMetazoa:BGLB016997-PA};
OS Biomphalaria glabrata (Bloodfluke planorb) (Freshwater snail).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Gastropoda;
OC Heterobranchia; Euthyneura; Panpulmonata; Hygrophila; Lymnaeoidea;
OC Planorbidae; Biomphalaria.
OX NCBI_TaxID=6526 {ECO:0000313|EnsemblMetazoa:BGLB016997-PA, ECO:0000313|Proteomes:UP000076420};
RN [1] {ECO:0000313|EnsemblMetazoa:BGLB016997-PA}
RP IDENTIFICATION.
RC STRAIN=BB02 {ECO:0000313|EnsemblMetazoa:BGLB016997-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_013077212.1; XM_013221758.1.
DR STRING; 6526.A0A2C9KAN0; -.
DR EnsemblMetazoa; BGLB016997-RA; BGLB016997-PA; BGLB016997.
DR GeneID; 106063406; -.
DR KEGG; bgt:106063406; -.
DR VEuPathDB; VectorBase:BGLB016997; -.
DR OrthoDB; 2876235at2759; -.
DR Proteomes; UP000076420; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0006897; P:endocytosis; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 4.
DR CDD; cd00112; LDLa; 30.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 32.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 8.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR PANTHER; PTHR22722; LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 2-RELATED; 1.
DR PANTHER; PTHR22722:SF12; LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 4 ISOFORM X1; 1.
DR Pfam; PF12662; cEGF; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF14670; FXa_inhibition; 2.
DR Pfam; PF00057; Ldl_recept_a; 29.
DR Pfam; PF00058; Ldl_recept_b; 9.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00181; EGF; 21.
DR SMART; SM00179; EGF_CA; 8.
DR SMART; SM00192; LDLa; 32.
DR SMART; SM00135; LY; 39.
DR SUPFAM; SSF57196; EGF/Laminin; 5.
DR SUPFAM; SSF57184; Growth factor receptor domain; 3.
DR SUPFAM; SSF57424; LDL receptor-like module; 32.
DR SUPFAM; SSF63825; YWTD domain; 8.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 4.
DR PROSITE; PS01209; LDLRA_1; 13.
DR PROSITE; PS50068; LDLRA_2; 32.
DR PROSITE; PS51120; LDLRB; 18.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Endocytosis {ECO:0000256|ARBA:ARBA00022583};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..4449
FT /note="EGF-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013310899"
FT TRANSMEM 4285..4308
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT REPEAT 405..447
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 736..778
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DOMAIN 1245..1284
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 1333..1375
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1376..1418
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1421..1464
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1465..1509
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1736..1782
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2009..2053
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2054..2097
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2369..2412
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2413..2454
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DOMAIN 2963..3004
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 3092..3134
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 3135..3177
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 3178..3222
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 3223..3264
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DOMAIN 3851..3887
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 3987..4029
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 4030..4074
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 4076..4118
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DOMAIN 4217..4252
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 4391..4449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4427..4449
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 32..44
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 39..57
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 51..66
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 71..83
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 78..96
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 90..105
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 882..894
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 889..907
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 901..916
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 922..934
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 929..947
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 941..956
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 961..973
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 968..986
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1001..1013
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1008..1026
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1020..1035
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1060..1075
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1083..1095
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1090..1108
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1123..1135
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1130..1148
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1249..1259
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2550..2562
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2557..2575
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2591..2603
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2598..2616
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2610..2625
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2630..2642
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2637..2655
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2757..2769
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2764..2782
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2845..2857
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2852..2870
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2885..2897
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2892..2910
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2928..2940
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2935..2953
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2947..2962
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3397..3409
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3404..3422
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3416..3431
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3436..3448
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3443..3461
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3455..3470
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3475..3487
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3482..3500
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3565..3583
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3601..3613
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3608..3626
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3643..3655
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3650..3668
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3662..3677
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3687..3699
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3694..3712
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3736..3754
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3748..3763
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3770..3782
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3777..3795
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3789..3804
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 4223..4240
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 4242..4251
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 4449 AA; 496491 MW; 7E919E7892D6D164 CRC64;
MDVRLVSSWC LLLATSWLMS STSSQDVTCM PCFEGQFQCK NCRCILQTAQ CDHSNDCGDL
SDERNCTYPS CAGHEYTCSN KVCISQEWVC DGSDDCSDGT DEQYCSNYAC HPQEWACPHS
GLCIPLSHVC DKTKHCASGG DEDGLCDTSC ADLSCDYGCH ATPKGGQCYC QLGYQINKED
NRTCIDFDEC STFGFCDQLC TNTPGGFRCS CREGYNLGQD GVCRAPESNS IRVFVTSTTK
IMSMNRDGQD TKELVKVDAV DVDIDSKGEK IFYINNTDNQ IYAMTATGNS ASRKLPIQGL
AIPVDIALDW VTNNLYIVDR DTARIELFSI TSGFQHNIVS DNLQTPIAIA VDPNIGYLFF
ADRGQKGPTM KPRIERVFMD GSHRWDLGLS KMLHPQGLTL DLVNQRVYWV DSQLDHLEGV
DYNGQRRRTV LSGGLHIPAP VSVAIFESQL FFADITKLGI MRVDRNDSTV APKTLKQIYK
AENGIPTAVV VSHSSFYKLT SRDSNPCLGN PCQHICALSH NTDNNGLGYR CLCKAGYELD
HTMNNCTRAE KFILFATPRA IRGISLEDPG QYPIDSIQPI VGQRWGRLGV NYVALDYDAE
NETVFFSDVR NRVIYRAQIG DSDPVPQLVT EIGSVEGISY DWIAKNLFFT DFRRSTLSVI
RANHPTDRRD LIKNLGNARS VVVHPLKGYL YYSDWLRNSR QSAYIARSFT DGTNITQIKK
NQLGWPNGLT IDFVNDRLYW ADAYFDRIQH SKLDGDDLQT LTGHTIVHPF GIAIYKDFIY
YTDWRLQSII RINKRGGQEQ KIRSGIGRVM GIRVYDPSLQ PLSSQNPCHL RNGDCSHFCF
VVPILEGMST VGRHCGCPYG MKLSRDQRNC EANPEEVDVT TCRPGLFQCQ NGRCIPNSYR
CDRDNDCLDR TDELNCPEAT TCPANRFKCD NGQCISSVWV CDGDNDCGDM TDEKNCSAKT
CNSREFQCNN SLCISQSLKC DTDNDCGDGS DEGEFCGTHT CPAHFFQCDD KRCIPELRVC
DGGNDCYDNT DERNCPPLNC SGTRWTCKNV RQCILTKHHC DGVPDCNDES DEQDCPQHST
DSCQRDQYRC RDGGCIPENW KCDGQSDCDD GSDEGNQCPP VTCYGDRFRC ANGRCIFKGW
VCDGDDDCGD NSDEDASLTC APPPFSCPRG KWECPGGSHV CINTTLVCDG NPDCPEGHDE
SPICNQDNCR DNNGGCSHMC IQTPRGAECK CPRGQALNGT KTCIDEDECT PPGRCSQTCI
NTKGSFKCEC VEGYTLLTDR RTCKVMRNDT ALSLLIASRT AVVKSNLEVL LYDPLPLPPF
RSLTAIDIDV NRSQIYFSDT VLKKIFRTSI DGTNLTEIVA TGIDVVEDIA VDWIGKNLYW
TDYGMETIEV VNLAGENRMV LFSENITNPR AIEVDPRDGV RYLFWSDWGQ NPRIERSGLD
GSGRIVLVSE KLFWPNALTV DYPNKRLYFA DARMDFIEFC NYDGSGRHQV FANDHFLRHP
HSLTIFEDWI YWTDRAASRV SKCNKFNCSD RSVVASSISR PLGIAVYNII KQPTGTNPCE
VAKCSHLCLL SPRPTGYSCA CPVGMQLDGT AHNCVKDSSE ILLFMQSRFI AGIKLGSTNV
TGIIPVSSIS SGQDFDFDSK EGYIYYVEKI NGSLKRIQLN GRNTSEYVPT AVIGSPNAIA
IDWMSRNLFW ANAAAGLMEV MRLDGPEHFR RVLLSNNGRR RDVANPLSLC VDPTKGLLYW
GDGGVPGVPA KIAVVEMDGS NPRVLLSAGI RTPMYMTLDT QSQSLYFTDT FNNKLQRYLI
RSGSMSQVVS AGSPQGVVFH NSRLYYYDSI YETINRAPYP TIRSAAVLRS NIKGVGALKV
YYDRHESGET NACSVNNGDC PHLCLPKSLS RTCACSIGFE QKEDGSCAAE SSFVVVSMYN
VIRGFGMTRV DVDEAMVPIA GSGRAPVAID VYMGANYIYW VDSRASSTGG KQEGGIHRIK
PDGSNFQDIL TSGMGSNGIQ GLSVDWIAGN IYFTNVFDVE VYIEVIKLDG AHRKVLVKES
QGQPRALAVN PIKRYLYWAD MGQTAKIERS LLDGTNRTTI VKSGISLPRD VTIDFVTHDV
YWVDAIVDAI QCVTFDGENR RYIQTNTPNP YGLSVFNSYV YWVDRNLQKI FRALKSPQGS
VPQVLKSNLE MLSDIAIYDQ AMQPHDDNNP CSANNGGCQQ LCFAKPNQTE PECGCATGTL
GPQKKTCVAP SSFLLFAAET EIYSLSLDPD STSNPIPTIS DLQGAVAVDY DAHENYIYFS
QVNSKKISRV KKGSTVVEDL MSPSMNTTPG YVHDVTSVEG IAFDWVGKKL YWADLFRNKI
YSINVNLTYK VVIATVQSPR ALAIDPCKGF IYWSDWGVTP KIERATMAGN QRQAIVSTDL
GWPNGLTIDY EEEKIYWADA QKDRIERANL DGNYREVIVE TTVHPFSLTV HGFYIYWSDW
TLRGIYRAEK HTGANMKMLV QGLSTRPMGV AVYSQEKQKC NNNPCTVFNG GCSHSCHPGP
DGSVDCACVE GTGQVIGNNG KVCVPANNTC TSDQFVCQNG RCLRERWVCD LDNDCGDGSD
EEPNLCALHT CDPKYFSCRN GRCIPLRYRC DFDNDCRDNS DEEACDYPTC GPNQFTCNNF
RCIDAAQRCN GIDNCRDGNR TDEVNCPPRT CPPNQVKCPT TNICIIRRYM CDGDNDCGDN
SDENPFFCHL VSCAPGDFQC SVSHKCIPGS WQCDGDDDCG SGEDESPTTC SSFNRTCQTN
QFACNNNRCV SQRWVCDGED DCGDNSDESA DRNCNERTCP PDTFTCESNK QQGSYPCIPL
SRVCDGVKNC RNGEDEMQTC PPRTCMPHEF QCTNGICISA RFKCDHDDDC GDASDEPTDC
NYHSCSSEQY TCDNKRCVPK TWSCDGDNDC GDGSDEKESI CLTPEPTCPG NKFRCTNGQC
IASELVCNKN PDCSDESDEQ YCNVDECQST RVNQCQHKCI NTITSYKCEC NPGYQLMSDR
KGCRDIDECV EVVGACSQEC ENTEGSYICK CSEGYQKMED GKTCKKTDYI TPWLIFTNRY
YLREISTEGD NHRRIAQGFE NIVSLDFDIA NDLIYFTDVK QHKIYSIFVN GTDQKVIIKD
NVPSVEGISV DWIGRKLYWV DGRRSTIGVS EMNGTSQLTL LKEGIRRPRA ISVHPFKGFL
YWSDWGNPPY IGRMGMNGKN LSTDFITEKL GWPNALTLDF ETDRLWWADA HLDIIEYANL
DGSHRHIVLE NVPHPFALSL FEDFMYWTDW NHLTIEKANK FTGENHRIIW NVTHRPMDIH
IFHPLKQKPG LNPCGTDNGG CSHLCLIAPG VANYSCACPD YFILKSDLKT CEARCTSIQY
RCGRTDDRCI PRLWKCDGEK DCRDGSDEPA DCPVSHCHPG QFQCKNKNCT FAFRVCDLHD
DCGDGSDEEG CQKHSCEPWQ MKCDNGKCIP KAWMCDREDD CGDGTDEKSC GNSTCKPNQF
RCDNGNCIQA NWKCDFDNDC GDNSDEKAEY KCETRQCEVG WWKCQTNYRC VPNWALCDGE
DDCRDNSDEK EENCPKCHPS GDFKCQNRRC IPKRWMCDFD DDCGDNSDED VNKCRDSYRK
CSESEFRCDN NKCIQGKFKC DHDNDCGDAS DEKESLCKDY QKCTPDKFTC ASGHCTNMAN
MCDGQRDCLD ASDEKNCTAL LPGGKFCHSS MFECRNHMCI PWTWRCDGVN DCGDDSDETP
AVCKEVSCTQ PERSLCNNFK CIPSWRRCDG VDNCGDNSDE ESCQVKKKEC TSDDFKCADG
TCIDGSKACD SSPDCKDFSD ERGCHKDVGL TCDDDNGGCE RNCTSLGQNS FYCSCPTGLR
VSERNRKECE DIDECASWGN FCPQECINVK GSFKCRCHKG FTDPHNRGQE CKSDEDNSYI
ILFTVGDEVR QYRSKNKDYT TEVTSGIRSA GIDIDADRRL VYWSDTNVGK IYRASMPKDD
KTKAVARDLD IVGYNRPEGL SVDWVAKNIY WTDSEVGIIA VATADGFYQK ALISTDLKYP
KAIAVHPGLG YMYWTDVHAS GPKIERAWMN GEMRTVLVST KLSYPSGLAI DYYMDNRIYW
CDSKENLIES MKPDGTDRVI VTSKAAYNPV ALDVFEGQMF WLSEKLGQLA SMDKFGREDN
KTIQTGLQLP KGLKVFNIYR YNISIKSPCK FLICSHLCLV VPEGARCACP EGASFVPDTN
NTICDATHPK PKPTPKPSQE CLCTNGGSCI TKENEVETKC ICPAGWTGEQ CESAFENVTQ
TNETVTQVPE EHKINTNLSE DDNHVAIVVP VVIGIIAILA IVLLVVILRR RGVDFNFKKL
ITKSQPPSSP TVSFKEGGQV KLGVPEMMYD AQGQGEEMQP TSSDSPTNFC NPVYDSLHGP
LTVHESVILP THGPHYESST DPEKGEIHLS GKGKVSQPSK EFRIAPRALD PNLDEDERDE
AGLVKSGDL
//