ID A0A2A2JLL6_9BILA Unreviewed; 4727 AA.
AC A0A2A2JLL6;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN ORFNames=WR25_26419 {ECO:0000313|EMBL:PAV62585.1};
OS Diploscapter pachys.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Diploscapter.
OX NCBI_TaxID=2018661 {ECO:0000313|EMBL:PAV62585.1, ECO:0000313|Proteomes:UP000218231};
RN [1] {ECO:0000313|EMBL:PAV62585.1, ECO:0000313|Proteomes:UP000218231}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PF1309 {ECO:0000313|EMBL:PAV62585.1};
RA Fradin H., Zegar C., Gutwein M., Lucas J., Kovtun M., Corcoran D.,
RA Baugh L.R., Kiontke K., Gunsalus K., Fitch D.H., Piano F.;
RT "Genome architecture and evolution of a unichromosomal asexual nematode.";
RL Curr. Biol. 0:0-0(2017).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAV62585.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LIAE01010354; PAV62585.1; -; Genomic_DNA.
DR Proteomes; UP000218231; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0006897; P:endocytosis; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00112; LDLa; 31.
DR Gene3D; 2.40.128.620; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 34.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 8.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR PANTHER; PTHR22722; LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 2-RELATED; 1.
DR PANTHER; PTHR22722:SF14; MEGALIN, ISOFORM A; 1.
DR Pfam; PF07645; EGF_CA; 3.
DR Pfam; PF14670; FXa_inhibition; 1.
DR Pfam; PF00057; Ldl_recept_a; 28.
DR Pfam; PF00058; Ldl_recept_b; 7.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00181; EGF; 21.
DR SMART; SM00179; EGF_CA; 9.
DR SMART; SM00192; LDLa; 35.
DR SMART; SM00135; LY; 32.
DR SUPFAM; SSF57184; Growth factor receptor domain; 4.
DR SUPFAM; SSF57424; LDL receptor-like module; 32.
DR SUPFAM; SSF63825; YWTD domain; 8.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01187; EGF_CA; 3.
DR PROSITE; PS01209; LDLRA_1; 14.
DR PROSITE; PS50068; LDLRA_2; 34.
DR PROSITE; PS51120; LDLRB; 11.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Endocytosis {ECO:0000256|ARBA:ARBA00022583};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Reference proteome {ECO:0000313|Proteomes:UP000218231};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..4727
FT /note="EGF-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012200809"
FT TRANSMEM 4569..4594
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT REPEAT 492..534
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 576..620
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1527..1569
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1570..1613
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1614..1657
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1992..2034
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2219..2263
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2264..2307
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2586..2636
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 4341..4384
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 4386..4427
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DOMAIN 4522..4552
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 4621..4727
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4621..4652
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4660..4684
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 60..72
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 67..85
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 79..94
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 123..138
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 232..244
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 239..257
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 251..266
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 291..306
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1068..1086
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1151..1163
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1158..1176
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1192..1204
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1199..1217
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1253..1268
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1316..1328
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1323..1341
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2801..2819
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2836..2848
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2843..2861
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2855..2870
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2876..2888
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2883..2901
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2963..2975
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2970..2988
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3016..3034
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3103..3115
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3110..3128
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3143..3155
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3150..3168
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3162..3177
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3188..3200
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3195..3213
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3672..3684
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3679..3697
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3691..3706
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3754..3766
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3761..3779
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3879..3891
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3886..3904
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3898..3913
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3918..3930
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3925..3943
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3937..3952
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3960..3972
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3967..3985
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 4050..4062
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 4057..4075
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 4069..4084
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 4542..4551
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 4727 AA; 522433 MW; 558F651E642A205A CRC64;
MLDIRICASL LCAILPFLQL SLAFSAPRQL PIQIDAPASL IRARVQPLIS SSVNTASTVC
TDQDFRCDDG RCIRLEWRCD GSGDCSQGED EKDCPHPGCK DDQWQCSSYE WHGVSCIASY
QRCDNITNCA DGSDEVDCPP PAVDCSRSDG SVFMCADGRQ CFDIKQKCDG IYDCSDLSDE
KDSCSRNHTA CFQYQFRCAD KTQCIQKSWV CDGSSDCSDG SDEPKNCEFK PCLPSEYQCT
NKRCVKKSFR CDYFDDCGDN SDEEDCGEYR CPEGRFHCPH TGKCIDQVKL CDGHKDCESG
ADEEHCADNL CPSLGCQAGC HSSPSGGQCT CPNGYELDER FHRTCLDINE CTEFGYCDQL
CSNHRPGFTC SCLGECFTLE MAHGPGKDNL TMRGYCISRE AEKMKLFVAR REGLYKLNPN
SKTEEPKRLA SGEFIYGIDF DYDDKKLFWT DRMSHSAFSA EVDENGDIGH IKKLGLKSLI
YPRSLAVDWI SNLLYIIESG SRRIDVCTFD GEKRTVLIAD ALTLPLDIAL DPLRGEMFFS
NQLKLEGAAM DGSHRRVLVN THTHQVSGVV VDITAKRVYW VDPKVDRVES IDYEGRDRRV
VAQGMNQVPH PFGLALFDQY LYWTDWTRLG AIKIEKFGSP TEVIWTKKEN NVFPMGISAY
HRMAQPVTSD SDCLGMKIEN PCAKADCEEM CLLAKDATGF DVGYRCACPI GKKLIDGKKC
VDSTDYLLFS SNKIVRGIFP TEDQSLSEAV LPISPISQRR IGMYFEVECD VHGNSFFYAD
IMDNTVFRIR PDGEGSAPVL VTHNDGLVSM SYDWVSKQLY YVDNIRSSLE VVKVTDTGLV
HPDQLVHRQL LKDLRDPVSV VVHPWKGWLF YAEAQRPAKI YRCAIDGADC IVIRNTSLGR
PSEMAIDFAE NKLYWGDTLL KTISYMDFDG KNMQVLNVDN PIPVALSIMD DHIYYVHQRP
YSIRKVGKHH GGKGKIIREF GGEERSIFSL KACSKANQPI PDDSREHPCR NSDCSQLCFA
MESAGTLKKK CACRQGFKIN PDNDHTCERD TSEPTEELCP SNSTQFQCAN GRCIPKEWKC
DGENDCLDNS DELNEKGEAC YHEQPCPENT IRCTSTKRCI PSAYACDGDN DCGQYEDEDP
KYCKPGQKPQ CSAKKFQCAN HRCIPESWKC DSDNDCSDGS DEAPELCANM TCAANQFQCS
SGRCIPIYWV CDGDQDCYGG DDEDKNRCPP IQCSSLQFRC ANGRQCIPLR NRCDSQVECE
DGSDEDDCVA PESKCTPEQF HCATTNVCIP ASWKCDKQVD CDDGSDEPPS CSTHECGANE
FKCKNNRCIP KQWLCDQTND CGDDSDESEE VGCKTSTQFA RKCPFEHVPC EDAPDQCIPI
HQLCDGKSHC PGGTDEGGRC ARDLCAADRA GCAYKCHQSP NGPLCSCPYG ETPVNNTRCA
PLNECLDPST CSQICIDEKH GFTCKCADGY TLDAADKRTC KATEGEMRIY VSNRNRIYWS
DSHLENWKTF SAAVENAIAI AWDSVTDRIY WSDIREKKIF SATRNGTDVK IFIGEGLDIT
EGIALDWVGR NLYWVDSSLN TIEVANLENP AARALLVHEN VSQPRGIAVD PRKGLMFWTD
WGQNPKIERA NMDGSDRTVI VDTKIYWPNT IALDFTTDRV YFADSKLDYI DFVSYDGTGR
TQVLCSPKLV QHPHALAIFE DFVYYSDRRL QRLQIYPKFP NGTTRTYPSH TFSKALGVVA
IHPALQPKVD NDPCASKPCS DLCLLNAHGS FTCKCPMGKR LDGTGKSCVV DSKPFLLLIQ
KTNVFGVETT TQNGVVPNLA GMVPLAGLAN AFDAGYDAES GTLFILEHTN IARSLAQIST
DAAVYKTTIN AGNRTLLYSS HVPDDPYCLA FDWNGRNVIV GNKVSQTIEV IRTVGDTYRA
VILTNDQSPT AVVNPVAIAA DSDRGLIFWL DRGGGASDVK VARASMDGTQ PLVVVSNDLT
QLDHLALDIV NQRIYFTESK SGRITSVTYD GQDRHYLLND PGKQPNGLAF YSDKLYYSDS
AFDSIEYAQI TGSGEAPQFS HFKKEVENLV NIKMLQPRAS SLSHPCRINN ANCKHICIPQ
MFSQYKCICA TGYTPTPGNT NECKLFDESF VLVATKNKIT GFPVDQTQTK GVAMESIGGL
SITAVDYDYD SKTVFVADGA GINRGITAYT LGQGAPRTIV KDTFGSMVVK SISVDWVNYN
LYFINQDAER TNIEVCKFDG QYRKILVSTK TETPSSIAVD PVGRYLYWAD NGQKPSIQKA
LLDGSRRELL ISEDLGEPTD LIVDTASRML YWTDAKKDGI FRVKTTGGKP ELVRSDIASA
AGVTLLGQDM FWSDNRLSKV FKAGSKPNAT PVPLTPTVVA TSVPDVGDIR IFSSLNQPKT
TSPCQITDNL RKSPCSQLCF SSPGTQSATC ACARGTLKGR VCEEPDTYLM FSDGDKIIDA
PIEPDIKANK PLMDALPAID NLQTFDVDVN LRKIYYVAES PAGVNISWIA MNNADSPRLI
FGPSKQKHAT DIRHISDMKF DWHNQKLYFT TGRSGKLMVL DTLGEHMGTI ARGDWTYALG
LDPCAGLIFW SDSGYKASGG LYEPRIERAN TAGGNRKVLI SQSVSLPAAI TVDWREKRIY
WADVNRLNIE SCDYEGGNRR VLGAGYRAKS LDLWDNWIYM SDPLSNGVFR IDKNSGGSVE
VVVADRRVPG TLRVFASEDD IRTRNQACSS ITSQACKTDN GGCEQICTVV SDEIGDAAQK
VQCACNETYE LVTEPGKDFA SKCVLRDNAG KACMPPYNFQ CGDGTCISLD ATCDSKSDCP
SDNSDEDPIY CNSRVCPADY FLCVNRRCVS GLKRCNSIDD CGDGSDELDC ASTAQCAPGM
FACGNGHCIN QTRVCDGRND CHDEAVSDEN STTCPGLPID CRGVKIKCPN TNICIQPADL
CDGYDDCGTK DDENKLFCMN QKCAQNYVRC PSGRCIPETW QCDGDADCPD AWDETHTNCT
DSSGKRICVG EYLFQCDNGK CISRAFICDG ESDCEDGSDE NTARHHCGNR TCSDQEFHCA
SNARLAQPKY ECIPKSWLCD SEVSCAGGED ESVELCKREK KSCNKNEFAC ANSHCINASW
ECDGDQDCLD GSDEHANCTY SSCQPEFWQC KDHKCIPLSW KCDGQRDCSG GEDEDQCEGA
KGPGTGNCTA NQYACTSGEC IDMKKVCDNK FDCTDHSDES AQCNIDECTL AEKPLCEQKC
VDKPIGYQCE CFEGFALDKD DQKSCHNVDE CYEGTSTCSQ QCEDKIGSYK CSCVKGYQLE
KDDHGCKRTD PEPEPYMLLA NKHYIRKLSL DGSIYDMAAE GFDNVVSMDF DWKEKMLYIV
DQGRLRLLRI GLDEIGSGLN SYETIVRHHV FGTEGFAIDW IGRKMYMLNR QERAIRVCEL
DGTSCKTLIR DRIQQPKAIA IHPGKGYLYF TEWSLQPYIG RMALDASPEL ADPIVKLAEK
DLGWPNAITI DFFSDRIFWG DAHLNEIGFM DFDGGGRRHI PAQRTSHVSS MVIFDDWLYW
SDWNLKEVIR CNKWTGKNET VLKKIIQLPN ELRVIHPMRQ PDFPNPCGDN NGGCSHLCLI
GAGGNGFTCA CPDQFVLLPD SKTCEPNCTA RQFACGGDDA KCIPKLWYCD GEKDCRNGED
EPGPDICGIR VCPVGEFQCG NHNCTRPFQI CDGTDDCGDG SDEQNCNQAC DPGQFKCKET
GKCIPTRFVC DGDDDCGGRS DEADEICLSP NRTCTAEEFK CTNNRCISKA WTCDNEDDCG
DSSDETPECA QVECRKGWIR CSNSYRCVPG WAACNGNDDC RDNSDENREK CPSCDDVGEF
RCGSSGKCIP KRWMCDTEAD CPGGEDELDE SCGGTTRPCS ESEFRCNSGK CIPGKRVCDG
IANCEDGLDE SQCTHRNCSA GYRQCNDGQC ILEHKWCDRK KDCQHAEDET SCESTTRRPC
SPFEFQCSNG VCVNMKFKCD GDDDCGDLSD ETTPDCRTAA CDPPLRFRCA HSRLCLNILQ
LCNGYNDCGP NDYSDEHLSM CSSFSEYGDC TVEQFKCANG KCINATMACD RVDQCGDASD
EIGCVKAGGS TCETHGNNGG CKQLCTDLAG GGYICACREG FEPDPNNPKD CVDIDECKGN
NTCTQMCLNT KGSYLCRCHD DYENNVVVGA MTGKDCRAKG DPADVVVAAG DTLVQLSLHG
GGVNRHAAAQ APDDDNDIIS LAFDGRRDMM YWIDEDDKNV FRAATVKGNQ SHEAQKLDID
WAGMGLRPTA VATDYSTGNL FITAVNDHIN DITRKKRMSE PMRAADFGSV LVSLPDGRYV
KKIISGHLEA PTAIVTLPTM GKICYSDAGL HAKIECAQMD GSHREILVKD LVFSPSSLAV
DEGKGNRIYW ADPKYRTVEV INPDGTGRIT VVRDNNVPVA IDVFENHLYW LSKKTKTLFV
QDKFGRGRIQ VLASNLEDVH TVKVSQRFAR DSTRFKGGCA STVCSHLCVQ LPDEGFACLC
PDNSIPHPDG SCSTPRSEPL TMPRQCSCTN GGTCKLDGTC ICTSDFEGEN CDRDSSVSRK
LISTLSSNVL LAILLLLVII AASGVIFFVG MHLIRKRRLL AKKEGDDGTV SFHGNVISFS
NPALESKSEP NPVEYSMQTI STGPNGTTFS NPVYELEDAG HQMSDSDKPG TSTERRHSET
HKSIELSGPS KPSSSKPEVA PKPKKGDKTL LVDNPLYEPP DTEISDV
//