ID Q7Z5C1_HUMAN Unreviewed; 4655 AA.
AC Q7Z5C1;
DT 01-OCT-2003, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2003, sequence version 1.
DT 27-MAR-2024, entry version 141.
DE SubName: Full=Glycoprotein receptor gp330/megalin {ECO:0000313|EMBL:AAP88585.1};
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606 {ECO:0000313|EMBL:AAP88585.1};
RN [1] {ECO:0000313|EMBL:AAP88585.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Placenta {ECO:0000313|EMBL:AAP88585.1};
RX PubMed=8706697; DOI=10.1111/j.1432-1033.1996.0132u.x;
RA Hjaelm G., Murray E., Crumley G., Harazim W., Lundgren S., Onyango I.,
RA Ek B., Larsson M., Juhlin C., Hellman P., Davis H., Aekerstroem G.,
RA Rask L., Morse B.;
RT "Cloning and sequencing of human gp330, a Ca(2+)-binding receptor with
RT potential intracellular signaling properties.";
RL Eur. J. Biochem. 239:132-137(1996).
RN [2] {ECO:0000313|EMBL:AAP88585.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Placenta {ECO:0000313|EMBL:AAP88585.1};
RA Hjalm G.;
RL Submitted (MAR-2003) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the LDLR family.
CC {ECO:0000256|ARBA:ARBA00009939}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY265357; AAP88585.1; -; mRNA.
DR RefSeq; NP_004516.2; NM_004525.2.
DR PeptideAtlas; Q7Z5C1; -.
DR DNASU; 4036; -.
DR GeneID; 4036; -.
DR KEGG; hsa:4036; -.
DR CTD; 4036; -.
DR PharmGKB; PA30452; -.
DR OrthoDB; 2876235at2759; -.
DR BioGRID-ORCS; 4036; 7 hits in 1149 CRISPR screens.
DR GenomeRNAi; 4036; -.
DR Genevisible; Q7Z5C1; HS.
DR GO; GO:0005765; C:lysosomal membrane; HDA:UniProtKB.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0006897; P:endocytosis; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 3.
DR CDD; cd00112; LDLa; 34.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 36.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 8.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR019825; Lectin_legB_Mn/Ca_BS.
DR PANTHER; PTHR22722:SF11; LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 2; 1.
DR PANTHER; PTHR22722; LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 2-RELATED; 1.
DR Pfam; PF12662; cEGF; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF14670; FXa_inhibition; 2.
DR Pfam; PF00057; Ldl_recept_a; 34.
DR Pfam; PF00058; Ldl_recept_b; 15.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00181; EGF; 26.
DR SMART; SM00179; EGF_CA; 10.
DR SMART; SM00192; LDLa; 36.
DR SMART; SM00135; LY; 38.
DR SUPFAM; SSF57196; EGF/Laminin; 4.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR SUPFAM; SSF57424; LDL receptor-like module; 36.
DR SUPFAM; SSF63825; YWTD domain; 8.
DR PROSITE; PS00010; ASX_HYDROXYL; 3.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 6.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01187; EGF_CA; 3.
DR PROSITE; PS01209; LDLRA_1; 17.
DR PROSITE; PS50068; LDLRA_2; 36.
DR PROSITE; PS51120; LDLRB; 23.
DR PROSITE; PS00307; LECTIN_LEGUME_BETA; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 2: Evidence at transcript level;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Endocytosis {ECO:0000256|ARBA:ARBA00022583};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Receptor {ECO:0000256|ARBA:ARBA00023170, ECO:0000313|EMBL:AAP88585.1};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..4655
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004298552"
FT TRANSMEM 4424..4446
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT REPEAT 436..478
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 479..521
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 522..568
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 569..613
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 753..795
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 796..837
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 838..881
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 882..925
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1478..1520
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1521..1563
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1566..1609
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1883..1930
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1931..1972
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2202..2245
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2246..2289
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2519..2562
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 2563..2604
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 3239..3281
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 3333..3376
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 3377..3419
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 4154..4196
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 4197..4240
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 4242..4283
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DOMAIN 4377..4411
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 4550..4575
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4591..4655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4591..4612
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4613..4630
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 28..40
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 35..53
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 47..62
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 108..120
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 115..133
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 127..142
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 152..170
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 164..179
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 183..195
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 190..208
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 202..217
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 222..234
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 229..247
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 241..256
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1026..1038
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1033..1051
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1045..1060
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1067..1079
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1074..1092
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1086..1101
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1109..1121
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1116..1134
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1128..1143
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1149..1161
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1156..1174
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1168..1183
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1207..1222
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1271..1283
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1278..1296
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1290..1305
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2700..2712
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2707..2725
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2741..2753
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2748..2766
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2760..2775
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2788..2806
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2864..2876
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2871..2889
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2906..2918
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2913..2931
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2993..3005
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3000..3018
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3012..3027
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3032..3044
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3039..3057
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3075..3087
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3082..3100
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3094..3109
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3553..3565
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3560..3578
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3594..3606
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3601..3619
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3635..3647
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3642..3660
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3699..3714
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3727..3745
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3739..3754
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3759..3771
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3766..3784
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3778..3793
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3798..3810
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3805..3823
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3817..3832
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3842..3854
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3849..3867
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3891..3909
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3928..3940
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3935..3953
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 3947..3962
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 4401..4410
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 4655 AA; 521982 MW; E0AFDCF1AC9F235D CRC64;
MDRGPAAVAC TLLLALVACL APASGQECDS AHFRCGSGHC IPADWRCDGT KDCSDDADEI
GCAVVTCQQG YFKCQSEGQC IPSSWVCDQD QDCDDGSDER QDCSQSTCSS HQITCSNGQC
IPSEYRCDHV RDCPDGADEN DCQYPTCEQL TCDNGACYNT SQKCDWKVDC RDSSDEINCT
EICLHNEFSC GNGECIPRAY VCDHDNDCQD GSDEHACNYP TCGGYQFTCP SGRCIYQNWV
CDGEDDCKDN GDEDGCESGP HDVHKCSPRE WSCPESGRCI SIYKVCDGIL DCPGREDENN
TSTGKYCSMT LCSALNCQYQ CHETPYGGAC FCPPGYIINH NDSRTCVEFD DCQIWGICDQ
KCESRPGRHL CHCEEGYILE RGQYCKANDS FGEASIIFSN GRDLLIGDIH GRSFRILVES
QNRGVAVGVA FHYHLQRVFW TDTVQNKVFS VDINGLNIQE VLNVSVETPE NLAVDWVNNK
IYLVETKVNR IDMVNLDGSY RVTLITENLG HPRGIAVDPT VGYLFFSDWE SLSGEPKLER
AFMDGSNRKD LVKTKLGWPA GVTLDMISKR VYWVDSRFDY IETVTYDGIQ RKTVVHGGSL
IPHPFGVSLF EGQVFFTDWT KMAVLKANKF TETNPQVYYQ ASLRPYGVTV YHSLRQPYAT
NPCKDNNGGC EQVCVLSHRT DNDGLGFRCK CTFGFQLDTD ERHCIAVQNF LIFSSQVAIR
GIPFTLSTQE DVMVPVSGNP SFFVGIDFDA QDSTIFFSDM SKHMIFKQKI DGTGREILAA
NRVENVESLA FDWISKNLYW TDSHYKSISV MRLADKTRRT VVQYLNNPRS VVVHPFAGYL
FFTDWFRPAK IMRAWSDGSH LLPVINTTLG WPNGLAIDWA ASRLYWVDAY FDKIEHSTFD
GLDRRRLGHI EQMTHPFGLA IFGEHLFFTD WRLGAIIRVR KADGGEMTVI RSGIAYILHL
KSYDVNIQTG SNACNQPTHP NGDCSHFCFP VPNFQRVCGC PYGMRLASNH LTCEGDPTNE
PPTEQCGLFS FPCKNGRCVP NYYLCDGVDD CHDNSDEQLC GTLNNTCSSS AFTCGHGECI
PAHWRCDKRN DCVDGSDEHN CPTHAPASCL DTQYTCDNHQ CISKNWVCDT DNDCGDGSDE
KNCNSTETCQ PSQFNCPNHR CIDLSFVCDG DKDCVDGSDE VGCVLNCTAS QFKCASGDKC
IGVTNRCDGV FDCSDNSDEA GCPTRPPGMC HSDEFQCQED GICIPNFWEC DGHPDCLYGS
DEHNACVPKT CPSSYFHCDN GNCIHRXWLC DRDNDCGDMS DEKDCPTQPF RCPSWQWQCL
GHNICVNLSV VCDGIFDCPN GTDESPLCNG NSCSDFNGGC THECVQEPFG AKCLCPLGFL
LANDSKTCED IDECDILGSC SQHCYNMRGS FRCSCDTGYM LESDGRTCKV TASESLLLLV
ASQNKIIADS VTSQVHNIYS LVENGSYIVA VDFDSISGRI FWSDATQGKT WSAFQNGTDR
RVVFDSSIIL TETIAIDWVG RNLYWTDYAL ETIEVSKIDG SHRTVLISKN LTNPRGLALD
PRMNEHLLFW SDWGHHPRIE RASMDGSMRT VIVQDKIFWP CGLTIDYPNR LLYFMDSYLD
YMDFCDYNGH HRRQVIASDL IIRHPYALTL FEDSVYWTDR ATRRVMRANK WHGGNQSVVM
YNIQWPLGIV AVHPSKQPNS VNPCAFSRCS HLCLLSSQGP HFYSCVCPSG WSLSPDLLNC
LRDDQPFLIT VRQHIIFGIS LNPEVKSNDA MVPIAGIQNG LDVEFDDAEQ YIYWVENPGE
IHRVKTDGTN RTVFASISMV GPSMNLALDW ISRNLYSTNP RTQSIEVLTL HGDIRYRKTL
IANDGTALGV GFPIGITVDP ARGKLYWSDQ GTDSGVPAKI ASANMDGTSV KTLFTGNLEH
LECVTLDIEE QKLYWAVTGR GVIERGNVDG TDRMILVHQL SHPWGIAVHD SFLYYTDEQY
EVIERVDKAT GANKIVLRDN VPNLRGLQVY HRRNAAESSN GCSNNMNACQ QICLPVPGGL
FSCACATGFK LNPDNRSCSP YNSFIVVSML SAIRGFSLEL SDHSETMVPV AGQGRNALHV
DVDVSSGFIY WCDFSSSVAS DNAIRRIKPD GSSLMNIVTH GIGENGVRGI AVDWVAGNLY
FTNAFVSETL IEVLRINTTY RRVLLKVTVD MPRHIVVDPK NRYLFWADYG QRPKIERSFL
DCTNRTVLVS EGIVTPRGLA VDRSDGYVYW VDDSLDIIAR IRINGENSEV IRYGSRYPTP
YGITVFENSI IWVDRNLKKI FQASKEPENT EPPTVIRDNI NWLRDVTIFD KQVQPRSPAE
VNNNPCLENN GGCSHLCFAL PGLHTPKCDC AFGTLQSDGK NCAISTENFL IFALSNSLRS
LHLDPENHSP PFQTINVERT VMSLDYDSVS DRIYFTQNLA SGVGQISYAT LSSGIHTPTV
IASGIGTADG IAFDWITRRI YYSDYLNQMI NSMAEDGSNR TVIARVPKPR AIVLDPCQGY
LYWADWDTHA KIERATLGGN FRVPIVNSSL VMPSGLTLDY EEDLLYWVDA SLQRIERSTL
TGVDREVIVN AAVHAFGLTL YGQYIYWTDL YTQRIYRANK YDGSGQIAMT TNLLSQPRGI
NTVVKNQKQQ CNNPCEQFNG GCSHICAPGP NGAECQCPHE GNWYLANNRK HCIVDNGERC
GASSFTCSNG RCISEEWKCD NDNDCGDGSD EMESVCALHT CSPTAFTCAN GRCVQYSYRC
DYYNDCGDGS DEAGCLFRDC NATTEFMCNN RRCIPREFIC NGVDNCHDNN TSDEKNCPDR
TCQSGYTKCH NSNICIPRVY LCDGDNDCGD NSDENPTYCT THTCSSSEFQ CTSGRCIPQH
WYCDQETDCF DASDEPASCG HSERTCLADE FKCDGGRCIP SEWICDGDND CGDMSDEDKR
HQCQNQNCSD SEFLCVNDRP PDRRCIPQSW VCDGDVDCTD GYDENQNCTR RTCSENEFTC
GYGLCIPKIF RCDRHNDCGD YSDERGCLYQ TCQQNQFTCQ NGRCISKTFV CDEDNDCGDG
SDELMHLCHT PEPTCPPHEF KCDNGRCIEM MKLCNHLDDC LDNSDEKGCG INECHDPSIS
GCDHNCTDTL TSFYCSCRPG YKLMSDKRTC VDIDECTEMP FVCSQKCENV IGSYICKCAP
GYLREPDGKT CRQNSNIEPY LIFSNRYYLR NLTIDGYFYS LILEGLDNVV ALDFDRVEKR
LYWIDTQRQV IERMFLNKTN KETIINHRLP AAESLAVDWV SRKLYWLDAR LDGLFVSDLN
GGHRRMLAQH CVDANNTFCF DNPRGLALHP QYGYLYWADW GHRAYIGRVG MDGTNKSVII
STKLEWPNGI TIDYTNDLLY WADAHLGYIE YSDLEGHHRH TVYDGALPHP FAITIFEDTI
YWTDWNTRTV EKGNKYDGSN RQTLVNTTHR PFDIHVYHPY RQPIVSNPCG TNNGGCSHLC
LIKPGGKGFT CECPDDFRTL QLSGSTYCMP MCSSTQFLCA NNEKCIPIWW KCDGQKDCSD
GSDELALCPQ RFCRLGQFQC SDGNCTSPQT LCNAHQNCPD GSDEDRLLCE NHHCDSNEWQ
CANKRCIPES WQCDTFNDCE DNSDEDSSHC ASRTCRPGQF RCANGRCIPQ AWKCDVDNDC
GDHSDEPIEE CMSSAHLCDN FTEFSCKTNY RCIPKWAVCN GVDDCRDNSD EQGCEERTCH
PVGDFRCKNH HCIPLRWQCD GQNDCGDNSD EENCAPRECT ESEFRCVNQQ CIPSRWICDH
YNDCGDNSDE RDCEMRTCHP EYFQCTSGHC VHSELKCDGS ADCLDASDEA DCPTRFPDGA
YCQATMFECK NHVCIPPYWK CDGDDDCGDG SDEELHLCLD VPCNSPNRFR CDNNRCIYSH
EVCNGVDDCG DGTDETEEHC RKPTPKPCTE YEYKCGNGHC IPHDNVCDDA DDCGDWSDEL
GCNKGKERTC AENICEQNCT QLNEGGFICS CTAGFETNVF DRTSCLDINE CEQFGTCPQH
CRNTKGSYEC VCADGFTSMS DRPGKRCAAE GSSPLLLLPD NVRIRKYNLS SERFSEYLQD
EEYIQAVDYD WDPXDIGLSV VYYTVRGEGS RFGAIKRAYI PNFESGRNNL VQEVDLKLKY
VMQPDGIAVD WVGRHIYWSD VKNKRIEVAK LDGRYRKWLI STDLDQPAAI AVNPKLGLMF
WTDWGKEPKX ESAWMNGEDR NILVFEDLGW PTGLSIDYLN NDRIYWSDFK EDVIETIKYD
GTDRRVIAKE AMNPYSLDIF EDQLYWISKE KGEVWKQNKF GQGKKEKTLV VNPWLTQVRI
FHQLRYNKSV PNLCKQICSH LCLLRPGGYS CACPQGSSFI EGSTTECDAA IELPINLPPP
CRCMHGGNCY FDETDLPKCK CPSGYTGKYC EMAFSKGISP GTTAVAVLLT ILLIVVIGAL
AIAGFFHYRR TGSLLPALPK LPSLSSLVKP SENGNGVTFR SGADLNMDIG VSGFGPETAI
DRSMAMSEDF VMEMGKQPII FENPMYSARD SAVKVVQPIQ VTVSENVDNK NYGSPINPSE
IVPETNPTSP AADGTQVTKW NLFKRKSKQT TNFENPIYAQ MENEQKESVA ATPPPSPSLP
AKPKPPSRRD PTPTYSATED TFKDTANLVK EDSEV
//