ID A0A096NGD9_PAPAN Unreviewed; 3312 AA.
AC A0A096NGD9;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 3.
DT 24-JAN-2024, entry version 62.
DE SubName: Full=Cadherin EGF LAG seven-pass G-type receptor 3 {ECO:0000313|Ensembl:ENSPANP00000012013.3};
GN Name=CELSR3 {ECO:0000313|Ensembl:ENSPANP00000012013.3};
OS Papio anubis (Olive baboon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Papio.
OX NCBI_TaxID=9555 {ECO:0000313|Ensembl:ENSPANP00000012013.3, ECO:0000313|Proteomes:UP000028761};
RN [1] {ECO:0000313|Ensembl:ENSPANP00000012013.3, ECO:0000313|Proteomes:UP000028761}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Liu Y.L., Abraham K.A., Akbar H.A., Ali S.A., Anosike U.A., Aqrawi P.A.,
RA Arias F.A., Attaway T.A., Awwad R.A., Babu C.B., Bandaranaike D.B.,
RA Battles P.B., Bell A.B., Beltran B.B., Berhane-Mersha D.B., Bess C.B.,
RA Bickham C.B., Bolden T.B., Carter K.C., Chau D.C., Chavez A.C.,
RA Clerc-Blankenburg K.C., Coyle M.C., Dao M.D., Davila M.L.D.,
RA Davy-Carroll L.D., Denson S.D., Dinh H.D., Fernandez S.F., Fernando P.F.,
RA Forbes L.F., Francis C.F., Francisco L.F., Fu Q.F., Garcia-Iii R.G.,
RA Garrett T.G., Gross S.G., Gubbala S.G., Hirani K.H., Hogues M.H.,
RA Hollins B.H., Jackson L.J., Javaid M.J., Jhangiani S.J., Johnson A.J.,
RA Johnson B.J., Jones J.J., Joshi V.J., Kalu J.K., Khan N.K., Korchina V.K.,
RA Kovar C.K., Lago L.L., Lara F.L., Le T.-K.L., Lee S.L., Legall-Iii F.L.,
RA Lemon S.L., Liu J.L., Liu Y.-S.L., Liyanage D.L., Lopez J.L.,
RA Lorensuhewa L.L., Mata R.M., Mathew T.M., Mercado C.M., Mercado I.M.,
RA Morales K.M., Morgan M.M., Munidasa M.M., Ngo D.N., Nguyen L.N.,
RA Nguyen T.N., Nguyen N.N., Obregon M.O., Okwuonu G.O., Ongeri F.O.,
RA Onwere C.O., Osifeso I.O., Parra A.P., Patil S.P., Perez A.P., Perez Y.P.,
RA Pham C.P., Pu L.-L.P., Puazo M.P., Quiroz J.Q., Rouhana J.R., Ruiz M.R.,
RA Ruiz S.-J.R., Saada N.S., Santibanez J.S., Scheel M.S., Schneider B.S.,
RA Simmons D.S., Sisson I.S., Tang L.-Y.T., Thornton R.T., Tisius J.T.,
RA Toledanes G.T., Trejos Z.T., Usmani K.U., Varghese R.V., Vattathil S.V.,
RA Vee V.V., Walker D.W., Weissenberger G.W., White C.W., Williams A.W.,
RA Woodworth J.W., Wright R.W., Zhu Y.Z., Han Y.H., Newsham I.N.,
RA Nazareth L.N., Worley K.W., Muzny D.M., Rogers J.R., Gibbs R.G.;
RT "Whole Genome Assembly of Papio anubis.";
RL Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPANP00000012013.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Receptor that may have an important role in cell/cell
CC signaling during nervous system formation.
CC {ECO:0000256|ARBA:ARBA00002066}.
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004651};
CC Multi-pass membrane protein {ECO:0000256|ARBA:ARBA00004651}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}.
CC -!- SIMILARITY: Belongs to the G-protein coupled receptor 2 family. LN-TM7
CC subfamily. {ECO:0000256|ARBA:ARBA00010933}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSPANT00000026432.3; ENSPANP00000012013.3; ENSPANG00000013159.3.
DR eggNOG; KOG4289; Eukaryota.
DR GeneTree; ENSGT00940000160077; -.
DR HOGENOM; CLU_000158_1_0_1; -.
DR OMA; ECETRWG; -.
DR Proteomes; UP000028761; Chromosome 2.
DR Bgee; ENSPANG00000013159; Expressed in postnatal subventricular zone and 15 other cell types or tissues.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004930; F:G protein-coupled receptor activity; IEA:UniProtKB-KW.
DR GO; GO:0007166; P:cell surface receptor signaling pathway; IEA:InterPro.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR CDD; cd11304; Cadherin_repeat; 9.
DR CDD; cd00054; EGF_CA; 5.
DR CDD; cd00055; EGF_Lam; 2.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.60.220.50; -; 1.
DR Gene3D; 2.60.40.60; Cadherins; 9.
DR Gene3D; 4.10.1240.10; GPCR, family 2, extracellular hormone receptor domain; 1.
DR Gene3D; 2.10.25.10; Laminin; 7.
DR Gene3D; 1.20.1070.10; Rhodopsin 7-helix transmembrane proteins; 1.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR020894; Cadherin_CS.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR032471; GAIN_dom_N.
DR InterPro; IPR046338; GAIN_dom_sf.
DR InterPro; IPR017981; GPCR_2-like_7TM.
DR InterPro; IPR036445; GPCR_2_extracell_dom_sf.
DR InterPro; IPR001879; GPCR_2_extracellular_dom.
DR InterPro; IPR000832; GPCR_2_secretin-like.
DR InterPro; IPR000203; GPS.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR002049; LE_dom.
DR PANTHER; PTHR24026:SF38; CADHERIN EGF LAG SEVEN-PASS G-TYPE RECEPTOR 3; 1.
DR PANTHER; PTHR24026; FAT ATYPICAL CADHERIN-RELATED; 1.
DR Pfam; PF00002; 7tm_2; 1.
DR Pfam; PF00028; Cadherin; 8.
DR Pfam; PF00008; EGF; 3.
DR Pfam; PF16489; GAIN; 1.
DR Pfam; PF01825; GPS; 1.
DR Pfam; PF02793; HRM; 1.
DR Pfam; PF00053; Laminin_EGF; 1.
DR Pfam; PF02210; Laminin_G_2; 2.
DR PRINTS; PR00205; CADHERIN.
DR PRINTS; PR00249; GPCRSECRETIN.
DR SMART; SM00112; CA; 9.
DR SMART; SM00181; EGF; 6.
DR SMART; SM00179; EGF_CA; 5.
DR SMART; SM00180; EGF_Lam; 1.
DR SMART; SM00303; GPS; 1.
DR SMART; SM00008; HormR; 1.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49313; Cadherin-like; 9.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 4.
DR PROSITE; PS00232; CADHERIN_1; 6.
DR PROSITE; PS50268; CADHERIN_2; 8.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 6.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 1.
DR PROSITE; PS50227; G_PROTEIN_RECEP_F2_3; 1.
DR PROSITE; PS50261; G_PROTEIN_RECEP_F2_4; 1.
DR PROSITE; PS50221; GPS; 1.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00043}; Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW G-protein coupled receptor {ECO:0000256|ARBA:ARBA00023040};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Reference proteome {ECO:0000313|Proteomes:UP000028761};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transducer {ECO:0000256|ARBA:ARBA00023224};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..32
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 33..3312
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5035239839"
FT TRANSMEM 2538..2562
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2574..2594
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2600..2622
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2643..2663
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2683..2705
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2726..2745
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 326..433
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 434..545
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 546..651
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 652..756
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 757..858
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 859..961
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 962..1067
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1068..1169
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1375..1433
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1435..1471
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1475..1514
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1515..1719
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1722..1758
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1764..1944
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1946..1979
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1982..2020
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2077..2124
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 2109..2182
FT /note="G-protein coupled receptors family 2 profile 1"
FT /evidence="ECO:0000259|PROSITE:PS50227"
FT DOMAIN 2539..2775
FT /note="G-protein coupled receptors family 2 profile 2"
FT /evidence="ECO:0000259|PROSITE:PS50261"
FT REGION 89..199
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 213..306
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2361..2399
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2888..2942
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2977..3015
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3084..3312
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 103..119
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2366..2393
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2915..2942
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2979..3000
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3109..3123
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3196..3210
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3235..3302
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1423..1432
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1461..1470
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1748..1757
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2010..2019
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2077..2089
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 2079..2096
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 2098..2107
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ SEQUENCE 3312 AA; 358654 MW; FA5DB0C1D199214C CRC64;
MMARRPPWRG LGGRSTPILL LLLLSLFPLS QEELGGGGHQ GWDPGLAATT GLRPHIGGRA
LALCPESPGV REDGGPGLGV REPVFVGLRG ERQSSRNSRG PPEQPNEELR IEHGVQPLGS
RERETGQGPG SMLHWRPEIS SCGRTGPLQR GSLSPGALSP GVPGSGNSSP LPSDFLVRHH
GPKPVSSQRN AGTGARKRVG TARCCGELRA IGSKSQGERA TTSGAERTAP RRNCLPGASG
SGPELDSAPR TARTAPAPGS APRESRTAPK PAPERMRSRG LFRRRFLPQR SGPRPPGIPA
RPEARKITSA NRARFRRAAN RHPQFPQYNY QTLVPENEAA GTAVLRVVAQ DPDAGEAGRL
VYSLAALMNS RSLELFSIDP QSGLIRTAAA LDRESMERHY LRVTAQDHGS PRLSATTMVA
VTVADRNDHS PVFEQAQYRE TLRENVEEGY PILQLRATDG DAPLNANLRY RFVGPPAARA
AAAAAFEIDP RSGLISTSGR VDREHMESYE LVVEASDQGQ EPGPRSATVR VHITVLDEND
NAPQFSEKRY VAQVREDVRP HTVVLRVTAT DRDKDANGLV HYNIISGNSR GHFAIDSLTG
EIQVVAPLDF EAEREYALRI RAQDAGRPPL SNNTGLASIQ VVDINDHIPI FVSTPFQVSV
LENAPLGHSV IHIQAVDADH GENARLEYSL TGVASDTPFV INSATGWVSV SGPLDRESVE
HYFFGVEARD HGSPPLSASA SVTVTVLDVN DNRPEFTMKE YHLRLNEDAA VGTSVVSVTA
VDRDANSAIS YQITGGNTRN RFAISTQGGV GLVTLALPLD YKQERYFKLV LTASDRALHD
HCYVHINITD ANTHRPVFQS AHYSVSVNED RPVGSTIVVI SASDDDVGEN ARITYLLEDN
LPQFRIDADS GAITLQAPLD YEDQVTYTLA ITARDNGIPQ KADTTYVEVM VNDVNDNAPQ
FVASHYTGLV SEDAPPFTSV LQISATDRDA HANGRVQYTF QNGEDGDGDF TIEPTSGIVR
TVRRLDREAV SVYELTAYAV DRGVPPLRTP VSIQVTVQDV NDNAPVFPAE EFEVRVKENS
IVGSVVAQIT AVDPDEGPNA HIMYQIVEGN IPELFQMDIF SGELTALIDL DYEARQEYVI
VVQATSAPLV SRATVHVRLI DQNDNSPVLN NFQILFNNYV SNRSDTFPSG IIGRIPAYDP
DVSDHLFYSF ERGNELQLLV VNQTSGELRL SRKLDNNRPL VASMLVTVTD GLHSVTAQCV
LRVVIITEEL LANSLTVRLE NMWQERFLSP LLGRFLEGVA EVLATPAEDV FIFNIQNDTD
VGGTVLNVSF SALAPRGAGA GAAGPWFSSE ELQEQLYVRR AALAARSLLD VLPFDDNVCL
REPCENYMKC VSVLRFDSSA PFLASASTLF RPIQPIAGLR CRCPPGFTGD FCETELDLCY
SNPCRNGGAC ARREGGYTCV CRPRFTGEDC ELDTEAGRCV PGVCRNGGTC TDAPNGGFRC
QCPAGGAFEG PRCEVAARSF PPSSFVMFRG LRQRFHLTLS LSFATVQQSG LLFYNGRLNE
KHDFLALELV AGQVRLTYST GESNTVVSPT VPGGLSDGQW HTVHLKYYNK PRTDALGGAQ
GPSKDKVAVL SVDDCDVAVA LQFGAEIGNY SCAAAGVQTS SKKSLDLTGP LLLGGVPNLP
ENFPVSHKDF IGCMRDLHID GRRMDMAAFV ANNGTMAGCQ AKLHFCDSGP CKNSGFCLER
WGGFSCDCPV GFGGKDCRLT MAHPHHFRGN GTLSWNFGSD MAVSVPWYLG LAFRTRATQG
VLMQVQAGPH STLLCQLDRG LLSVTVTRGS GRASHLLLDQ VTVSDGRWHD LRLELQEEPG
GRRGHHVLMV SLDFSLFQDT MAVGSELQGL KVKQLHVGGL PPGSAEEAPQ GLVGCIQGVW
LGSTPSGSPA LLPPSHQVNA EPGCVMTNAC ASGPCPPHAD CRDLWQTFSC ICRPGYYGPG
CVDACLLNPC QNQGSCRHLP GAPHGYTCDC VGGYFGHHCE HRMDQQCPRG WWGSPTCGPC
NCDVHKGFDP NCNKTNGQCH CKEFHYRPRG SDSCLPCDCY PVGSTSRSCA PHSGQCPCRP
GALGRQCNSC DSPFAEVTAS GCRVLYDACP KSLRSGVWWP QTKFGVLATV PCPRGALGAA
VRLCDEAQGW LEPDLFNCTS PAFRELSLLL DGLELNKTAL DTMEAKKLAQ RLREVTSHTD
HYFSQDVRVT ARLLAHLLAF ESHQQGFGLT ATQDAHFNEN LLWAGSALLA PETGDLWAAL
GQRAPGGSPG SAGLVRHLEE YAATLARNME LTYLNPMGLV TPNIMLSIDR MEHPSSPRGA
HRYPRYHSNL FRGQDAWDPH THVLLPSQSP RPSPSEVLPT SSSMENSTTS SVVPRPAPPE
PEPGISIIIL LVYRTLGGLL PAQFQAERRG ARLPQNPVMN SPVVSVAVFH GRNFLRGVLE
SPISLEFRLL QTANRSKAIC VQWDPPGLAE QHGVWTARDC ELVHRNGSHA RCRCSRTGTF
GVLMDASPRE RLEGDLELLA VFTHVVVAVS VAALVLTAAV LLSLRSLKSN VRGIHANVAA
TLGMAELLFL LGIHRTHNQL VCTAVAILLH YFFLSTFAWL FVQGLHLYRM QVEPRNVDSG
AMRFYHALGW GVPAVLLGLA VGLDPEGYGN PDFCWISVHE PLIWSFAGPI VLVIVMNGTM
FLLAARTSCS TGQREAKKTS ALTLRSSFLL LLLVSASWLF GLLAVNHSIL AFHYLHAGLC
GLQGLAALLL FCVLNADARA AWTPACLGRK AAPEEARPAP GTGPGAYNNT ALFEESGLIR
ITLGASTVSS VSSARSGRTQ DQDSQRGRSY LRDNVLVRHG SAADHTDHSL QAHAGPTDLD
VAMFHRDAGA DSDSDSDLSL EEERSLSIPS SESEDNGRTR GRFQRPLRRA AQSERLLTHP
KDVDGNDLLS YWPALGECEA APCALQTWGS ERRLGLDTNK DAANNNHPDP ALTSGDETSL
GRAQRQRKGI LKNPLQYPLV PQTRGAPELS WCRAATLGHR AVPAASYGRI YAGRGTGSLS
QPASRYSSRE QLDLLLRRQL SRERLEEAPD PVPHPLSRPG SQECMDAAPG RLEPRDRGST
LPRRQPPRDY PGAMAGRFGS RDALDLGAPR EWLSTLPPPH CTRNLDPQPP PLSLSPQRQL
SRDPLLPSRP LDSLSRSSNS RERLDQVPSR HPSREALGPP PQLLRAREDP VSGPSHGPST
EQLDILSSIL ASFNSSALSS VQSSSTPSGP HTTATPSATA SVLGPSTPRS ATSHSISELS
PDSEVPRSEG HS
//