GenomeNet

Database: UniProt
Entry: G3RKM9_GORGO
LinkDB: G3RKM9_GORGO
Original site: G3RKM9_GORGO 
ID   G3RKM9_GORGO            Unreviewed;      3312 AA.
AC   G3RKM9;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   28-FEB-2018, sequence version 2.
DT   27-MAR-2024, entry version 83.
DE   SubName: Full=Cadherin EGF LAG seven-pass G-type receptor 3 {ECO:0000313|Ensembl:ENSGGOP00000016301.3};
GN   Name=CELSR3 {ECO:0000313|Ensembl:ENSGGOP00000016301.3};
OS   Gorilla gorilla gorilla (Western lowland gorilla).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Gorilla.
OX   NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000016301.3, ECO:0000313|Proteomes:UP000001519};
RN   [1] {ECO:0000313|Ensembl:ENSGGOP00000016301.3, ECO:0000313|Proteomes:UP000001519}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Scally A.;
RT   "Insights into the evolution of the great apes provided by the gorilla
RT   genome.";
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSGGOP00000016301.3, ECO:0000313|Proteomes:UP000001519}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=22398555; DOI=10.1038/nature10842;
RA   Scally A., Dutheil J.Y., Hillier L.W., Jordan G.E., Goodhead I.,
RA   Herrero J., Hobolth A., Lappalainen T., Mailund T., Marques-Bonet T.,
RA   McCarthy S., Montgomery S.H., Schwalie P.C., Tang Y.A., Ward M.C., Xue Y.,
RA   Yngvadottir B., Alkan C., Andersen L.N., Ayub Q., Ball E.V., Beal K.,
RA   Bradley B.J., Chen Y., Clee C.M., Fitzgerald S., Graves T.A., Gu Y.,
RA   Heath P., Heger A., Karakoc E., Kolb-Kokocinski A., Laird G.K., Lunter G.,
RA   Meader S., Mort M., Mullikin J.C., Munch K., O'Connor T.D., Phillips A.D.,
RA   Prado-Martinez J., Rogers A.S., Sajjadian S., Schmidt D., Shaw K.,
RA   Simpson J.T., Stenson P.D., Turner D.J., Vigilant L., Vilella A.J.,
RA   Whitener W., Zhu B., Cooper D.N., de Jong P., Dermitzakis E.T.,
RA   Eichler E.E., Flicek P., Goldman N., Mundy N.I., Ning Z., Odom D.T.,
RA   Ponting C.P., Quail M.A., Ryder O.A., Searle S.M., Warren W.C.,
RA   Wilson R.K., Schierup M.H., Rogers J., Tyler-Smith C., Durbin R.;
RT   "Insights into hominid evolution from the gorilla genome sequence.";
RL   Nature 483:169-175(2012).
RN   [3] {ECO:0000313|Ensembl:ENSGGOP00000016301.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: Receptor that may have an important role in cell/cell
CC       signaling during nervous system formation.
CC       {ECO:0000256|ARBA:ARBA00002066}.
CC   -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004651};
CC       Multi-pass membrane protein {ECO:0000256|ARBA:ARBA00004651}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004141}.
CC   -!- SIMILARITY: Belongs to the G-protein coupled receptor 2 family. LN-TM7
CC       subfamily. {ECO:0000256|ARBA:ARBA00010933}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CABD030021211; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; CABD030021212; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   Ensembl; ENSGGOT00000016763.3; ENSGGOP00000016301.3; ENSGGOG00000016683.3.
DR   eggNOG; KOG4289; Eukaryota.
DR   GeneTree; ENSGT00940000160077; -.
DR   HOGENOM; CLU_000158_0_0_1; -.
DR   OMA; ECETRWG; -.
DR   TreeFam; TF323983; -.
DR   Proteomes; UP000001519; Chromosome 3.
DR   Bgee; ENSGGOG00000016683; Expressed in cerebellum and 3 other cell types or tissues.
DR   GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0004930; F:G protein-coupled receptor activity; IEA:UniProtKB-KW.
DR   GO; GO:0007166; P:cell surface receptor signaling pathway; IEA:InterPro.
DR   GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR   CDD; cd11304; Cadherin_repeat; 9.
DR   CDD; cd00054; EGF_CA; 5.
DR   CDD; cd00055; EGF_Lam; 2.
DR   CDD; cd00110; LamG; 2.
DR   Gene3D; 2.60.120.200; -; 2.
DR   Gene3D; 2.60.220.50; -; 1.
DR   Gene3D; 2.60.40.60; Cadherins; 9.
DR   Gene3D; 4.10.1240.10; GPCR, family 2, extracellular hormone receptor domain; 1.
DR   Gene3D; 2.10.25.10; Laminin; 7.
DR   Gene3D; 1.20.1070.10; Rhodopsin 7-helix transmembrane proteins; 1.
DR   InterPro; IPR002126; Cadherin-like_dom.
DR   InterPro; IPR015919; Cadherin-like_sf.
DR   InterPro; IPR020894; Cadherin_CS.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR032471; GAIN_dom_N.
DR   InterPro; IPR046338; GAIN_dom_sf.
DR   InterPro; IPR017981; GPCR_2-like_7TM.
DR   InterPro; IPR036445; GPCR_2_extracell_dom_sf.
DR   InterPro; IPR001879; GPCR_2_extracellular_dom.
DR   InterPro; IPR000832; GPCR_2_secretin-like.
DR   InterPro; IPR017983; GPCR_2_secretin-like_CS.
DR   InterPro; IPR000203; GPS.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR002049; LE_dom.
DR   PANTHER; PTHR24026:SF38; CADHERIN EGF LAG SEVEN-PASS G-TYPE RECEPTOR 3; 1.
DR   PANTHER; PTHR24026; FAT ATYPICAL CADHERIN-RELATED; 1.
DR   Pfam; PF00002; 7tm_2; 1.
DR   Pfam; PF00028; Cadherin; 8.
DR   Pfam; PF00008; EGF; 3.
DR   Pfam; PF16489; GAIN; 1.
DR   Pfam; PF01825; GPS; 1.
DR   Pfam; PF02793; HRM; 1.
DR   Pfam; PF00053; Laminin_EGF; 1.
DR   Pfam; PF02210; Laminin_G_2; 2.
DR   PRINTS; PR00205; CADHERIN.
DR   PRINTS; PR00249; GPCRSECRETIN.
DR   SMART; SM00112; CA; 9.
DR   SMART; SM00181; EGF; 6.
DR   SMART; SM00179; EGF_CA; 5.
DR   SMART; SM00180; EGF_Lam; 1.
DR   SMART; SM00303; GPS; 1.
DR   SMART; SM00008; HormR; 1.
DR   SMART; SM00282; LamG; 2.
DR   SUPFAM; SSF49313; Cadherin-like; 9.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR   SUPFAM; SSF57196; EGF/Laminin; 4.
DR   PROSITE; PS00232; CADHERIN_1; 6.
DR   PROSITE; PS50268; CADHERIN_2; 8.
DR   PROSITE; PS00022; EGF_1; 4.
DR   PROSITE; PS01186; EGF_2; 3.
DR   PROSITE; PS50026; EGF_3; 6.
DR   PROSITE; PS01248; EGF_LAM_1; 1.
DR   PROSITE; PS50027; EGF_LAM_2; 1.
DR   PROSITE; PS00650; G_PROTEIN_RECEP_F2_2; 1.
DR   PROSITE; PS50227; G_PROTEIN_RECEP_F2_3; 1.
DR   PROSITE; PS50261; G_PROTEIN_RECEP_F2_4; 1.
DR   PROSITE; PS50221; GPS; 1.
DR   PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE   3: Inferred from homology;
KW   Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW   ProRule:PRU00043}; Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   G-protein coupled receptor {ECO:0000256|ARBA:ARBA00023040};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW   ECO:0000256|PROSITE-ProRule:PRU00460};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW   Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW   Receptor {ECO:0000256|ARBA:ARBA00023170};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001519};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW   Transducer {ECO:0000256|ARBA:ARBA00023224};
KW   Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW   ECO:0000256|SAM:Phobius}.
FT   SIGNAL          1..32
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           33..3312
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5014178594"
FT   TRANSMEM        2537..2561
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2573..2593
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2599..2621
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2642..2662
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2682..2703
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2724..2745
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          326..433
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          434..545
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          546..651
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          652..756
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          757..858
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          859..961
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          962..1067
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1068..1169
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1375..1433
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          1435..1471
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          1475..1514
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          1515..1719
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DOMAIN          1722..1758
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          1764..1944
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DOMAIN          1946..1979
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          1982..2020
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          2077..2124
FT                   /note="Laminin EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50027"
FT   DOMAIN          2109..2191
FT                   /note="G-protein coupled receptors family 2 profile 1"
FT                   /evidence="ECO:0000259|PROSITE:PS50227"
FT   DOMAIN          2538..2775
FT                   /note="G-protein coupled receptors family 2 profile 2"
FT                   /evidence="ECO:0000259|PROSITE:PS50261"
FT   REGION          90..131
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          144..166
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          212..306
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2361..2398
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2830..2850
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2888..2927
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2978..3006
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3086..3312
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2375..2390
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2979..3000
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3109..3123
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3162..3177
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3191..3206
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3235..3302
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        1423..1432
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        1461..1470
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        1748..1757
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        2010..2019
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        2077..2089
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT   DISULFID        2079..2096
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT   DISULFID        2098..2107
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ   SEQUENCE   3312 AA;  358239 MW;  890F34CBDA72C903 CRC64;
     MMARRPPWRG LGGRSTPILL LLLLSLFPLS QEELGGGGHQ GWDPGLAATT GPRAHIGGGA
     LALCPESSGV REDGGPGLGV REPIFVGLRG RRQSARNSRG PPEQPNEELG IEHGVQPSGS
     RERETGQGPG SVLYWRSEVS SCGRTGPLQR GSLSPGALSS GVPGLGNSSP LPSDFLVRHH
     GPKLVSSQRN AGTGSRKRVG TARCCGELWA TGSKGQGERA TTSGAERTAP RRNCLPGASG
     SGPELDSAPR TARTAPASGS APRESRTAPE PAPKRMRSRG LFRRRFLPQR PGPRPPGLPA
     RPEARKITSA NRARFRRAAN RHPQFPQYNY QTLVPENEAA GTAVLRVVAQ DPDAGEAGRL
     VYSLAALMNS RSLELFSIDP QSGLIRTAAA LDRESMERHY LRVTAQDHGS PRLSATTMVA
     VTVADRNDHS PVFEQAQYRE TLRENVEEGY PILQLRATDG DAPPNANLRY RFVGPPAARA
     AAAAAFEIDP RSGLISTSGQ VDREHMESYE LVVEASDQGQ EPGPRSATVR VHITVLDEND
     NAPQFSEKRY VAQVREDVRP HTVVLRVTAT DRDKDANGLV HYNIISGNSR GHFAIDSLTG
     EIQVVAPLDF EAEREYALRI RAQDAGRPPL SNNTGLASIQ VVDINDHIPI FVSTPFQVSV
     LENAPLGHSV IHIQAVDADH GENARLEYSL TGVAPDTPFV INSATGWVSV SGPLDRESVE
     HYFFGVEARD HGSPPLSASA SVTVTVLDVN DNRPEFTMKE YHLRLNEDAA VGTSVVSVTA
     VDRDANSAIS YQITGGNTRN RFAISTQGGV GLVTLALPLD YKQERYFKLV LTASDRALHD
     HCYVHINITD ANTHRPVFQS AHYSVSVNED RPVGSTIVVI SASDDDVGEN ARITYLLEDN
     LPQFRIDADS GAITLQAPLD YEDQVTYTLA ITARDNGIPQ KADTTYVEVM VNDVNDNAPQ
     FVASHYTGLV SEDAPPFTSV LQISATDRDA HANGRVQYTF QNGEDGDGDF TIEPTSGIVR
     TVRRLDREAV SVYELTAYAV DRGVPPLRTP VSFQVMVQDV NDNAPVFPAE EFEVRVKENS
     IVGSVVAQIT AVDPDEGPNA HIMYQIVEGN IPELFQMDIF SGELTALIDL DYEARQEYVI
     VVQATSAPLV SRATVHVRLV DQNDNSPVLN NFQILFNNYV SNRSDTFPSG IIGRIPAYDP
     DVSDHLFYSF ERGNELQLLV VNQTSGELRL SRKLDNNRPL VASMLVTVTD GLHSVTAQCV
     LRVVIITEEL LANSLTVRLE NMWQERFLSP LLGRFLEGVA AVLATPAEDV FIFNIQNDTD
     VGGTVLNVSF SALAPRGAGA GAAGPWFSSE ELQEQLYVRR AALAARSLLD VLPFDDNVCL
     REPCENYMKC VSVLRFDSSA PFLASASTLF RPIQPIAGLR CRCPPGFTGD FCETELDLCY
     SNPCRNGGAC ARREGGYTCV CRPRFTGEDC ELDTEAGRCV PGVCRNGGTC TDAPNGGFRC
     QCPAGGAFEG PRCEVAARSF PPSSFVMFRG LRQRFHLTLS LSFATVQQSG LLFYNGRLNE
     KHDFLALELV AGQVRLTYST GESNTVVSPT VPGGLSDGQW HTVHLRYYNK PRTDALGGAQ
     GPSKDKVAVL SVDDCDVAVA LQFGAEIGNY SCAAAGVQTS SKKSLDLTGP LLLGGVPNLP
     ENFPVSHKDF IGCMRDLHID GRRVDMAAFV ANNGTMAGCQ AKLHFCDSGP CKNSGFCLER
     WGGFSCDCPV GFGGKDCRLT MAHPHHFRGN GTLSWNFGSD MAVSVPWYLG LAFRTRATQG
     VLMQVQAGPH STLLCQLDRG LLSVTVTRGS GRASHLLLDQ VTVSDGRWHD LRLELQEEPG
     GRRGHHVLMV SLDFSLFQDT MAVGSELQGL KVKQLHVGGL PPGSAEEAPQ GLVGCIQGVW
     LGSTPSGSPA LLPPSHRVNV EPGCVVTNAC ASGPCPPHAD CRDHWQTFSC TCRPGYYGPG
     CVDACLLNPC QNQGSCRHLP GAPHGYTCDC VGGYFGHHCE HRMDQQCPRG WWGSPTCGPC
     NCDVHKGFDP NCNKTNGQCH CKEFHYRPRG SDSCLPCDCY PVGSTSRSCA PHSGQCPCRP
     GALGRQCNSC DSPFAEVTAS GCRVLYDACP KSLRSGVWWP QTKFGILATV PCPRGALGLT
     RGCLPPGAAV RLCDEAQGWL EPDLFNCTSP AFRELSLLLD GLELNKTALD TMEAKKLAQR
     LREVTGHTDH YFSQDVRVTA RLLAHLLAFE SHQQGFGLTA TQDAHFNEVG LCPMFSVPWP
     PSSQPPAPGS PAQAPAVLLG SEGPGLSLCP CPLQPHPLAC WMLSIDRMEH PSSPRGARRY
     PRYHSNLFRG QDAWDPHTHV LLPSQSPRPW ETPGPILPTS SSTENSTTSS VVPPPAPPEP
     EPGISIIILL VYRTLGGLLP AQFQAERRGA RLPQNPVMNS PVVSVAVFHG RNFLRGILES
     PISLEFRLLQ TANRSKAICV QWDPPGLAEQ HGVWTARDCE LVHRNGSHAR CRCSRTGTFG
     VLMDASPRER LEGDLELLAV FTHVVVAVSV AALVLTAAVL LSLRSLKSNV RGIHANVAAA
     LGVAELLFLL GIHRTHNQLV CTAVAILLHY FFLSTFAWLF VQGLHLYRMQ VEPRNVDRGA
     MRFYHALGWG VPAVLLGLAV GLDPEGYGNP DFCWISVHEP LIWSFAGPVV LVIVMNGTMF
     LLAARTSCST GQREAKKTSA LRTLRSSFLL LLLVSASWLF GLLAVNHSIL AFHYLHAGLC
     GLQGLAVLLL FCVLNADARA AWTPACLGRK AAPEEARPAP GMGPGAYNNT ALFEESGLIR
     ITLGASTVSS VSSTRSGRTQ DQDSQRGRSY LRDNVLVRHG SAADHTDHSL QAHAGPTDLD
     VAMFHRDAGA DSDSDSDLSL EEERSLSIPS SESEDNGRTR GRFQRPLCRA AQSERLLTHP
     KDVDGNDLLS YWPALGECEA APCALQTWGS ERRLGLDTSK DAANNNQPDP ALTSGDETSL
     GRAQRQRKGI LKNRLQYPLV PQTRGAPELS WCRAATLGHR AVPAASYGRI YAGGGTGSLS
     QPASRYSSRE QLDLLLRRQL SRERLEEAPA PVLRPLSRPG SQECMDAAPG RLEPRDRGST
     LPRRQPPRDY PGAMAGRFGS RDALDLGAPR EWLSTLPPPR RTRDLDPQPP PLPLSPQRQL
     SRDPLLPSRP LDSLSRSSNS REQLDQVPSR HPSREALGPP PQLLRAREDP VSGPSHGPST
     EQLDILSSIL ASFNSSALSS VQSSSTPLGP HTTATPSATA SVLGPSTPRS ATSHSISELS
     PDSEVPRSEG HS
//
DBGET integrated database retrieval system