ID G3PAJ2_GASAC Unreviewed; 1104 AA.
AC G3PAJ2;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 73.
DE RecName: Full=Heparan sulfate proteoglycan 2 {ECO:0008006|Google:ProtNLM};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000014616.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000014616.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000014616.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3PAJ2; -.
DR STRING; 69293.ENSGACP00000014616; -.
DR Ensembl; ENSGACT00000014642.1; ENSGACP00000014616.1; ENSGACG00000011047.1.
DR eggNOG; KOG3509; Eukaryota.
DR GeneTree; ENSGT00940000156670; -.
DR InParanoid; G3PAJ2; -.
DR OMA; EINICIT; -.
DR TreeFam; TF326548; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR Bgee; ENSGACG00000011047; Expressed in heart and 12 other cell types or tissues.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00054; EGF_CA; 3.
DR CDD; cd00110; LamG; 3.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 2.60.40.10; Immunoglobulins; 4.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR PANTHER; PTHR15036:SF85; SP2353, ISOFORM A; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF07679; I-set; 3.
DR Pfam; PF13927; Ig_3; 1.
DR Pfam; PF00054; Laminin_G_1; 3.
DR SMART; SM00181; EGF; 4.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00409; IG; 4.
DR SMART; SM00408; IGc2; 4.
DR SMART; SM00282; LamG; 3.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF48726; Immunoglobulin; 4.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Membrane {ECO:0000256|ARBA:ARBA00022989};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022989};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989}.
FT DOMAIN 12..94
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 105..192
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 197..281
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 286..368
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 374..551
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 547..584
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 587..625
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 630..810
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 806..843
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 845..878
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 906..1102
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DISULFID 574..583
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 596..613
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 615..624
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 833..842
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 868..877
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1104 AA; 119155 MW; 70C5FBB7AAD6BF3D CRC64;
EVTVMLDVET PPYATSMPDD VAVRVGEVIR LQCLAHGTPP LTYTWTKLDG NLPPRAQVSG
GDLQINLASA EDAGSYKCVA SNKVANSEVI AKVTVRCKFT ENSFPSAAPL AVRVSPQVEV
KAQGSAVEFT CSSAGGIETK IEWLKEGGAL PPNHHIKDGV LRIENLEQSN EGVYICRASS
VYGQAQDAAR LTIQALPKVM INVRTSVQTV MIGNSVEFEC QAVGDPEPTV RWSKVGGSLP
AHIMVKGGML RIEKVTEADA GKYRCTATNN VGSVQSQVVL NVQSLPQIAT LPETKEVTVG
SDAVLPCVAS GYPAPEIKWS KVEKELPPKC FQEHNVLTVP RVTHDDSGIY VCTASNKQGK
VEAFTTLQVH ERVMPYFAQE PLSYLTLPTI KNAYKAFSIK INFRPDNVDG MLLYNGQKKT
TGADFISLGL VGGRVEFRFD VGSGMATIRD PNPVKMGEFH TVELYRNHTL GYITVDGREP
INGTSQGKFQ GLDLNEEVHV GGYPNYTVLA KTAGINTGFV GCIRQLVIQG EEVIFKDLDR
SSTGVTNCPT CKDHPCQNGG VCEDSDASLY MCGCPRGFTG SNCQHHSSLH CHPEACGPDA
TCIPRTSSLG YECRCHLGKF GNKCMDGELV TTPLFGEESY IAYPPLTNVH DDLRVELEFK
PMERDGLMFF CGGKKMKVED FVAISMVEGH VEFRYELGTG QAILLSPEPV SLGQWHSVVA
ERNKRVGHLK VDQGPVEKKT SPGKAQGLNI HTHMYLGGVP DMDILPKPAN ISELFEGCIG
EVSINNKQVD LSYSFTDSRS IRKCVDNSPC DRRPCLNGGD CLSSFEYEYQ CLCKDGFEGE
RCEVVRDACQ SNLQCQNGGS CVKGQCVCAP GRTGLNCEEI SSLYNHPSGS LHPSATGGHC
SNSPYQYAAH FHDDGYIALP KSVFPRSVSA HDSPETIELE INTSSSEGLI LWQGVERGEH
GKGKDFVSLG LQNGHLVFSY QLGSGEAEIL SQKAINDGRW HKVTAVRTGK NGYIQIDGGV
ELSGQSKGRS LMVNTKGSIY LGGPPATGGA PDMAAMTGGK FSSGMTGCVR NLAMMNARPG
QQPAQAIDLQ THAAHGINVL PCSS
//