ID A0A286ZS39_PIG Unreviewed; 2108 AA.
AC A0A286ZS39;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE RecName: Full=Agrin {ECO:0000256|ARBA:ARBA00016077};
GN Name=AGRN {ECO:0000313|Ensembl:ENSSSCP00000034280.1,
GN ECO:0000313|VGNC:VGNC:85188};
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Suina; Suidae; Sus.
OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000034280.1, ECO:0000313|Proteomes:UP000008227};
RN [1] {ECO:0000313|Ensembl:ENSSSCP00000034280.1, ECO:0000313|Proteomes:UP000008227}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Duroc {ECO:0000313|Ensembl:ENSSSCP00000034280.1,
RC ECO:0000313|Proteomes:UP000008227};
RG Porcine genome sequencing project;
RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSSCP00000034280.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR SMR; A0A286ZS39; -.
DR STRING; 9823.ENSSSCP00000034280; -.
DR Ensembl; ENSSSCT00000065088.2; ENSSSCP00000034280.1; ENSSSCG00000022204.4.
DR VGNC; VGNC:85188; AGRN.
DR GeneTree; ENSGT00940000158337; -.
DR InParanoid; A0A286ZS39; -.
DR Reactome; R-SSC-1971475; A tetrasaccharide linker sequence is required for GAG synthesis.
DR Reactome; R-SSC-2022928; HS-GAG biosynthesis.
DR Reactome; R-SSC-2024096; HS-GAG degradation.
DR Reactome; R-SSC-3000178; ECM proteoglycans.
DR Reactome; R-SSC-975634; Retinoid metabolism and transport.
DR Proteomes; UP000008227; Chromosome 6.
DR Bgee; ENSSSCG00000022204; Expressed in endocardial endothelium and 43 other cell types or tissues.
DR ExpressionAtlas; A0A286ZS39; baseline and differential.
DR GO; GO:0005604; C:basement membrane; IEA:Ensembl.
DR GO; GO:0005576; C:extracellular region; IEA:UniProt.
DR GO; GO:0005886; C:plasma membrane; IEA:GOC.
DR GO; GO:0045202; C:synapse; IEA:GOC.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0043236; F:laminin binding; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0007213; P:G protein-coupled acetylcholine receptor signaling pathway; IEA:InterPro.
DR GO; GO:0007528; P:neuromuscular junction development; IBA:GO_Central.
DR GO; GO:0043113; P:receptor clustering; IBA:GO_Central.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00055; EGF_Lam; 2.
DR CDD; cd00104; KAZAL_FS; 9.
DR CDD; cd00110; LamG; 3.
DR Gene3D; 2.40.50.120; -; 1.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 3.30.60.30; -; 9.
DR Gene3D; 2.10.25.10; Laminin; 5.
DR Gene3D; 3.30.70.960; SEA domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR003884; FacI_MAC.
DR InterPro; IPR003645; Fol_N.
DR InterPro; IPR002350; Kazal_dom.
DR InterPro; IPR036058; Kazal_dom_sf.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR004850; NtA_dom.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR036364; SEA_dom_sf.
DR InterPro; IPR008993; TIMP-like_OB-fold.
DR PANTHER; PTHR15036:SF83; AGRIN; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF00050; Kazal_1; 1.
DR Pfam; PF07648; Kazal_2; 8.
DR Pfam; PF00053; Laminin_EGF; 2.
DR Pfam; PF00054; Laminin_G_1; 4.
DR Pfam; PF03146; NtA; 1.
DR Pfam; PF01390; SEA; 1.
DR PRINTS; PR00011; EGFLAMININ.
DR SMART; SM00181; EGF; 7.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00180; EGF_Lam; 2.
DR SMART; SM00057; FIMAC; 2.
DR SMART; SM00274; FOLN; 6.
DR SMART; SM00280; KAZAL; 9.
DR SMART; SM00282; LamG; 3.
DR SMART; SM00200; SEA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF100895; Kazal-type serine protease inhibitors; 9.
DR SUPFAM; SSF82671; SEA domain; 1.
DR SUPFAM; SSF50242; TIMP-like; 1.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 2.
DR PROSITE; PS51465; KAZAL_2; 9.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
DR PROSITE; PS51121; NTA; 1.
DR PROSITE; PS50024; SEA; 1.
PE 1: Evidence at protein level;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Heparan sulfate {ECO:0000256|ARBA:ARBA00023207};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00023207};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0A286ZS39};
KW Reference proteome {ECO:0000313|Proteomes:UP000008227};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..2108
FT /note="Agrin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012561090"
FT DOMAIN 34..161
FT /note="NtA"
FT /evidence="ECO:0000259|PROSITE:PS51121"
FT DOMAIN 200..248
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 268..323
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 349..395
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 412..467
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 493..540
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 545..605
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 623..670
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 708..756
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 797..850
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 851..897
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 926..975
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 1132..1254
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 1329..1367
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1372..1580
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1581..1618
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1650..1687
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1698..1881
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1877..1916
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1927..2105
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REGION 1010..1104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1285..1335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1010..1037
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1293..1312
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1314..1331
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 35..107
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00443"
FT DISULFID 797..809
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 799..816
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 818..827
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 851..863
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 853..870
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 872..881
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1338..1355
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1357..1366
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1677..1686
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1906..1915
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2108 AA; 223048 MW; 722B2E01975718A5 CRC64;
MASLRPSGSA PLQPLLLLLL LVVAARALPS ADGTCPERAL ERREEEANVV LTGTVEEILN
VDPVQHTYSC KVRVWRYLKG KDIVTQESLL DGGNKVVIGG FGDPLICDNQ VSTGDTRIFF
VNPAPPYLWP AHKNELMLNS SLMRITLRNL EEVEHCVEDK PGTHFTPVPP TPPDACRGML
CGFGAVCEPS AEGPGRASCV CKKSTCPNVV APVCGSDAST YSNECELQRA QCSQQRRIRL
LRRGPCGMRD PCSNITCSFG STCARSSDGL TATCLCPATC LGAPESPVCG SDGTDYPSEC
QLLRHACARQ ENVFKKFDGP CDPCQGALND LSRVCRVNPR TRHPEMLLRP ESCPSRQAPV
CGDDGVTYDN DCVMGRTGAT RGLLLQKVRS GQCLPRDQCP ETCRFSAVCL SRRGRPRCSC
DRVICDGAYR PVCAHDGHTY DNDCWRQQAE CQQQRSIPAK HQGPCDQPPS PCLGVQCPFG
ATCAVKNGEA ECVCQQTCSG LYDPVCGSDG VTYGSACELE ATACALGQEI RVARRGPCDR
CGQCRFGALC EAETGRCVCP SECVASAQPV CGSDGHTYAS ECELHVHACT HQISLHVASA
GHCQTCGESV CAFGAVCLAG QCVCPRCEHP PPGPVCGSDG VTYRSACELR EAACQQQRQI
EEARGGPCEQ AECGSGGSGS GEDGECEQEL CRQRGGIWDE DSEDGPCVCD FSCQGVLRSP
VCGSDGVTYG TECELKKARC DSRQELYVVA QGACRSPTLA PRPPAVPLHC AQAPYGCCQD
NITAARGVGL AGCPSSCQCN PHGSYSGTCD PATGQCSCRP GVGGLKCDRC EPGFWNFRGI
VTDGRSGCTP CSCDPQGAVR DDCEQMTGLC SCKPGVAGPK CGQCPDGHAL GPAGCETDPS
APKTCAEMPC EFGASCVEEA GSAHCVCPTP TCLGANATKV CGSDGVTYGN ECQLRTIACR
QGLDISVQSL GPCQEGIASG PRPTSASVAA SWLDLSKALL PPASALPLAP SGSHLSQPAS
RPSSQPWTTA SIPRTTTRPV PTLPPTPPPA AASLAMSAFG ESGSADGSSD EELSGDLEAS
GAGSGGLLPP EGDGAGTPGP PAERASCYNS PVGCCSDGRT PSLDAEGSNC PATKVFQGVL
ELEGVEGQEL FYTPEMADPK SELFGETARS IESALDDLFR NSDVKKDFRS IRLRDLGPGS
SVRAIVDVHF DPTTTFRAPD VGRALLRQIQ VSRRRSLGVR RPLQEHVRFM DFDWFPAFFT
GATPAAATAR ATTVARLPPS AAATRALYPS HTSRPVARTT APLTTRQPPT TAPSRVPGRR
PLPPGTQKPP SPCDSQPCLH GGTCQDQGPG GAFTCSCPAG RGGTFCEEAL PLSRPAFGGR
SFLAFPTLRA YHTLRLALEF RALEPEGLLL YNGNARGKDF LALALLGGRV QLRWVVGGWG
WGGARPGRGP GSTRGCPEHL LPPSRFDTGS GPAVLTSSVP VQPGRWHRLE LSRHWRRGTL
SVDGETPVLG QSPSGTDGLN LDTDLFVGGV PEDQAAVVLE RTSVSVGLRG CIRLLDVNNQ
RLELSAWQGS ATRSSRVGEC GDHPCVPSPC LGSAPCQALE AGMFHCQCPP GRFGEDRTWA
RHWGDLCVCE GLTPPGTPSR LPPGPTCAEE KGPCQPNPCH GAAPCHVLPQ GEAQCECPRG
RGGSLCQTAS ERDDAMQPFL ADFNSFSYLE LKGLHTFERD LGEKMELEVV FLARGPSGLL
LYNGQKTDGK GDFVSLALHN RLLEFRYDLG KGAAVIRSKE PVALGVWTRV SLERNGRKGA
MRVGDGPRVL GESPVPHTIL NLKEPLYIGG APDFSRLARA AAVSSGFDGA IQLVALNGRQ
LLTREHVVQA VDVSSFADHP CTQAAGHPCL NGASCLPRGA SYECLCPGGF SGLHCEKGLI
EKSAGDLDAL AFDGRTYVEY LNAVTESEKA LQSNHFELSL RTEATQGLVL WSGKATERAD
YIALAIVDGH LQLTYDLGSQ PVVLRSTVAV NTNRWLRVRA HRDQREGSLQ VGNEAPVTGS
SPLGATQLDT DGALWLGGLE KLPGGQALPK AYSTGFVGCL RDVVVGRRPL HLLEDAVTKP
ELRPCPTP
//