ID D4A2F1_RAT Unreviewed; 2047 AA.
AC D4A2F1;
DT 20-APR-2010, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 3.
DT 27-MAR-2024, entry version 89.
DE RecName: Full=Agrin {ECO:0000256|ARBA:ARBA00016077};
GN Name=Agrn {ECO:0000313|Ensembl:ENSRNOP00000040828.4,
GN ECO:0000313|RGD:2067};
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Rattus.
OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000040828.4, ECO:0000313|Proteomes:UP000002494};
RN [1] {ECO:0000313|Ensembl:ENSRNOP00000040828.4, ECO:0000313|Proteomes:UP000002494}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000040828.4,
RC ECO:0000313|Proteomes:UP000002494};
RX PubMed=15057822; DOI=10.1038/nature02426;
RG Rat Genome Sequencing Project Consortium;
RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J.,
RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G.,
RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G.,
RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G.,
RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S.,
RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T.,
RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., Smith D.,
RA Lee H.-M., Gustafson E., Cahill P., Kana A., Doucette-Stamm L.,
RA Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., Green E.D.,
RA Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., Zhu B., Marra M.,
RA Schein J., Bosdet I., Fjell C., Jones S., Krzywinski M., Mathewson C.,
RA Siddiqui A., Wye N., McPherson J., Zhao S., Fraser C.M., Shetty J.,
RA Shatsman S., Geer K., Chen Y., Abramzon S., Nierman W.C., Havlak P.H.,
RA Chen R., Durbin K.J., Egan A., Ren Y., Song X.-Z., Li B., Liu Y., Qin X.,
RA Cawley S., Cooney A.J., D'Souza L.M., Martin K., Wu J.Q.,
RA Gonzalez-Garay M.L., Jackson A.R., Kalafus K.J., McLeod M.P.,
RA Milosavljevic A., Virk D., Volkov A., Wheeler D.A., Zhang Z., Bailey J.A.,
RA Eichler E.E., Tuzun E., Birney E., Mongin E., Ureta-Vidal A., Woodwark C.,
RA Zdobnov E., Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J.,
RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., Schmidt J.,
RA Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., Abril J.F.,
RA Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., Poliakov A.,
RA Huebner N., Ganten D., Goesele C., Hummel O., Kreitler T., Lee Y.-A.,
RA Monti J., Schulz H., Zimdahl H., Himmelbauer H., Lehrach H., Jacob H.J.,
RA Bromberg S., Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E.,
RA Lazar J., Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M.,
RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., Webber C.,
RA Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., Elnitski L.,
RA Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., Miller W.,
RA Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., Zhang Y.,
RA Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., Clarke L., Curwen V.,
RA Durbin R.M., Eyras E., Searle S.M., Cooper G.M., Batzoglou S., Brudno M.,
RA Sidow A., Stone E.A., Payseur B.A., Bourque G., Lopez-Otin C., Puente X.S.,
RA Chakrabarti K., Chatterji S., Dewey C., Pachter L., Bray N., Yap V.B.,
RA Caspi A., Tesler G., Pevzner P.A., Haussler D., Roskin K.M., Baertsch R.,
RA Clawson H., Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J.,
RA Rosenbloom K.R., Trumbower H., Weirauch M., Cooper D.N., Stenson P.D.,
RA Ma B., Brent M., Arumugam M., Shteynberg D., Copley R.R., Taylor M.S.,
RA Riethman H., Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S.,
RA Mockrin S., Collins F.S.;
RT "Genome sequence of the Brown Norway rat yields insights into mammalian
RT evolution.";
RL Nature 428:493-521(2004).
RN [2] {ECO:0000313|Ensembl:ENSRNOP00000040828.4}
RP IDENTIFICATION.
RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000040828.4};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_006239603.1; XM_006239541.3.
DR Ensembl; ENSRNOT00000047854.4; ENSRNOP00000040828.4; ENSRNOG00000020205.9.
DR GeneID; 25592; -.
DR CTD; 375790; -.
DR RGD; 2067; Agrn.
DR VEuPathDB; HostDB:ENSRNOG00000020205; -.
DR GeneTree; ENSGT00940000158337; -.
DR OMA; AMEISPF; -.
DR Proteomes; UP000002494; Chromosome 5.
DR Bgee; ENSRNOG00000020205; Expressed in lung and 20 other cell types or tissues.
DR ExpressionAtlas; D4A2F1; baseline and differential.
DR GO; GO:0005604; C:basement membrane; IEA:Ensembl.
DR GO; GO:0005576; C:extracellular region; IEA:UniProt.
DR GO; GO:0005886; C:plasma membrane; IEA:GOC.
DR GO; GO:0045202; C:synapse; IEA:GOC.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0043236; F:laminin binding; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0007213; P:G protein-coupled acetylcholine receptor signaling pathway; IEA:InterPro.
DR GO; GO:0043113; P:receptor clustering; IEA:Ensembl.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00055; EGF_Lam; 2.
DR CDD; cd00104; KAZAL_FS; 9.
DR CDD; cd00110; LamG; 3.
DR Gene3D; 2.40.50.120; -; 1.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 3.30.60.30; -; 9.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 3.30.70.960; SEA domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR003884; FacI_MAC.
DR InterPro; IPR003645; Fol_N.
DR InterPro; IPR002350; Kazal_dom.
DR InterPro; IPR036058; Kazal_dom_sf.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR004850; NtA_dom.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR036364; SEA_dom_sf.
DR InterPro; IPR008993; TIMP-like_OB-fold.
DR PANTHER; PTHR15036:SF83; AGRIN; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF00008; EGF; 3.
DR Pfam; PF00050; Kazal_1; 1.
DR Pfam; PF07648; Kazal_2; 8.
DR Pfam; PF00053; Laminin_EGF; 2.
DR Pfam; PF00054; Laminin_G_1; 3.
DR Pfam; PF03146; NtA; 1.
DR Pfam; PF01390; SEA; 1.
DR PRINTS; PR00011; EGFLAMININ.
DR SMART; SM00181; EGF; 6.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00180; EGF_Lam; 2.
DR SMART; SM00057; FIMAC; 3.
DR SMART; SM00274; FOLN; 5.
DR SMART; SM00280; KAZAL; 9.
DR SMART; SM00282; LamG; 3.
DR SMART; SM00200; SEA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 3.
DR SUPFAM; SSF100895; Kazal-type serine protease inhibitors; 9.
DR SUPFAM; SSF82671; SEA domain; 1.
DR SUPFAM; SSF50242; TIMP-like; 1.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 2.
DR PROSITE; PS51465; KAZAL_2; 9.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
DR PROSITE; PS51121; NTA; 1.
DR PROSITE; PS50024; SEA; 1.
PE 1: Evidence at protein level;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Heparan sulfate {ECO:0000256|ARBA:ARBA00023207};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00023207};
KW Proteomics identification {ECO:0007829|PeptideAtlas:D4A2F1};
KW Reference proteome {ECO:0000313|Proteomes:UP000002494};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..2047
FT /note="Agrin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5035166315"
FT DOMAIN 32..159
FT /note="NtA"
FT /evidence="ECO:0000259|PROSITE:PS51121"
FT DOMAIN 198..246
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 266..321
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 347..393
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 410..465
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 491..538
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 549..603
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 610..668
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 706..754
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 795..848
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 849..895
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 917..973
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 1130..1252
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 1327..1365
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1370..1546
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1547..1584
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1586..1623
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1633..1820
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1816..1855
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1866..2044
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REGION 1058..1087
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1280..1335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 33..105
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00443"
FT DISULFID 795..807
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 797..814
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 816..825
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 849..861
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 851..868
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 870..879
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1336..1353
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1355..1364
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1574..1583
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1613..1622
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1845..1854
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2047 AA; 218373 MW; 67A46D61C4ACB969 CRC64;
MVSLRLCSRA PLLPPLLLLV VAARTLPGAS GTCPERALER REEEANVVLT GTVEEILNVD
PVQHTYSCKV RVWRYLKGKD VVAQESLLDG GNKVVIGGFG DPLICDNQVS TGDTRIFFVN
PAPPYMWPAH KNELMLNSSL MRITLRNLEE VEFCVEDKPG IHFTPAPPTP PDVCRGMLCG
FGAVCEPSVE DPGRASCVCK KNACPATVAP VCGSDASTYS NECELQRAQC NQQRRIRLLR
QGPCGSRDPC ANVTCSFGST CVPSADGQTA SCLCPTTCFG APDGTVCGSD GVDYPSECQL
LSHACASQEH IFKKFNGPCD PCQGSMSDLN HICRVNPRTR HPEMLLRPEN CPAQHTPICG
DDGVTYENDC VMSRIGATRG LLLQKVRSGQ CQTRDQCPET CQFNSVCLSR RGRPHCSCDR
VTCDGSYRPV CAQDGHTYNN DCWRQQAECR QQRAIPPKHQ GPCDQTPSPC HGVQCAFGAV
CTVKNGKAEC ECQRVCSGIY DPVCGSDGVT YGSVCELESM ACTLGREIQV ARRGPCDPCG
QCRFGSLCEV ETGRCVCPSE CVESAQPVCG SDGHTYASEC ELHVHACTHQ ISLYVASAGH
CQTCGEKVCT FGAVCSAGQC VCPRCEHPPP GPVCGSDGVT YLSACELREA ACQQQVQIEE
AHAGPCEPAE CGSGGSGSGE DDECEQELCR QRGGIWDEDS EDGPCVCDFS CQSVPRSPVC
GSDGVTYGTE CDLKKARCES QQELYVAAQG ACRGPTLAPL LPVAFPHCAQ TPYGCCQDNF
TAAQGVGLAG CPSTCHCNPH GSYSGTCDPA TGQCSCRPGV GGLRCDRCEP GFWNFRGIVT
DGHSGCTPCS CDPRGAVRDD CEQMTGLCSC RPGVAGPKCG QCPDGQVLGH LGCEADPMTP
VTCVEIHCEF GASCVEKAGF AQCICPTLTC PEANSTKVCG SDGVTYGNEC QLKAIACRQR
LDISTQSLGP CQESVTPGAS PTSASMTTPR HILSKTLPFP HNSLPLSPGS TTHDWPTPLP
ISPHTTVSIP RSTAWPVLTV PPTAAASDVT SLATSIFSES GSANGSGDEE LSGDEEASGG
GSGGLEPPVG SIVVTHGPPI ERASCYNSPL GCCSDGKTPS LDSEGSNCPA TKAFQGVLEL
EGVEGQELFY TPEMADPKSE LFGETARSIE STLDDLFRNS DVKKDFWSVR LRELGPGKLV
RAIVDVHFDP TTAFQASDVG QALLRQIQVS RPWALAVRRP LQEHVRFLDF DWFPTFFTGA
ATGTTAAMAT ARATTVSRLP ASSVTPRVYP SHTSRPVGRT TAPPTTRRPP TTATNMDRPR
TPGHQQPSKS CDSQPCLHGG TCQDQDSGKG FTCSCTAGRG GSVCEKVQPP SMPAFKGHSF
LAFPTLRAYH TLRLALEFRA LETEGLLLYN GNARGKDFLA LALLDGRVQF RFDTGSGPAV
LTSLVPVEPG RWHRLELSRH WRQGTLSVDG ETPVVGESPS GTDGLNLDTN LYVGGIPEEQ
VAMVLDRTSV GVGLKGCIRM LDINNQQLEL SDWQRAAVQS SGVGECGDHP CLPNPCHGGA
LCQALEAGMF LCQCPPGRFG PTCADEKSPC QPNPCHGAAP CRVLSSGGAK CECPLGRSGT
FCQTVLETAG SRPFLADFNG FSYLELKGLH TFERDLGEKM ALEMVFLARG PSGLLLYNGQ
KTDGKGDFVS LALHNRHLEF CYDLGKGAAV IRSKEPIALG TWVRVFLERN GRKGALQVGD
GPRVLGESPK SRKVPHTMLN LKEPLYIGGA PDFSKLARGA AVSSGFNGVI QLVSLRGHQL
LTQEHVLRAV DVSPFADHPC TQALGNPCLN GGSCVPREAT YECLCPGGFS GLHCEKGLVE
KSVGDLETLA FDGRTYIEYL NAVIESEKAL QSNHFELSLR TEATQGLVLW IGKAAERADY
MALAIVDGHL QLSYDLGSQP VVLRSTVKVN TNRWLRIRAH REHREGSLQV GNEAPVTGSS
PLGATQLDTD GALWLGGLQK LPVGQALPKA YGTGFVGCLR DVVVGHRQLH LLEDAVTKPE
LRPCPTP
//