ID H2L4F8_ORYLA Unreviewed; 1950 AA.
AC H2L4F8;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 79.
DE RecName: Full=Agrin {ECO:0000256|ARBA:ARBA00016077};
GN Name=agrn {ECO:0000313|Ensembl:ENSORLP00000000649.2};
OS Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC Oryzias.
OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000000649.2, ECO:0000313|Proteomes:UP000001038};
RN [1] {ECO:0000313|Ensembl:ENSORLP00000000649.2, ECO:0000313|Proteomes:UP000001038}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000000649.2,
RC ECO:0000313|Proteomes:UP000001038};
RX PubMed=17554307; DOI=10.1038/nature05846;
RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT "The medaka draft genome and insights into vertebrate genome evolution.";
RL Nature 447:714-719(2007).
RN [2] {ECO:0000313|Ensembl:ENSORLP00000000649.2}
RP IDENTIFICATION.
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000000649.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8090.ENSORLP00000000649; -.
DR Ensembl; ENSORLT00000000649.2; ENSORLP00000000649.2; ENSORLG00000000529.2.
DR eggNOG; KOG3509; Eukaryota.
DR GeneTree; ENSGT00940000158337; -.
DR HOGENOM; CLU_001582_1_0_1; -.
DR InParanoid; H2L4F8; -.
DR TreeFam; TF326548; -.
DR Proteomes; UP000001038; Chromosome 7.
DR Bgee; ENSORLG00000000529; Expressed in brain and 15 other cell types or tissues.
DR GO; GO:0005604; C:basement membrane; IEA:UniProt.
DR GO; GO:0005576; C:extracellular region; IEA:UniProt.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0009952; P:anterior/posterior pattern specification; IEA:Ensembl.
DR GO; GO:0007409; P:axonogenesis; IEA:Ensembl.
DR GO; GO:0007417; P:central nervous system development; IEA:Ensembl.
DR GO; GO:0007422; P:peripheral nervous system development; IEA:Ensembl.
DR GO; GO:0010842; P:retina layer formation; IEA:Ensembl.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00055; EGF_Lam; 2.
DR CDD; cd00104; KAZAL_FS; 9.
DR CDD; cd00110; LamG; 3.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 3.30.60.30; -; 9.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 3.30.70.960; SEA domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR003884; FacI_MAC.
DR InterPro; IPR003645; Fol_N.
DR InterPro; IPR002350; Kazal_dom.
DR InterPro; IPR036058; Kazal_dom_sf.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR036364; SEA_dom_sf.
DR PANTHER; PTHR15036:SF83; AGRIN; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF00008; EGF; 3.
DR Pfam; PF00050; Kazal_1; 1.
DR Pfam; PF07648; Kazal_2; 8.
DR Pfam; PF00053; Laminin_EGF; 2.
DR Pfam; PF00054; Laminin_G_1; 3.
DR Pfam; PF01390; SEA; 1.
DR PRINTS; PR00011; EGFLAMININ.
DR SMART; SM00181; EGF; 8.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00180; EGF_Lam; 2.
DR SMART; SM00057; FIMAC; 6.
DR SMART; SM00274; FOLN; 6.
DR SMART; SM00280; KAZAL; 9.
DR SMART; SM00282; LamG; 3.
DR SMART; SM00200; SEA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF100895; Kazal-type serine protease inhibitors; 9.
DR SUPFAM; SSF82671; SEA domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 5.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 1.
DR PROSITE; PS51465; KAZAL_2; 9.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
DR PROSITE; PS50024; SEA; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Heparan sulfate {ECO:0000256|ARBA:ARBA00023207};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460}; Membrane {ECO:0000256|SAM:Phobius};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00023207};
KW Reference proteome {ECO:0000313|Proteomes:UP000001038};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 24..53
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 93..141
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 161..216
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 234..288
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 288..319
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 305..360
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 380..434
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 440..500
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 518..565
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 601..649
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 691..744
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 820..869
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 1039..1161
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 1232..1268
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1273..1449
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1450..1487
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1489..1526
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1534..1717
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1713..1749
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1760..1946
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REGION 959..1004
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1182..1218
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1198..1218
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 292..302
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 691..703
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 693..710
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 712..721
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1258..1267
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1477..1486
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1516..1525
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1739..1748
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1950 AA; 208303 MW; C7A8C20714B4EAE5 CRC64;
MSGCHYPSPP GPERERRYQH KVSLVVRYFM IPCNICLILL ATSTLGFAVL LFLNNYKPSH
FTPTPVDGCR GMLCGFGAVC ERDPADPAKA DCVCKKGDCP SLVAPVCGSD SSTYSNECEM
ERAQCKAQRR IKVLRRGPCS LKDPCAEVTC SYGSTCVQSS DGLAAKCMCP LGCDGKAGET
VCGSDGNDYP SECELHQHAC KNQKNIRVQH LGPCDPCKDA ENSLNVLCRV EALTRRPLLY
SPPESCPPGD DPLCASDGHT YPNECHMTAT GVQKGIKLQK IHAGQCRRLD ECQDECLFNA
VCVVEPEGPR CSCDPLECDD TYKPLCGRNG RTYTNDCLRR KAECEMKTLI SIRSQGPCDL
NTPSPCLDKV CKHGAVCVVK NNEPVCECPE ACQLTSDPVC GSDGHSYGSP CEMRAMGCAF
QKTIHIQHKG LCDEACANCS FGAICDLQSK RCVCPSECVK SRQPVCGSDG NTYDSECELH
VKACTQQTDL QVVSQGACKT CGPTVCAWGA LCVQNKCECQ QCVGEALSPL CGSDGKTYDN
ECELRRSSCL QNKKIDVEKD GSCDEDCGSG GSGSGAESCE QDRCRKYGGT WDEDAEDDRC
VCDFSCESVP HNAVCGSDGK NYSNECELKK AGCDRRELVS IQNHGPCAAL SAVTPTALTA
AQHCSQSAHG CCSDNATAAL GAGQAGCPST CQCNLYGSYK GTCDPATGQC SCKPGVGGQK
CDRCEPGFWN FRGIVTENMS GCTPCSCDPV GSIRDDCEQM SGLCSCKTAV KGLKCNVCPD
GSKMGMNGCD SGPEAPGSCA DLVCGFGATC IVVNGQAHCE CPSPDCDVKN KTKVCGSDGV
TYADQCQLKT IACRQDKDIL VAHTGQCTES IFEPADPHPP TTPSSTVPAA AITTPAPFNP
DNVWAVPPPM TAFTEWASTA TALPTHSSAR HPIHTQPQAL QPTTAASIIT SLSHSSPAPA
AAAVTSFEGS GSGEPSGDDL EESSGGGVPT EASGADDSVG TAAAASTPIA EKSCDNTEFG
CCPDDQTPSS TPEGANCPPT MMYGGFLHLD QVEGQEIFYL PEMNDTKSEL FGETARSIEN
ALNDLFRKSQ VRKDFMSVRV HKLAPSNSIV AIVEAHFRPD TRFTADDIME ALLKQLKASK
DTSISVKKPE GKNIHFSSSG LSSVPVFTTT TAAAVTILPP TTTTTRRPLK SRVTTHRPAT
SRRTTTTAAP STTTPLLTTT SRVRGGKLAP KTQRPCDSHP CLHGGTCEDD GSTYSCRCPA
GRGGPVCEKV MKYFIPSFGG QSYLAFPTMK AYHTVRIAMA FRASEMNGVL LYNGQRGSKD
FISLTLVNGK VELRFNTGSG TGTLTSKVKV SQGHWHQVEV TRNRRNGVLS VDGEPDVQGE
SPHGTDGLNL DTELFIGGVT EDLKQDVTAR TGVSAGLVGC IRMLDVNNRM LNLQEDGGDS
LFGSGVGECG NNPCEPNPCK NGAQCQVKEA EMFQCKCSKG FWGTLCADVR DPCAASKCHA
TSQCQVLPEG GYKCVCPMGR EGRHCEKVAE RRGAYMPTFN GDSYLELKGL HLYGHDLRQK
VSMMVVLMAN DSDGLIFYNG QKTDGKGDFI SLGLNNGILE FRYDLGKGPA IIRSKAPLPL
KVWNTINLER SLRKGEIRIN DQQPARGESP AVHSDLNLKE SLFVGGAPDY SKLARAAGLT
EGFKGTVQKI MLMSDPVLKE ENALSSSNVA MFQGHPCSQE PCQNGGRCNP MLATYECSCF
PGYVGDNCSV AIHEKSAGET EAVAFDGRTF IEYHNGVTKS QLTNEIPDEK ALLVNKFELS
IRTEATHGLL LWSGKGVERS DYIALAIVDG RVQMTYDLGS RPVVLRSSVR VNTNRWIHIK
ASRALRDGFL QVGNEAAVTG SSPLAATQLD TDGALWLGGL EELAVARRLP KAYSTGFVGC
IKDVVVDGVE LHLVEDALNS PKILHCSAAK
//