ID R7UWX5_CAPTE Unreviewed; 816 AA.
AC R7UWX5;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 52.
DE RecName: Full=Agrin {ECO:0008006|Google:ProtNLM};
GN ORFNames=CAPTEDRAFT_223648 {ECO:0000313|EMBL:ELU10834.1};
OS Capitella teleta (Polychaete worm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC Sedentaria; Scolecida; Capitellidae; Capitella.
OX NCBI_TaxID=283909 {ECO:0000313|EMBL:ELU10834.1};
RN [1] {ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ELU10834.1, ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELU10834.1,
RC ECO:0000313|Proteomes:UP000014760};
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:CapteP223648}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQN01000930; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB297234; ELU10834.1; -; Genomic_DNA.
DR AlphaFoldDB; R7UWX5; -.
DR STRING; 283909.R7UWX5; -.
DR EnsemblMetazoa; CapteT223648; CapteP223648; CapteG223648.
DR HOGENOM; CLU_009840_0_0_1; -.
DR OMA; CGEGTIE; -.
DR Proteomes; UP000014760; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00104; KAZAL_FS; 1.
DR CDD; cd00110; LamG; 3.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 3.30.60.30; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR002350; Kazal_dom.
DR InterPro; IPR036058; Kazal_dom_sf.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR15036:SF83; AGRIN; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF12661; hEGF; 1.
DR Pfam; PF07648; Kazal_2; 1.
DR Pfam; PF00054; Laminin_G_1; 2.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00181; EGF; 4.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00280; KAZAL; 1.
DR SMART; SM00282; LamG; 3.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF100895; Kazal-type serine protease inhibitors; 1.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS51465; KAZAL_2; 1.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000014760}.
FT DOMAIN 2..59
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 147..330
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 331..367
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 370..407
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 412..590
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 586..624
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 633..814
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DISULFID 357..366
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 397..406
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 614..623
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 816 AA; 89273 MW; 38594019ABA60753 CRC64;
MEDGAPQCTC PSTCELSEKR SQPVCGTDRM THGDACQLRI FACRLMRDIR VAHEGPCADD
TTTRMPYQRP RKTTRHVSNH VTNRDSHSTL PYGDVGYFCK SDGDCFVDGT YCNSGRCSCR
PGYEATADNA ACQRSFDIML SDAGTGYMIP SFSGNSYLEL TKIHHGNSRI TIEMTFRPLK
PDGLLLFAAQ DQTGNGDFIS LSLVDGHVDF RYDLGSGISR LVSTSPVNMN VYHKVVAQRF
GKNGMLQVNG GHEVSGVSPG ALRSLNLKSP LFLGGLPIAT SKVVENIGTE KSFVGCIEVL
RITDEKNTKD YSLVYPASDD INLAVHIDEC MDNPCDSSPC QNDGTCMALQ TTYQCICPLK
YSGLTCTVEQ TLACESFPCA EGATCVDLPG GKFTCTCAGD QQGELCDQKI EIDVPQFNGA
SYLELKTTKN LETALNFEIW FLSTHPDGVL LYNEQDGEES GDFLSLNLVD GYLQFRFDLG
SGMADIKSSF PVPLNTWNKV SIIRNEKQGV LTINGAESDN GQSKGSLKQL NLHKKFYLGG
FPSAYHPDSG IISGFRGAIQ RVYEDNILVD GLYDSSISSA GISPYLGPPC PPESNPCTNG
GTCVPVLNDF QCRCTEGFSG KKCESTSVVR MKSNPVSFDG RTYHRYLNNI NEKFRAESKN
SFEIHFHTLG VRGLLLLVHK SETVAGDYLA IAINSGHLEV SFNLGKELST ELFSLRSPVR
VNDGHWHTLT FKRDERSASL QVDNGDVIYG SSKEGANQLD TNGDLWIGGS RNPPEGLPKS
YRSGFVGCIE SVYVNDSPLD LDTDRVNFSP LNFCDV
//