GenomeNet

Database: UniProt
Entry: R7UWX5_CAPTE
ID   R7UWX5_CAPTE            Unreviewed;       816 AA.
AC   R7UWX5;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   27-MAR-2024, entry version 52.
DE   RecName: Full=Agrin {ECO:0008006|Google:ProtNLM};
GN   ORFNames=CAPTEDRAFT_223648 {ECO:0000313|EMBL:ELU10834.1};
OS   Capitella teleta (Polychaete worm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC   Sedentaria; Scolecida; Capitellidae; Capitella.
OX   NCBI_TaxID=283909 {ECO:0000313|EMBL:ELU10834.1};
RN   [1] {ECO:0000313|Proteomes:UP000014760}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA   Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA   Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA   Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA   Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA   Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL   Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:ELU10834.1, ECO:0000313|Proteomes:UP000014760}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELU10834.1,
RC   ECO:0000313|Proteomes:UP000014760};
RX   PubMed=23254933; DOI=10.1038/nature11696;
RA   Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA   Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA   Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA   Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA   Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT   "Insights into bilaterian evolution from three spiralian genomes.";
RL   Nature 493:526-531(2013).
RN   [3] {ECO:0000313|EnsemblMetazoa:CapteP223648}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMQN01000930; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KB297234; ELU10834.1; -; Genomic_DNA.
DR   AlphaFoldDB; R7UWX5; -.
DR   STRING; 283909.R7UWX5; -.
DR   EnsemblMetazoa; CapteT223648; CapteP223648; CapteG223648.
DR   HOGENOM; CLU_009840_0_0_1; -.
DR   OMA; CGEGTIE; -.
DR   Proteomes; UP000014760; Unassembled WGS sequence.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   CDD; cd00054; EGF_CA; 2.
DR   CDD; cd00104; KAZAL_FS; 1.
DR   CDD; cd00110; LamG; 3.
DR   Gene3D; 2.60.120.200; -; 3.
DR   Gene3D; 3.30.60.30; -; 1.
DR   Gene3D; 2.10.25.10; Laminin; 3.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR013032; EGF-like_CS.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR002350; Kazal_dom.
DR   InterPro; IPR036058; Kazal_dom_sf.
DR   InterPro; IPR001791; Laminin_G.
DR   PANTHER; PTHR15036:SF83; AGRIN; 1.
DR   PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR   Pfam; PF00008; EGF; 2.
DR   Pfam; PF12661; hEGF; 1.
DR   Pfam; PF07648; Kazal_2; 1.
DR   Pfam; PF00054; Laminin_G_1; 2.
DR   Pfam; PF02210; Laminin_G_2; 1.
DR   SMART; SM00181; EGF; 4.
DR   SMART; SM00179; EGF_CA; 3.
DR   SMART; SM00280; KAZAL; 1.
DR   SMART; SM00282; LamG; 3.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR   SUPFAM; SSF57196; EGF/Laminin; 1.
DR   SUPFAM; SSF100895; Kazal-type serine protease inhibitors; 1.
DR   PROSITE; PS00022; EGF_1; 3.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS50026; EGF_3; 3.
DR   PROSITE; PS51465; KAZAL_2; 1.
DR   PROSITE; PS50025; LAM_G_DOMAIN; 3.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW   Reference proteome {ECO:0000313|Proteomes:UP000014760}.
FT   DOMAIN          2..59
FT                   /note="Kazal-like"
FT                   /evidence="ECO:0000259|PROSITE:PS51465"
FT   DOMAIN          147..330
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DOMAIN          331..367
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          370..407
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          412..590
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DOMAIN          586..624
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          633..814
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DISULFID        357..366
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        397..406
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        614..623
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   816 AA;  89273 MW;  38594019ABA60753 CRC64;
     MEDGAPQCTC PSTCELSEKR SQPVCGTDRM THGDACQLRI FACRLMRDIR VAHEGPCADD
     TTTRMPYQRP RKTTRHVSNH VTNRDSHSTL PYGDVGYFCK SDGDCFVDGT YCNSGRCSCR
     PGYEATADNA ACQRSFDIML SDAGTGYMIP SFSGNSYLEL TKIHHGNSRI TIEMTFRPLK
     PDGLLLFAAQ DQTGNGDFIS LSLVDGHVDF RYDLGSGISR LVSTSPVNMN VYHKVVAQRF
     GKNGMLQVNG GHEVSGVSPG ALRSLNLKSP LFLGGLPIAT SKVVENIGTE KSFVGCIEVL
     RITDEKNTKD YSLVYPASDD INLAVHIDEC MDNPCDSSPC QNDGTCMALQ TTYQCICPLK
     YSGLTCTVEQ TLACESFPCA EGATCVDLPG GKFTCTCAGD QQGELCDQKI EIDVPQFNGA
     SYLELKTTKN LETALNFEIW FLSTHPDGVL LYNEQDGEES GDFLSLNLVD GYLQFRFDLG
     SGMADIKSSF PVPLNTWNKV SIIRNEKQGV LTINGAESDN GQSKGSLKQL NLHKKFYLGG
     FPSAYHPDSG IISGFRGAIQ RVYEDNILVD GLYDSSISSA GISPYLGPPC PPESNPCTNG
     GTCVPVLNDF QCRCTEGFSG KKCESTSVVR MKSNPVSFDG RTYHRYLNNI NEKFRAESKN
     SFEIHFHTLG VRGLLLLVHK SETVAGDYLA IAINSGHLEV SFNLGKELST ELFSLRSPVR
     VNDGHWHTLT FKRDERSASL QVDNGDVIYG SSKEGANQLD TNGDLWIGGS RNPPEGLPKS
     YRSGFVGCIE SVYVNDSPLD LDTDRVNFSP LNFCDV
//
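
The feature table (FT lines) in the record above can be read programmatically. The following is a minimal sketch, not an official UniProt parser: it assumes every location is a plain `start..end` range, as in this entry, and it ignores the `/evidence` qualifiers. The names `parse_features` and `span_length` are hypothetical helpers introduced only for illustration, and the input is a short excerpt copied from the FT lines of this entry.

```python
import re

# Excerpt of FT lines copied from the entry above. A real script would read
# the whole record from a file instead; that step is not shown here.
EXCERPT = '''\
FT   DOMAIN          2..59
FT                   /note="Kazal-like"
FT                   /evidence="ECO:0000259|PROSITE:PS51465"
FT   DOMAIN          147..330
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DOMAIN          331..367
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DISULFID        357..366
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
'''

# A feature header line: "FT", three spaces, the feature key, then "start..end".
# Assumption: only simple ranges occur (no "<", ">", "?" or single positions).
FEATURE_RE = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
# A qualifier continuation line carrying a /note="..." value.
NOTE_RE = re.compile(r'^FT\s+/note="(.*)"\s*$')


def parse_features(text):
    """Return one dict per feature with keys: key, start, end, note."""
    features = []
    for line in text.splitlines():
        head = FEATURE_RE.match(line)
        if head:
            features.append({
                "key": head.group(1),
                "start": int(head.group(2)),
                "end": int(head.group(3)),
                "note": None,
            })
            continue
        note = NOTE_RE.match(line)
        if note and features:
            # Attach the note to the most recently opened feature.
            features[-1]["note"] = note.group(1)
    return features


def span_length(feature):
    """Length in residues; UniProt positions are 1-based and inclusive."""
    return feature["end"] - feature["start"] + 1


if __name__ == "__main__":
    for feature in parse_features(EXCERPT):
        print(feature["key"], feature["start"], feature["end"],
              feature["note"], span_length(feature))
```

Because positions are inclusive, a range such as 2..59 covers 58 residues, which is why `span_length` adds one to the difference.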