Database: UniProt
Entry: G3W3J1_SARHA
ID   G3W3J1_SARHA            Unreviewed;       550 AA.
AC   G3W3J1;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 2.
DT   27-MAR-2024, entry version 75.
DE   SubName: Full=EGF like domain multiple 6 {ECO:0000313|Ensembl:ENSSHAP00000009996.2};
GN   Name=EGFL6 {ECO:0000313|Ensembl:ENSSHAP00000009996.2};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000009996.2, ECO:0000313|Proteomes:UP000007648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000009996.2, ECO:0000313|Proteomes:UP000007648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA   Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA   Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA   Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA   Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA   Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered marsupial
RT   Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000009996.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the nephronectin family.
CC       {ECO:0000256|ARBA:ARBA00009738}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_003765626.1; XM_003765578.1.
DR   AlphaFoldDB; G3W3J1; -.
DR   STRING; 9305.ENSSHAP00000009996; -.
DR   Ensembl; ENSSHAT00000010084.2; ENSSHAP00000009996.2; ENSSHAG00000008649.2.
DR   GeneID; 100922811; -.
DR   KEGG; shr:100922811; -.
DR   CTD; 25975; -.
DR   eggNOG; KOG1217; Eukaryota.
DR   GeneTree; ENSGT00930000150973; -.
DR   HOGENOM; CLU_036867_0_0_1; -.
DR   InParanoid; G3W3J1; -.
DR   TreeFam; TF330819; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:Ensembl.
DR   GO; GO:0016020; C:membrane; IEA:InterPro.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:Ensembl.
DR   GO; GO:0010811; P:positive regulation of cell-substrate adhesion; IEA:Ensembl.
DR   CDD; cd00054; EGF_CA; 2.
DR   CDD; cd06263; MAM; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.10.25.10; Laminin; 5.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR   InterPro; IPR018097; EGF_Ca-bd_CS.
DR   InterPro; IPR024731; EGF_dom.
DR   InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR   InterPro; IPR000998; MAM_dom.
DR   PANTHER; PTHR24050:SF24; EPIDERMAL GROWTH FACTOR-LIKE PROTEIN 6; 1.
DR   PANTHER; PTHR24050; PA14 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF12947; EGF_3; 1.
DR   Pfam; PF07645; EGF_CA; 2.
DR   Pfam; PF00629; MAM; 1.
DR   SMART; SM00181; EGF; 5.
DR   SMART; SM00179; EGF_CA; 3.
DR   SMART; SM00137; MAM; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 1.
DR   SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR   PROSITE; PS00010; ASX_HYDROXYL; 3.
DR   PROSITE; PS00022; EGF_1; 1.
DR   PROSITE; PS01186; EGF_2; 3.
DR   PROSITE; PS50026; EGF_3; 3.
DR   PROSITE; PS01187; EGF_CA; 1.
DR   PROSITE; PS50060; MAM_2; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..550
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5029468894"
FT   DOMAIN          95..134
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          175..213
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          220..260
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          402..548
FT                   /note="MAM"
FT                   /evidence="ECO:0000259|PROSITE:PS50060"
SQ   SEQUENCE   550 AA;  61482 MW;  B507AA9C9F17C4C6 CRC64;
     MRASWCPGLS LLLSFVAGGF VKTAASSRSH RSLAFADQPG ICHYGPKLEC CYGWKKNNKG
     LCEAVCDHGC KYGECIGPNK CKCFPGFTGK TCNQDMNECG LKPRPCKHRC MNTHGSYKCY
     CLSGYMLMPD GTCSNSRTCA MTNCQYGCEG MKDEVRCLCP SAGLQLGPNG RACIDIDECS
     SGKVLCPYNR RCVNTFGSYY CKCHIGFELK YISGRYDCVD INECSANTHK CSIHADCLNT
     QGAFKCKCRQ GYKGNGLHCS AIPENSVKEI FRAPGTIKDS IKKLLAHKNS VKKNEEIKNV
     IPEPAVTPSI KVLLQPFHYE DSIQTGGAYD EKVKEHENTK REERMEEEEG QIDLKNQIDP
     EKSLRGDVFF SKVNEAVPFD LFPIQRKVLT SKLEHKDLNS SIDCNFDQGI CDWIQDTDDD
     FDWNPADRDN AVGYYMVVPA FVGHKNNVGR LILLLSNLQP QSSSCLMFNY RLAGERVGTL
     RVFVKDKNNT LAWEETSSED GRWRTEKIQL YHEIETTKSV IFEAERGKGK TGEIGLDTVL
     LVSGICPEVL
//