ID G3W3J1_SARHA Unreviewed; 550 AA.
AC G3W3J1;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 75.
DE SubName: Full=EGF like domain multiple 6 {ECO:0000313|Ensembl:ENSSHAP00000009996.2};
GN Name=EGFL6 {ECO:0000313|Ensembl:ENSSHAP00000009996.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000009996.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000009996.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000009996.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the nephronectin family.
CC {ECO:0000256|ARBA:ARBA00009738}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003765626.1; XM_003765578.1.
DR AlphaFoldDB; G3W3J1; -.
DR STRING; 9305.ENSSHAP00000009996; -.
DR Ensembl; ENSSHAT00000010084.2; ENSSHAP00000009996.2; ENSSHAG00000008649.2.
DR GeneID; 100922811; -.
DR KEGG; shr:100922811; -.
DR CTD; 25975; -.
DR eggNOG; KOG1217; Eukaryota.
DR GeneTree; ENSGT00930000150973; -.
DR HOGENOM; CLU_036867_0_0_1; -.
DR InParanoid; G3W3J1; -.
DR TreeFam; TF330819; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:Ensembl.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:Ensembl.
DR GO; GO:0010811; P:positive regulation of cell-substrate adhesion; IEA:Ensembl.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd06263; MAM; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 5.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000998; MAM_dom.
DR PANTHER; PTHR24050:SF24; EPIDERMAL GROWTH FACTOR-LIKE PROTEIN 6; 1.
DR PANTHER; PTHR24050; PA14 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF00629; MAM; 1.
DR SMART; SM00181; EGF; 5.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00137; MAM; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 3.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50060; MAM_2; 1.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..550
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5029468894"
FT DOMAIN 95..134
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 175..213
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 220..260
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 402..548
FT /note="MAM"
FT /evidence="ECO:0000259|PROSITE:PS50060"
SQ SEQUENCE 550 AA; 61482 MW; B507AA9C9F17C4C6 CRC64;
MRASWCPGLS LLLSFVAGGF VKTAASSRSH RSLAFADQPG ICHYGPKLEC CYGWKKNNKG
LCEAVCDHGC KYGECIGPNK CKCFPGFTGK TCNQDMNECG LKPRPCKHRC MNTHGSYKCY
CLSGYMLMPD GTCSNSRTCA MTNCQYGCEG MKDEVRCLCP SAGLQLGPNG RACIDIDECS
SGKVLCPYNR RCVNTFGSYY CKCHIGFELK YISGRYDCVD INECSANTHK CSIHADCLNT
QGAFKCKCRQ GYKGNGLHCS AIPENSVKEI FRAPGTIKDS IKKLLAHKNS VKKNEEIKNV
IPEPAVTPSI KVLLQPFHYE DSIQTGGAYD EKVKEHENTK REERMEEEEG QIDLKNQIDP
EKSLRGDVFF SKVNEAVPFD LFPIQRKVLT SKLEHKDLNS SIDCNFDQGI CDWIQDTDDD
FDWNPADRDN AVGYYMVVPA FVGHKNNVGR LILLLSNLQP QSSSCLMFNY RLAGERVGTL
RVFVKDKNNT LAWEETSSED GRWRTEKIQL YHEIETTKSV IFEAERGKGK TGEIGLDTVL
LVSGICPEVL
//