ID A0A9D4LLT6_DREPO Unreviewed; 224 AA.
AC A0A9D4LLT6;
DT 03-MAY-2023, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2023, sequence version 1.
DT 28-JAN-2026, entry version 13.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN ORFNames=DPMN_022972 {ECO:0000313|EMBL:KAH3860079.1};
OS Dreissena polymorpha (Zebra mussel) (Mytilus polymorpha).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Bivalvia;
OC Autobranchia; Heteroconchia; Euheterodonta; Imparidentia; Neoheterodontei;
OC Myida; Dreissenoidea; Dreissenidae; Dreissena.
OX NCBI_TaxID=45954 {ECO:0000313|EMBL:KAH3860079.1, ECO:0000313|Proteomes:UP000828390};
RN [1] {ECO:0000313|EMBL:KAH3860079.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Duluth1 {ECO:0000313|EMBL:KAH3860079.1};
RC TISSUE=Whole animal {ECO:0000313|EMBL:KAH3860079.1};
RA McCartney M.A., Auch B., Kono T., Mallez S., Zhang Y., Obille A.,
RA Becker A., Abrahante J.E., Garbe J., Badalamenti J.P., Herman A.,
RA Mangelson H., Liachko I., Sullivan S., Sone E.D., Koren S.,
RA Silverstein K.A.T., Beckman K.B., Gohl D.M.;
RT "The Genome of the Zebra Mussel, Dreissena polymorpha: A Resource for
RT Invasive Species Research.";
RL bioRxiv 0:0-0(2019).
RN [2] {ECO:0000313|EMBL:KAH3860079.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Duluth1 {ECO:0000313|EMBL:KAH3860079.1};
RC TISSUE=Whole animal {ECO:0000313|EMBL:KAH3860079.1};
RA McCartney M.A., Auch B., Kono T., Mallez S., Becker A., Gohl D.M.,
RA Silverstein K.A.T., Koren S., Bechman K.B., Herman A., Abrahante J.E.,
RA Garbe J.;
RL Submitted (NOV-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAH3860079.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JAIWYP010000002; KAH3860079.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A9D4LLT6; -.
DR OrthoDB; 6162427at2759; -.
DR Proteomes; UP000828390; Unassembled WGS sequence.
DR CDD; cd00054; EGF_CA; 1.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR000742; EGF.
DR InterPro; IPR013111; EGF_extracell.
DR InterPro; IPR051022; Notch_Cell-Fate_Det.
DR PANTHER; PTHR24049; CRUMBS FAMILY MEMBER; 1.
DR Pfam; PF07974; EGF_2; 1.
DR SMART; SM00181; EGF; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 3.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000828390};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..224
FT /note="EGF-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039027901"
FT DOMAIN 21..59
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 60..94
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 95..134
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 142..168
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 142..152
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 153..168
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 49..58
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 84..93
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 105..122
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 124..133
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 224 AA; 24011 MW; 37E8B53A6BF86BDA CRC64;
MALEFARTLF LSLCFFVPAA DSNDCDDGNF CLNSGTCSYD QHGHGKCQCP TWYTGKHCEH
GEPCKHIVCQ HGGTCNERDG RCQCQAGFVG DLCEVVDSCD DHIICYNGGS CYFNNSVNVA
ACRCPVDFTG YRCERQISTT TTTSHVTNPP STATASNHPQ TSVSTTRNPA CAKYDPGSRA
CIDNSACGLL HVKEVICPDP NEAHYCPITC GCCLPASHTA AVVG
//