ID A0A4U1EH17_MONMO Unreviewed; 2415 AA.
AC A0A4U1EH17;
DT 31-JUL-2019, integrated into UniProtKB/TrEMBL.
DT 31-JUL-2019, sequence version 1.
DT 28-JAN-2026, entry version 28.
DE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=EI555_004065 {ECO:0000313|EMBL:TKC34906.1};
OS Monodon monoceros (Narwhal) (Ceratodon monodon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC Monodontidae; Monodon.
OX NCBI_TaxID=40151 {ECO:0000313|EMBL:TKC34906.1, ECO:0000313|Proteomes:UP000308365};
RN [1] {ECO:0000313|Proteomes:UP000308365}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=31054839; DOI=10.1016/j.isci.2019.03.023;
RA Westbury M.V., Petersen B., Garde E., Heide-Jorgensen M.P., Lorenzen E.D.;
RT "Narwhal Genome Reveals Long-Term Low Genetic Diversity despite Current
RT Large Abundance Size.";
RL IScience 15:592-599(2019).
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TKC34906.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; RWIC01001679; TKC34906.1; -; Genomic_DNA.
DR Proteomes; UP000308365; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0043005; C:neuron projection; IEA:UniProtKB-ARBA.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:UniProtKB-ARBA.
DR GO; GO:0007435; P:salivary gland morphogenesis; IEA:UniProtKB-ARBA.
DR GO; GO:0001944; P:vasculature development; IEA:UniProtKB-ARBA.
DR CDD; cd00033; CCP; 12.
DR CDD; cd00054; EGF_CA; 6.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR FunFam; 2.10.25.10:FF:000123; Crumbs homolog 1 (Drosophila); 1.
DR FunFam; 2.10.70.10:FF:000011; CUB and sushi domain-containing protein 3 isoform A; 2.
DR FunFam; 2.10.25.10:FF:000117; Delta-like protein; 1.
DR FunFam; 2.10.25.10:FF:000038; Fibrillin 2; 1.
DR FunFam; 2.10.25.10:FF:000004; Neurogenic locus notch 1; 1.
DR FunFam; 2.60.120.200:FF:000012; neuronal pentraxin receptor; 1.
DR FunFam; 2.10.25.10:FF:000109; Notch homolog 4, [Drosophila]; 1.
DR FunFam; 2.10.70.10:FF:000183; Sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1; 1.
DR FunFam; 2.10.50.10:FF:000018; Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing 1; 1.
DR FunFam; 3.40.50.410:FF:000070; sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; 1.
DR FunFam; 2.10.25.10:FF:000309; Uncharacterized protein, isoform A; 1.
DR FunFam; 2.10.70.10:FF:000003; Versican core protein; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 12.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.10.50.10; Tumor Necrosis Factor Receptor, subunit A, domain 2; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000742; EGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR003410; HYR_dom.
DR InterPro; IPR049883; NOTCH1_EGF-like.
DR InterPro; IPR001759; PTX_dom.
DR InterPro; IPR051277; SEZ6_CSMD_C4BPB_Regulators.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR011641; Tyr-kin_ephrin_A/B_rcpt-like.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR45656; PROTEIN CBR-CLEC-78; 1.
DR PANTHER; PTHR45656:SF15; SUSHI DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00008; EGF; 4.
DR Pfam; PF07645; EGF_CA; 1.
DR Pfam; PF07699; Ephrin_rec_like; 2.
DR Pfam; PF12661; hEGF; 1.
DR Pfam; PF02494; HYR; 2.
DR Pfam; PF00354; Pentaxin; 1.
DR Pfam; PF00084; Sushi; 11.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00895; PENTAXIN.
DR SMART; SM00032; CCP; 13.
DR SMART; SM00181; EGF; 7.
DR SMART; SM00179; EGF_CA; 6.
DR SMART; SM01411; Ephrin_rec_like; 2.
DR SMART; SM00159; PTX; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 12.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 4.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 6.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS50026; EGF_3; 6.
DR PROSITE; PS01187; EGF_CA; 3.
DR PROSITE; PS50825; HYR; 2.
DR PROSITE; PS51828; PTX_2; 1.
DR PROSITE; PS50923; SUSHI; 13.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}.
FT DOMAIN 134..319
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 455..512
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 513..572
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 573..638
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 637..719
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 720..801
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 802..866
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1300..1336
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1338..1374
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1390..1426
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1428..1464
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1466..1502
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1560..1764
FT /note="Pentraxin (PTX)"
FT /evidence="ECO:0000259|PROSITE:PS51828"
FT DOMAIN 1765..1823
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1824..1881
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 1881..1920
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1960..2017
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2018..2075
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2076..2156
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2157..2214
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2215..2276
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2277..2339
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 2340..2397
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT REGION 90..109
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 483..510
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 543..570
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1326..1335
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1416..1425
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1454..1463
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1492..1501
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1794..1821
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1852..1879
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 1988..2015
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 2046..2073
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 2185..2212
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 2279..2322
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 2368..2395
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:TKC34906.1"
SQ SEQUENCE 2415 AA; 264122 MW; 51AB47280F6CF048 CRC64;
SDERALRLAA AVPCPEPRAA HAPRRAGRLS VSPRRADACA PGGRSVSSAM WARLAFCCWG
LALVSGWATF QQLSPSRNFS FRLFPEAAPE TPGRLPAPPG PGEDAAAEES KVERLGQAFR
RRVRRLRELS ERLELVFLVD ESSSVGQANF LSELKFVRKL LSDFPVVSTA TRVAIVTFSS
KNNVVPRVDY ISSRRAHQHK CALLSREIPA ITYRGGGTYT KGAFQQAAGQ TVAKLEQQIL
RHSRENSTKV IFLITDGYSN GGDPRPIAAS LRDFGVEIFT FGIWQGNIRE LNDMASTPKE
EHCYLLHSFE EFEALARRAL HEAGKDCCDQ MASCKCGTHT GQFECICEKG YYGKGLQYEC
TEDLFPISRI IPLKPFTFSS PFVQCFEPKQ GAQCFLHVYN YSVLLGLAHP GHTNQKVHQE
ESARAFHVLM KITLLHPEVL PLKTVSARRG TGHLARAAKP PENGYFIQNT CNSHFNAACG
VRCHPGFDLV GSSIFLCLPD GLWSGSESSC RVRTCPRLRQ PKHGRLSCST GELSYRTVCL
VTCDEGYRLE GSARLTCQVN AQWDGLEPRC VERHCSTFQK PKGVIVSPPN CGKHPAKPGT
ICQLSCHQGF ILSGGREEVR CTTSGKWSAK VQTAACEDVE APQINCPADI EAETQEQQDS
ANITWQIPTA KDNSGEKVSI HVHPAFTPPY LFPIGDVAIT YTATDLSSNQ ASCTFHIKVI
DVEPPSIDWC RSPPPVQVSE KEHAATWDEP QFSDNSGAVL AITRSHTPGD LFPHGETVVR
YTATDASGNN RTCDIHIIIK GSPCEVPFTP VNGDFLCSQD SAGVNCTLSC LEGYDFTEGS
TDKYYCAYED GIWKPPYSTE WPDCAIKRFA NHGFKSFEML YKATRCDDTD LLKKFSEAFE
TTLGKMVPSF CSDADDIDCR LEDLTKKYCL EYNYEYENGF AIGPVGWGAA NRLDYSYDDF
LDTVRETPTG AGKARSSRIK RSAPLSDHKI KLIFNITASV PLPDERNDTL ELENQQRLIK
TLETITNRLK RTLTKEPMYS FHLASEMLVA DSNSLETEKA FLFCRPGSML RGRMCVALWE
PTILWRILSV KAACWAPIKM KKGSLSANPV QLELTLNISI QEVARNVKCK QGTYSSNGLE
TCESCPLGSY QPAFGSRGCL VCPENTSTVK RGAVDISACG VPCPVGEFSR SGLMPCYPCP
RDYYQPNPGK SFCLSCPFYG TTTITGARSI TDCSTAEESI VPVASPGHIK KKYEVSSQAS
PFYIYKIFSN GILGKQFVII HVPQRKTTRV ETSSLKCETD IDECSSLPCY NNGICKDQVG
EFICECPSGY TGQLCEENIN ECSSSPCSNK GTCVDGLAGY RCTCVKGYMG VNTRAPSRPH
PSPGLHCETE VNECQSSPCL NNAVCEDQLG GFLCKCLPGF LGNRCEINMD ECFSQPCKNG
ATCKDGANSF RCQCAVGFTG PHCDLNINEC QSNPCRNQAT CVDELNSYSC KCQPGFSGSR
CETGIYELNV INNDTNHNDI ITVSVFTCTE HHLYTKNYAK HFVLILHSHP LDWFMLSPSA
GFNLDFEVSG IYGYVMLDGV LPSLRAVTCT FWMKSSDTTN YGTPISYALE NGSDNTFLLT
DYNGWVLYVN GKEKITDCPS VNDGSWHHIA ITWTSADGAW KVYIDGKLSD GGVGLSVGSP
IPGGGALVLG QEQDKKGEGF NPAESFVGSI SQLNLWDYVL SPQQVKSLAS SCPEELRKGN
VLAWPDFLSG IVGRVKIDSK SLFCSDCPPL EGSVPHLRTA SGDVKPGSRI SLFCDPGFQM
VGNPVQYCLN QGQWTQPLPL CERISCGVPP PLENGFYSAE DFHAGSTVTY QCNNGYYLLG
DSRMFCTDNG SWNGISPSCL DVDECAVGSD CSEHASCLNT NGSYLCSCIP PYTGDGKNCA
ASVSLDFMPF PSKAFFPPQD LTGTLTVVEF FCYETLHEPI KCKAPGNPEN GRSSGEIYTV
GSEVTFSCDE GHQLMGVAKI TCLESGEWSH LIPYCEPVSC GAPAIPENGG IDGSAFTYGS
KVIYRCNKGY TLEGEKESSC LASSSWSHSP PLCELVKCSS PEDINNGKYI LSGLTYLSTA
SYSCENGYRY SPRHQPSFLQ YPVPHYLLSG FHLHGPLVIE CSASGSWDRA PPTCHLVVCG
EPPAIKDAVT TGSNFTFGNT VTYTCKEGYT LAGPDTIECL ANGKWSRSDQ QCLAVSCDEP
PSVEHASPET AHRLFGDIAF YYCSDGYSLA DNSQLLCNAQ GKWVPPEGQA VPRCIAHFCE
KPPAVSYSIL ESVSKAKFAA GSVVSFKCTE GFVLNTSAKI ECLRGGQWNP SPMSIQCIPV
RCGEPPRIMN GYAIGSNYSF GAVVAYSCNR GFYIKGEKKS ACEATGQWSS PIPTCHPVSC
NEPPKVENGF LEVRD
//