ID A0A1S3QRP0_SALSA Unreviewed; 619 AA.
AC A0A1S3QRP0;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Intestinal mucin-like protein {ECO:0000313|RefSeq:XP_013982360.1, ECO:0000313|RefSeq:XP_013982509.1};
GN Name=LOC106595521 {ECO:0000313|RefSeq:XP_014042367.1};
GN Synonyms=LOC106562192 {ECO:0000313|RefSeq:XP_013982360.1},
GN LOC106562267 {ECO:0000313|RefSeq:XP_013982509.1};
OS Salmo salar (Atlantic salmon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC Salmonidae; Salmoninae; Salmo.
OX NCBI_TaxID=8030 {ECO:0000313|Proteomes:UP000087266, ECO:0000313|RefSeq:XP_014042367.1};
RN [1] {ECO:0000313|RefSeq:XP_013982360.1, ECO:0000313|RefSeq:XP_013982509.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_013982360.1,
RC ECO:0000313|RefSeq:XP_013982509.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_013982360.1; XM_014126885.1.
DR RefSeq; XP_013982509.1; XM_014127034.1.
DR RefSeq; XP_014042367.1; XM_014186892.1.
DR STRING; 8030.ENSSSAP00000038946; -.
DR KEGG; sasa:106562192; -.
DR KEGG; sasa:106562267; -.
DR KEGG; sasa:106595521; -.
DR OrthoDB; 2872912at2759; -.
DR Proteomes; UP000087266; Unplaced.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR006208; Glyco_hormone_CN.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF399; MUCIN-2; 1.
DR Pfam; PF08742; C8; 1.
DR Pfam; PF00007; Cys_knot; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00832; C8; 1.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 2.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 1.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS01208; VWFC_1; 2.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000087266};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 1..114
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 265..333
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 369..436
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 510..604
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT DISULFID 531..580
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT DISULFID 542..596
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT DISULFID 546..598
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 619 AA; 68678 MW; 726A44834A7B78AE CRC64;
MTKKDVDGVF TSLIYVNQQR IIPAYETKNF RITDNGIETL LVIPAINAKV SFTGLMFSIY
LPWDKFNGNT EGQCGTCDNN RTDDCRLPNG AIDSSCPDMA HQWHVADHNN SQCTPPPEPT
PTQPPGCDPP ICYLIQSKVF ESCHKIIPYE PFIVACIFDA CYMDDVTIGC TSLQTYADAC
AQAGVCIEWR NYTNGQCDFT CEKPKVYNAC GPQVEPTCNA WYNFKFIQTQ NEFSVMGDIQ
LEGCYCPPGT TLMSSSSNYC IPSCDICLLP NGEWKEANET WVSNCQECVC DPYSLEIQCQ
PVACQHQPPL TCDQEGQVKV VETVDCCQKD KCECDVTRCS TSKITCPVGF ETEATMGVCC
LTYQCVPKDV CVFNNTEYQP GANVPEDKCK NCVCGDSVDA QAHLHIIECQ PTECDTHCQQ
GYDYQAVPGQ CCGKCVQTSC VVMLPDNTTH TIQPGSVWIP SGDKCLKYEC VNIMDQLIPI
QAKTVCPDFY PEDCIPGTEF VAPDGCCHVC IPITKQCNVT KGKVYLDSNG CKSANKVEVT
TCGGSCVTYA MYSLEANMME RSCTCCREES TTKKEVEMIC PDGSKFNHSY IHINKCGCQR
TECVTPEATQ VTRSRRRRR
//