ID A0A3P8XYL7_ESOLU Unreviewed; 2561 AA.
AC A0A3P8XYL7;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 02-DEC-2020, sequence version 2.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=Mucin-2 {ECO:0000313|Ensembl:ENSELUP00000009512.2};
OS Esox lucius (Northern pike).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Esociformes;
OC Esocidae; Esox.
OX NCBI_TaxID=8010 {ECO:0000313|Ensembl:ENSELUP00000009512.2, ECO:0000313|Proteomes:UP000265140};
RN [1] {ECO:0000313|Ensembl:ENSELUP00000009512.2, ECO:0000313|Proteomes:UP000265140}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25069045;
RA Rondeau E.B., Minkley D.R., Leong J.S., Messmer A.M., Jantzen J.R.,
RA von Schalburg K.R., Lemon C., Bird N.H., Koop B.F.;
RT "The genome and linkage map of the northern pike (Esox lucius): conserved
RT synteny revealed between the salmonid sister group and the Neoteleostei.";
RL PLoS ONE 9:e102089-e102089(2014).
RN [2] {ECO:0000313|Ensembl:ENSELUP00000009512.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSELUT00000003720.2; ENSELUP00000009512.2; ENSELUG00000011141.2.
DR GeneTree; ENSGT00940000165245; -.
DR InParanoid; A0A3P8XYL7; -.
DR OMA; DENTRRC; -.
DR Proteomes; UP000265140; LG23.
DR Bgee; ENSELUG00000011141; Expressed in mesonephros and 5 other cell types or tissues.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF289; MUCIN-19-LIKE; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 3.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 5.
DR SMART; SM00215; VWC_out; 3.
DR SMART; SM00216; VWD; 3.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 3.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS51233; VWFD; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000265140};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..2561
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5028041017"
FT DOMAIN 365..535
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 808..983
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2014..2195
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2466..2558
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 1170..1195
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1209..1877
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1908..1936
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 2487..2536
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 2561 AA; 282950 MW; FED2A3AF04DC0912 CRC64;
MLSPFWTLEA ALLSIVFALG SVGSFSAIDT HKNVCKTFGS GVIKTFNGTS FYVRSTCPFT
LTRFTYNRVD CDITMRRGTN GLMDYVEISV NKIQTRILYN GTIFVELSMV SLPYDHTYQH
VFRYGIYTKL RSTVLPLSVI WSSVGVGIDS LWVELEQELD SGMTGLCGHP NTPDEPQGLI
AGSVLPGDRC QTMDPLRTLN RVCRTFFSHY LECLQAETVT YISLCEKNIY GYANNQHVGC
AFYKEIAHQC KDTSLAWGFW RTITQCPEFS CQGDLRFQEL GDAFVPTCSN PEPRTSNQDI
TSTCACSQGQ VLNDRAEGQH CVNVSSCPCE FAGRTYAALE ERITKCQTCV CLNGKWSCSQ
NSCPSRCVIE GQFVTTFDGK EYTLPGKCTY VVSKGLNWTI SMQFSEKTIS LQKVVLQVYQ
NTYTFTHNSV QFEKEEIQEL HQSEHALVFW QSSMYVQVLT SFGMKIQVQT SPDLQLYITL
PQREVGMPEG LCGNYNTDTT DDFTTSSGIV ENSPEKFALS WSVGDCPVDI INVCINTDNE
IFADEKCQPL RDPDGIFAKC HNHVPADIYH KACIQKTCTC GGGLQQCLCV ALANYAKACA
NQGITVGDWR TATNCTLSCE NNLRFDYGMR ACNRTCLSLS GPDPRCGVED APTEGCGCME
GTHLNGELRC TPSAECPCHH LGGATAPGLI VLDGRQCKCE NGRLQCSEDC GCTLGKVCVH
CSEYPINTAQ KTCGSLSKPT GDIQSCTSGC YCPGGLMEDH RGVCVTMDNC TCEFSGRVFD
AGQSVKTNCR TCTCRDGQWT CVDEPCPGSC QVYGNGHYQT FDSKWYRYDG NCQYTLVKDG
CGGEAGSFAV TVESVPCCDE ALTCSRAIVL DLMGQVTLIL NEMKVTRRLQ GEWASVETEP
LYSTHTVGLY IMISVPTRGL TLIWDKHTRL TVTLEHQWRN RVCGLCGNFD SNEMNDLQIS
GSSVVSSPLT FGNSWKTAAP PCSDVTNDVF PCQRHTYCSA WAERRCMILR GDTFKDCHLK
VDPEPFYQAC VLESCSCEFE GKFLGFCTAV AAYAEACSDH DVCVRWRTPD MCPVYCDYYN
EENGCSWHYD PCGQVKTCGK NYRFAGKLEG CYPRCPVEAP YYDENTERCS TLFNCTCYSN
ETVIEPGTVV RTPTGNCICE QGRITCGPQP TPTSEYTTTS TQPTTTSEYT TTSTQPTTTF
EYTTTSVYTT KSTQPTTTSE YTTASTQPTT TSEYTTTSTQ PTTTSEYTTT STQPTTTSEY
TTTSTQPTTT SEYTTTSTQP TTTSEYTTTS EYTTTSTQPT TKSEYTTTST QPTTISEFTT
TSEYTTTSTQ PTTTSEYTTT STQPTTTSEY TTTSTQPTTT SEYTTTSEYT TTSTQPTTTS
EYTTTSEYTT TSTQPTTTSE YTTTSTQPTT TSEYTTTSTQ PTTTSEYTTT SEYTTESTQL
TATSEYTTTS EYTTTSTQPT TTSEYTTTSE YTTKSTQPIT TSEYTTTSTQ PTTTSEYTTT
STQPTTTSEY TTTSEYTTTS TQPTTTSEYT TTSTQPTTTS EYTTTSTQPT TTSEYTTTSE
YTTKSTQPIT TSEYTTTSTQ PTTTSEYTTT STQPTTTSEY TTTSEYTTTS TQPTTTSEYT
TTSTQPTTTS EYTTTSTQPT TTSEYTTTST QPTTTTEYTT TSTQPTTTSE YTTTSEYTTT
STQPTTTSEY TTTSEYTTTS EYTTTSTQPT TTSEYSTTST QPTTTSEYTT TSEYTTTSTQ
PTTTSEYTTT SEYTTTSTQP TTTSEYTTTS EYTTTSTQPT TTSEYTTTST QPTTTSEYTT
TSTQPTTTSE YTTTSTQPTT TSEYTTTSEY TTTSTQPTTT SEYTTTSTQP TTTSEYTTTS
TQPTTTSEYT TTSTQPTTTF SYFTTSNVKP TSETAISYTV ITNTPVPVST SRQISSNTSH
ESTTTSSTRS PPTSITISTA QTTTQRCECL DLKGNHSWNC GQNWTEDCFH KTCNNRKIEM
TSVTCPSPVR PTFCPRGQMV IVSDGCCDSW KCDCRCDLYG DPHYISFQGV TFDFLDNCTY
ILVEEKTPRH YLTIAVDNYF CMEGLDGSCV KGIILQYQNN TATLSVVPDE YTVQCTLNRR
VVKPPYEEHG FRFETTGYMV SVYIPDIRSH ISLTPSYNLV VTLAMEHFLN NTQGQCGVCG
GSSCVRRSGE IEDDSCCDKT AYDWLYKDPL KPECSAAPTN VPCPTPSTEL PLPPDCTYNH
LCELFHHPVF ANCSRLVNLK LHEDHCRFDS CLKGEMACAS LEQAAEECKN LGFCVDWRQL
TNSTCVVECP VGMVYKECEG QLDDYCHGGE RLPGTVLRQV KAGCFCPSGQ FRAEEHKTIC
VSECPYCKGP LGEPKMFGEI WRSNCQECRC NNQTKTEECR PVPPSPPPIC SQSSSLVTRC
CGQQICVEKT CTYNGTAYKV GDRWTDPALP CESYRCTRDG TQTERTVCPH QNCLEEDRVW
DEQHCCYTCD HSCAPRVSSV NITIDNCTAT LQLPVCLGQC GVETRWVGAG SFLQLEQKET
FCRVRSFEKR IVTLSCTGNS FVNHLYAHVT SCVCQVCSGL E
//