ID E9QAQ8_MOUSE Unreviewed; 3455 AA.
AC E9QAQ8;
DT 05-APR-2011, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 5.
DT 27-MAR-2024, entry version 91.
DE SubName: Full=Mucin 5, subtypes A and C, tracheobronchial/gastric {ECO:0000313|Ensembl:ENSMUSP00000122353.4};
GN Name=Muc5ac {ECO:0000313|Ensembl:ENSMUSP00000122353.4,
GN ECO:0000313|MGI:MGI:104697};
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090 {ECO:0000313|Ensembl:ENSMUSP00000122353.4, ECO:0000313|Proteomes:UP000000589};
RN [1] {ECO:0000313|Ensembl:ENSMUSP00000122353.4, ECO:0000313|Proteomes:UP000000589}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C57BL/6J {ECO:0000313|Ensembl:ENSMUSP00000122353.4,
RC ECO:0000313|Proteomes:UP000000589};
RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA Eichler E.E., Ponting C.P.;
RT "Lineage-specific biology revealed by a finished genome assembly of the
RT mouse.";
RL PLoS Biol. 7:E1000112-E1000112(2009).
RN [2] {ECO:0000313|Ensembl:ENSMUSP00000122353.4}
RP IDENTIFICATION.
RC STRAIN=C57BL/6J {ECO:0000313|Ensembl:ENSMUSP00000122353.4};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR SMR; E9QAQ8; -.
DR ProteomicsDB; 316007; -.
DR Antibodypedia; 3457; 1150 antibodies from 35 providers.
DR Ensembl; ENSMUST00000155534.9; ENSMUSP00000122353.4; ENSMUSG00000037974.17.
DR AGR; MGI:104697; -.
DR MGI; MGI:104697; Muc5ac.
DR VEuPathDB; HostDB:ENSMUSG00000037974; -.
DR GeneTree; ENSGT00940000156076; -.
DR HOGENOM; CLU_000076_3_1_1; -.
DR InParanoid; E9QAQ8; -.
DR OMA; CRAKSHP; -.
DR Reactome; R-MMU-913709; O-linked glycosylation of mucins.
DR Reactome; R-MMU-977068; Termination of O-glycan biosynthesis.
DR Proteomes; UP000000589; Chromosome 7.
DR RNAct; E9QAQ8; Protein.
DR Bgee; ENSMUSG00000037974; Expressed in pyloric antrum and 23 other cell types or tissues.
DR ExpressionAtlas; E9QAQ8; baseline and differential.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IDA:MGI.
DR GO; GO:0070701; C:mucus layer; ISO:MGI.
DR GO; GO:0036438; P:maintenance of lens transparency; IMP:MGI.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR InterPro; IPR025155; WxxW_domain.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF404; MUCIN-5AC; 1.
DR Pfam; PF08742; C8; 4.
DR Pfam; PF13330; Mucin2_WxxW; 5.
DR Pfam; PF01826; TIL; 2.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 4.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 5.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 4.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS01208; VWFC_1; 2.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 4.
PE 1: Evidence at protein level;
KW Copper {ECO:0000256|ARBA:ARBA00023008};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039};
KW Proteomics identification {ECO:0007829|MaxQB:E9QAQ8,
KW ECO:0007829|ProteomicsDB:E9QAQ8};
KW Reference proteome {ECO:0000313|Proteomes:UP000000589};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..3455
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003243159"
FT DOMAIN 78..248
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 431..606
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 900..1071
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2743..2925
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 3080..3144
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 3181..3248
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 3327..3415
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 1313..1364
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1489..1594
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1878..2297
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2572..2592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3425..3455
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3425..3443
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 3327..3377
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 3455 AA; 372170 MW; EF38DF82D8E8029C CRC64;
MGVGRRKLVP FWVLALALAC SQCTGQAQQD SLKSYHEHRS DVPHPQGHVG TPLNRVTIIP
PLKTIPVVRA FNPGHTRRVC STWGNFHYKT FDGQVFYFPG LCNYVFSAHC GDAYEDFNIQ
LRRVQESNTT TLSRVTMKLD GLVVELTKSS VLVNNHPVQL PFSQSGVLIE LSNGYLKVVA
RLGLLFVWNE DDSLLLELDT KYTNKTCGLC GDFNGSPKSN EFLSNNVRLT PLEFGNLQKM
DGPTEQCQDP LPVPQKNCSA RSGICEMILK GELFSGCAAL VDISSYVEAC RQDVCLCESL
DPSDCICHTL AEYSRQCAHA GGQPQDWRGP NLCSQTCPLN MQHQECGSPC VDTCSNPQHS
QVCEDHCIAG CFCPEGMVLD DINQMGCVPV SQCACLYNGT LYAPGTNYST DCTKCTCSGG
QWSCQDIPCA GTCSVMGGSH MSTFDGRQYT VHGDCTYVLS KPCDSNAFTV LVELRKCGLT
ESETCLKTVT LNLGGGQTEI MVKATGEVFV NQIYTQLPVS TANATFFRPS TFFIVGETNL
GLQLEIQLSP IMQTSVRLKP GLRGLTCGLC GNFNSMQADD FQTISGVVEG TAAAFFNTFK
TQAACPNVKN IFQDPCSLSV ENEKYAQHWC SLLTNASGPF SQCHATVNPS TFFSNCMYDT
CNCEKSEDCM CAALSSYVRA CAAKGVLLSD WRDGICTKPT ITCPKSMTYQ YHISTCQPTC
RALNEKDVTC HVSFIPVDGC TCPKGTFLDD LGKCVQATSC PCYYKGSTVP NGESVQDSGA
ICTCTQGALT CIGGPAPTPV CDAPMIYFDC HNATPGDTGA GCQKSCHTLD MTCYSSECVP
GCVCPNGLVA DGNGGCVVTE DCPCVHNEAT YRPGETIQVG CNNCTCENRM WQCTDKPCLA
TCAVYGDGHY ITFDGQRYSF NGDCEYTLLQ DNCGGNGSSQ DAFRVITENI PCGTTGTTCS
KSIKIFLGNY ELKLSDSKME VVQKDVGQEP PYFVHQMGNY LVVETDIGLV LLWDKKTSIF
LRLSPEFKGR VCGLCGNFDD NAINDFTTRS QSVVSDMLEF GNSWKLSPSC PDVLVPKDPC
TANPYRKSWA QKQCSIINSE TFSACHAHVE PAKYYEACVN DACACDSGGD CECFCTTVAA
YAQACHEVGV CVSWRTPDIC PLFCDYYNPE GQCEWHYQPC GAPCMRTCQN PTGQCLQDLR
GLEGCYPKCP PTAPIFDEGT MQCVSNCTVT FPCRVNGKLY RPGASVPSDK NCDSCICTES
GVRCTHNAGA CVCTYNGQQF HPGEIIYHTT DGIGGCISAH CRANGTIERS VDTCNSTTPT
PPTTFSFSTP PVMTSMQPSS THSSPTPSVG SSGASSKAAS TTSSILSVKS PVTAPMTMST
SASAVTTSGC REECLWSPWM DVSRPGRGID SGDFDTLENL RAHGYPICQV PKAVECRAEA
SPGVPLPELQ QHLECSTTVG LICYNSDQLS GLCDNYQIKV QCCTPVSCPT SQTTHVISSS
RTTNLDNTTS SVPVTSTEHP YSSTVTSGSS THTPGLSPSS SVPSSPTPAS STPAPVSSTT
VKTTLPITSP TPEPTPAISS VSISTSGSTM PSSETTHECK QELCNWTNWL DGSYPGSGRN
SGDFDTFVNL RSKGYKFCEK PRNVECRAQF FPNTPLEELG QNVTCSREEG LICLNKNQLP
PMCYNYEIRI ECCTVVNNCS TASVTTHPTS HGVSTKTETN WTTHVYSSPT KDTSSHSATI
DTKTWTSGIS HTTTQPVTTH CQLQCNWTKW FDTDFPVPGP HGGDLETYSN IERSGERLCH
REEITQLQCR AKNYPEREME DLGQVVKCDP SVGLVCNNRD QGGDSGMCLN YEVRLLCCHI
PEGCSMTTHV TLLSSTSEIV TSSTPGTTSM HVASSTSMPQ TSSPNTGKTS TISTTQTSSP
NTGKTSTTST TQTSSPNTGK TSTISTTQTS SPNTGKTSTT STTQTSSPNT GKTSTISTTQ
TSSPNTGKAS TPSTPHTSSP NTGKTSTIST TQTSSPNTGK TSTTSTTQTS SPNTGKTSTI
STTQTSSPNT GKASTPSTPH TSSPNTGKTS TISTTQTSSP NTGKASTPST PQTSSPNTGK
TSTISTTQTS SPNTGKGSTP STPQTSSPNT GKTSTISTTQ TSSPNTGKTS TTSTTQTSSP
NTGKTSTIST TQTSSPNTGK ASTPSTPHTS SPNTGKTSTI STTQTSSPNT GKASTPSTPQ
TSSPNTGKTS TISTTQTSSP NTGKGSTPST PQTSSPNTGK TSTTSTTQTS SPNTGKASTI
STTQTISTSG STMPSSETTH ECKQELCNWT NWLDGSYPGS GRNSGDFDTF VNLRSKGYKF
CEKPRNVECR AQFFPNTPLE ELGQNVTCSR EEGLICLNKN QLPPMCYNYE IRIECCTVVN
NCSTASVTTH PTSHGVSTKT ETNWTTHVYS SPTKDTSSHS ATIDTKTWTS GISHTTTQPV
TTHCQLQCNW TKWFDTDFPV PGPHGGDLET YSNIERSGER LCHREEITQL QCRAKNYPER
EMEDLGQVVK CDPSVGLVCN NRDQGGDSGM CLNYEVRLLC CHIPEDCPRT DQTSPVTLSH
KPSSAVVSPS SVSPSLSTSH RVHSTTPCFC SVSGQLYPLG SIIYNQTDLD GHCYYAMCSQ
DCQVVKRVSQ DCPSTMPPPA TTLSTSTTPP VTGRDRCNVF PPRLRGETWP MPNCSQATCE
GNNVISLSPR QCPELNEPSC ANGYPPLKVD DQDGCCQHYQ CQCVCSGWGD PHYITFDGTY
YTFLDNCTYV LVQQIVPVFG YFRVLIDNYY CDVGDSVSCP QSIIVEYHQD RVVLTRRPVS
GVMTNQIIFN NKVVSPGFQQ NGIVTSRVGI KMYVTIQEIG VRVMFSGLIF SVEVPFNLFA
NNTEGQCGTC TNDKKDECRL PGGSIASSCS EMSLHWKVPN QPSCQGPPPT PTSVVPRPSP
TPCPPSPLCE LILSNTFKLC HDVIPPLQFY QGCLFDYCHM LDLEVVCSGL ELYASLCAAQ
GVCIPWRSQT NNTCSFTCPD NQVYQPCGPS NPHYCYRDDS ISPSLTLQEA GPKTEGCFCP
DSTTLFSTND SICVPSCQWC LGPRGEPVEP GHTISIDCQD CICKEATLTC QKKACPQPTC
PEPGFVPVPV ALEAGQCCPQ FSCACNSSHC PPPLHCPKNS SLIVTYEEGA CCPTQNCSSQ
KGCEVNGTLY QPGDVVSSSL CERCLCEVSS NPLSDVFMVS CETELCNTQC PKGSEYQAMP
GQCCGKCIPK TCPFKNNSGS TYFYQPGELW AEPGNPCVTH KCEKFQDVLM VVTMKTECPK
INCPQGQAQL REDGCCYDCP LPNQQKCTVH QRQQIIRQQN CSSEGPVSIS YCQGNCGDSI
SMYSLEANKV EHTCECCQEL QTSQRNVTLR CDDGSSQTFS YTQVEKCGCL GQQCHALGDT
SHAESSEQEF KSKESEEHGQ QLAFRVSEDM LGPFQ
//