ID A0A0G2K1Y4_RAT Unreviewed; 3586 AA.
AC A0A0G2K1Y4;
DT 22-JUL-2015, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 2.
DT 27-MAR-2024, entry version 37.
DE SubName: Full=Mucin 5AC, oligomeric mucus/gel-forming {ECO:0000313|Ensembl:ENSRNOP00000072033.2};
GN Name=Muc5ac {ECO:0000313|Ensembl:ENSRNOP00000072033.2,
GN ECO:0000313|RGD:62001};
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Rattus.
OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000072033.2, ECO:0000313|Proteomes:UP000002494};
RN [1] {ECO:0000313|Ensembl:ENSRNOP00000072033.2, ECO:0000313|Proteomes:UP000002494}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000072033.2,
RC ECO:0000313|Proteomes:UP000002494};
RX PubMed=15057822; DOI=10.1038/nature02426;
RG Rat Genome Sequencing Project Consortium;
RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J.,
RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G.,
RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G.,
RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G.,
RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S.,
RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T.,
RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., Smith D.,
RA Lee H.-M., Gustafson E., Cahill P., Kana A., Doucette-Stamm L.,
RA Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., Green E.D.,
RA Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., Zhu B., Marra M.,
RA Schein J., Bosdet I., Fjell C., Jones S., Krzywinski M., Mathewson C.,
RA Siddiqui A., Wye N., McPherson J., Zhao S., Fraser C.M., Shetty J.,
RA Shatsman S., Geer K., Chen Y., Abramzon S., Nierman W.C., Havlak P.H.,
RA Chen R., Durbin K.J., Egan A., Ren Y., Song X.-Z., Li B., Liu Y., Qin X.,
RA Cawley S., Cooney A.J., D'Souza L.M., Martin K., Wu J.Q.,
RA Gonzalez-Garay M.L., Jackson A.R., Kalafus K.J., McLeod M.P.,
RA Milosavljevic A., Virk D., Volkov A., Wheeler D.A., Zhang Z., Bailey J.A.,
RA Eichler E.E., Tuzun E., Birney E., Mongin E., Ureta-Vidal A., Woodwark C.,
RA Zdobnov E., Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J.,
RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., Schmidt J.,
RA Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., Abril J.F.,
RA Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., Poliakov A.,
RA Huebner N., Ganten D., Goesele C., Hummel O., Kreitler T., Lee Y.-A.,
RA Monti J., Schulz H., Zimdahl H., Himmelbauer H., Lehrach H., Jacob H.J.,
RA Bromberg S., Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E.,
RA Lazar J., Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M.,
RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., Webber C.,
RA Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., Elnitski L.,
RA Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., Miller W.,
RA Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., Zhang Y.,
RA Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., Clarke L., Curwen V.,
RA Durbin R.M., Eyras E., Searle S.M., Cooper G.M., Batzoglou S., Brudno M.,
RA Sidow A., Stone E.A., Payseur B.A., Bourque G., Lopez-Otin C., Puente X.S.,
RA Chakrabarti K., Chatterji S., Dewey C., Pachter L., Bray N., Yap V.B.,
RA Caspi A., Tesler G., Pevzner P.A., Haussler D., Roskin K.M., Baertsch R.,
RA Clawson H., Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J.,
RA Rosenbloom K.R., Trumbower H., Weirauch M., Cooper D.N., Stenson P.D.,
RA Ma B., Brent M., Arumugam M., Shteynberg D., Copley R.R., Taylor M.S.,
RA Riethman H., Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S.,
RA Mockrin S., Collins F.S.;
RT "Genome sequence of the Brown Norway rat yields insights into mammalian
RT evolution.";
RL Nature 428:493-521(2004).
RN [2] {ECO:0000313|Ensembl:ENSRNOP00000072033.2}
RP IDENTIFICATION.
RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000072033.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A0G2K1Y4; -.
DR STRING; 10116.ENSRNOP00000072033; -.
DR Ensembl; ENSRNOT00000085159.2; ENSRNOP00000072033.2; ENSRNOG00000055996.2.
DR AGR; RGD:62001; -.
DR RGD; 62001; Muc5ac.
DR VEuPathDB; HostDB:ENSRNOG00000055996; -.
DR GeneTree; ENSGT00940000156076; -.
DR InParanoid; A0A0G2K1Y4; -.
DR Reactome; R-RNO-913709; O-linked glycosylation of mucins.
DR Reactome; R-RNO-977068; Termination of O-glycan biosynthesis.
DR Proteomes; UP000002494; Chromosome 1.
DR Bgee; ENSRNOG00000055996; Expressed in stomach and 1 other cell type or tissue.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IDA:RGD.
DR GO; GO:0070701; C:mucus layer; ISO:RGD.
DR GO; GO:0071364; P:cellular response to epidermal growth factor stimulus; IEP:RGD.
DR GO; GO:0071385; P:cellular response to glucocorticoid stimulus; IEP:RGD.
DR GO; GO:0071300; P:cellular response to retinoic acid; IEP:RGD.
DR GO; GO:0030855; P:epithelial cell differentiation; IEP:RGD.
DR GO; GO:0036438; P:maintenance of lens transparency; ISO:RGD.
DR GO; GO:0032496; P:response to lipopolysaccharide; IEP:RGD.
DR GO; GO:0010193; P:response to ozone; IEP:RGD.
DR GO; GO:0010477; P:response to sulfur dioxide; IEP:RGD.
DR GO; GO:0033189; P:response to vitamin A; IEP:RGD.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR InterPro; IPR025155; WxxW_domain.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF404; MUCIN-5AC; 1.
DR Pfam; PF08742; C8; 4.
DR Pfam; PF13330; Mucin2_WxxW; 4.
DR Pfam; PF01826; TIL; 2.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 4.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 5.
DR SMART; SM00215; VWC_out; 3.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF57567; Serine protease inhibitors; 4.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 4.
PE 4: Predicted;
KW Copper {ECO:0000256|ARBA:ARBA00023008};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000002494};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..3586
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5035306648"
FT DOMAIN 82..252
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 435..610
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 904..1075
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2875..3057
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 3212..3276
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 3313..3380
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 3458..3546
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 1510..1590
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1752..2121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2270..2585
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3547..3586
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3551..3569
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 3458..3508
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 3586 AA; 382969 MW; 5412C95FB0E499F5 CRC64;
MLHSMGVGRR KLAPFWVLAL ALTFNQHTGQ ALEDTRKSHL EHYSDLSQPQ GHVGTPLNRV
TIIPPLKTIP VVRAFNPAHT RRVCSTWGNF HYKTFDGQVF YFPGLCNYVF SEHCGAAYED
FNIQLRRGLE SNSTTLSRVI MKLDGLVVEL TKSSVLVNNH PVQLPFSQSG VLIELSNGYL
KVVARLGLVF MWNDDDSLLL ELDTKYANKT CGLCGDFNGS PESSEFLSHN VRLTPLEFGN
FQKMDGPTEQ CQDPLPVPQK NCSIRSSICE EILKGQLFSN CAALVDISSY LEACQQDLCL
CESSDPSNCI CHTLAEYSRQ CAHAGGQPQN WRGPNLCPQT CLLNMEYQEC GSPCVDTCSN
PQHSQVCEDH CVAGCFCPEG MVLDDSNQTG CVPVSQCACL YNGTLYAPGT SYSTDCTKCT
CSGGQWSCQE VPCSGTCSVM GGSHISTFDE RQYTVHGDCS YVLCKPYDSN AFTVLAELRK
CGLTESETCL KTVTLNLGGG KTVITVKATG EVFVNQIYTQ LPVSTANATF FRPSTFFIIG
QTNLGLQLEI QLHPIMQVSV RIAPEFRGLT SGLCGNFNSM QADDFQTISG VVEGTAAAFF
NTFKTQAACP NVKNIFEDPC SLSVENEKYA QHWCSQLTDA NGPFSQCHAT VNPSTFFSNC
MFDTCNCEKS EDCLCAALSS YVRACAAKGV LLSDWREGIC EKPTITCPKS MTYQYHISTC
QPTCRSLSEE DVTCHVNFIP VDGCTCPKGT FLDDSGKCVQ ATSCPCYYKG SPVPNGESVH
DNGAICTCTQ GALTCIGGPV LTPVCDAPMI YFDCRNATPG DTGAGCQKSC HTLDMTCYSS
ECVPGCVCPN GLVADGNGSC VVAEDCPCVH NEATYRPGET IQVGCNNCTC ENRMWQCTDK
PCLATCAVYG DGHYITFDGQ RYSFSGDCEY TLLQDNCGGN GSSQDAFRVV TENIPCGTTG
TTCSKGIKIF LGSYELKLSD SKMEVVQKGV GQEPPYFVHQ MGNYLVVETD IGLVLLWDKK
TSIFLRLSPE FKGKVCGLCG NFDDNAINDF TTRSQSVVSD MLEFGNSWKL SPSCPDASVS
KDPCTANPYR KSWAQKQCSI INSATFSACH AHVEPAKYYE ACVNDACACD SGGDCECFCT
AVAAYAQACH EVGVCVSWRT PDICPLFCDY YNPEGQCEWH YQPCGAPCMR TCQNPTGQCL
QDLRGLEGCY PKCPPTAPIF DEGTMQCVSN CTVSPSPCRV NGKLYRPGTP IPSDENCYSC
VCTESGVNCT HDAGACVCTY NGQRYHPGDT IYHTTDGMGG CISAHCRDNG TIERIVDTCS
STSPPPPTTF SFSTTLVMTS MQPSSTHSSP TPSVVYPGSP SKAVLTASSV SSVKTPETTS
VLTTSTSAST LTMPACQEEC LWSPWMDISR PGRGIDSGDF DTLENLHAHG YQICPVPKAV
ECRAEDNPGV PFHALQQHVE CSTTVGLICY NSDQVSGLCD NYQIKIQCCT PINCPTSTGP
TQTTHLIVSR TSTMEDTTSS VPVTSTEHTY STVASSPSTH TPGPSPSSSV PSSSAPARST
PTPVSSTTVK TTLPTTSPMP EPTSATSSVS ISTLGSTLAS PEITHGCRKE LCNWTDWIDG
SYPEPGRSSG DFDTFVNLRA KGYKFCEKPW NVECRAQFFP NTPLQELGQD VTCSREVGLI
CLNKNQLPPI CYNYEIRIEC CTIVNICSTT SATTQPTSHG VSIKTKTNWI TNTYSFSTEN
TSGHSTVINT KTWVTGSTHT TPQPGTRPTP STVSTQDTST SSVQTDSTTS SSHTSSPNTG
RVSTTHTTHT SSPPTGGTSP TSTTHTSSPP TGGTSPTSTT HTSSPPTGGT SPTSTTHTSS
PPTGGTSPTS TTHTSSPPTG GTSPTSTTHT SSPPTGGTSP TSTTHTSSPN TGRVSTTHTT
HTSSPPTGGT SPTSTTHTSS PNTGGTSPTS TTHTSSPNTG GTSPTSTTHT SSPPTGGTSP
TSTTHTSSPN TGGTSPTSTT HTSSPPTGGT SPTSTTHTSS PPTGGTSPTS TTHTSLPPTG
GTSPTSTTHT SSPPTGGTSP TSTTHTSSPP IGESSTISTT DIRTSSTQMA HTTFVGKTST
ISGPGTSTTS IPTSTTGSSS HFEVTRSPVT AHCQPQCNWT KWLDTDFPVP GPHGGDLETY
GNIKKSGERL CPWPEEITRL ECRAKDYPER AMEDLGQVVQ CDPSMGLVCK NSDQGPTFGM
CLNYEVRLLC CHVPEDCLTT KCTTLSSTQS TTVTSSTPGS TSMPQTMTTI VQTDSTTSTS
QTSSPITGRA STIHTTQTTS TSTGEAWTSS VQIASTTSTT HTSSPNTGRV STTHRTHTSS
PPIGGTSPTP TTHTSSPPTG GTSPTSTTHT SSPPIGGTSP TSTTHTSSPP TGGTSPTSTT
HTSSPPTGGT SPTHRTHTSS PPTGGTSPTS TTHTSSPPTG GTSPTSRTHT SSPPTGGTSP
TSTTHTSSPP TGGTSPTSTT HTSSPPTGGT SPTHRTHTSS PPTGGTSPTS TTHTSSPNTG
RVNTTHTTHT SSPPIGESST ISTTDIRTSS TQMAHTTFVG KTSTISGPGT STTSIPTSTT
GSSSHFEVTR SPVTAHCQPQ CNWTKWLDTD FPVPGPHGGD LETYGNIKKS GERLCPWPEE
ITRLECRAKD YPERAMEDLG QVVQCDPRVG LVCKNSDQGP TFGMCLNYEV RLLCCHIPED
CLRTDRTPVT SHKTSFPVVS SSSVSTSSST SPRVHSTTRC FCTMSGQLYP LGTIIYNQTD
LDGHCYYAMC SHDCQVVKGV SQDCPSTMPP RTPTLSTSTA PPVTERDWCN VFPPRLKGET
WPMPNCSQAT CEGNNVVSLS PRQCPEVKEP SCANGYPPLK VDDQDGCCQH YQCQCVCSGW
GDPHYITFDG TYYTFLDNCT YVLVQQIVPV FGDFRVLIDN YYCDLGDSVS CPQSIIVEYH
QDRVVLTRRP VHGVMTNQII FNNKVVSPGF QQNGIIISRV GIKMYVTIQE IGVQVMFSGL
IFSVEVPFNL FANNTEGQCG TCTNDKKDEC RLPGGSIASS CSEMSLHWKV PNQPSCQGPP
PTPTSMVPRS TPTPCSPSPL CQLILSDVFK LCHDIIPPLQ FYEGCLFDYC HMLDLEVVCS
GLELYASLCA AQGVCIPWRS HTNNTCPFTC PENQVYQPCG PSNPHYCYRN DDISLSLAIQ
KAGPKSEGCF CPDDMTLFSS NDSICVPSCQ WCLGPHGEPV EPGHTISINC QDCICKEGTL
TCQEKLCPQP TCPEPGFVPV SVALEAGQCC SQFSCVCNSS HCPPPLHCPE SSSLIVTYEE
GTCCPSQNCS SQKGCDVNGT LYQPGDVVSS SLCERCLCEV SSNAFSDGFV VNCEIELCNT
QCPKGFEYQT TPGHCCGHCI PKTCPFKNSN NSTSLYKPGE FWPEPGNPCV THKCEKFQDV
LTVVTMKIEC PKINCPQDQA QLREDGCCYD CLVPQQKCTV HQRQQIIRQQ NCSSEGPVSL
SYCQGNCGDS TSMYSLEANT VEHTCECCQE LQTSQRSVTL HCDDGSSRTF SYTQVEKCGC
LGQRCHAPGD TSHSESSEQE FKSKESEEPG QQSASRVSED TLGPFQ
//