GenomeNet

Database: UniProt
Entry: A0A0G2K1Y4_RAT
ID   A0A0G2K1Y4_RAT          Unreviewed;      3586 AA.
AC   A0A0G2K1Y4;
DT   22-JUL-2015, integrated into UniProtKB/TrEMBL.
DT   25-MAY-2022, sequence version 2.
DT   27-MAR-2024, entry version 37.
DE   SubName: Full=Mucin 5AC, oligomeric mucus/gel-forming {ECO:0000313|Ensembl:ENSRNOP00000072033.2};
GN   Name=Muc5ac {ECO:0000313|Ensembl:ENSRNOP00000072033.2,
GN   ECO:0000313|RGD:62001};
OS   Rattus norvegicus (Rat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Rattus.
OX   NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000072033.2, ECO:0000313|Proteomes:UP000002494};
RN   [1] {ECO:0000313|Ensembl:ENSRNOP00000072033.2, ECO:0000313|Proteomes:UP000002494}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000072033.2,
RC   ECO:0000313|Proteomes:UP000002494};
RX   PubMed=15057822; DOI=10.1038/nature02426;
RG   Rat Genome Sequencing Project Consortium;
RA   Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J.,
RA   Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G.,
RA   Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G.,
RA   Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G.,
RA   Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S.,
RA   Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T.,
RA   Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., Smith D.,
RA   Lee H.-M., Gustafson E., Cahill P., Kana A., Doucette-Stamm L.,
RA   Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., Green E.D.,
RA   Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., Zhu B., Marra M.,
RA   Schein J., Bosdet I., Fjell C., Jones S., Krzywinski M., Mathewson C.,
RA   Siddiqui A., Wye N., McPherson J., Zhao S., Fraser C.M., Shetty J.,
RA   Shatsman S., Geer K., Chen Y., Abramzon S., Nierman W.C., Havlak P.H.,
RA   Chen R., Durbin K.J., Egan A., Ren Y., Song X.-Z., Li B., Liu Y., Qin X.,
RA   Cawley S., Cooney A.J., D'Souza L.M., Martin K., Wu J.Q.,
RA   Gonzalez-Garay M.L., Jackson A.R., Kalafus K.J., McLeod M.P.,
RA   Milosavljevic A., Virk D., Volkov A., Wheeler D.A., Zhang Z., Bailey J.A.,
RA   Eichler E.E., Tuzun E., Birney E., Mongin E., Ureta-Vidal A., Woodwark C.,
RA   Zdobnov E., Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J.,
RA   Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., Schmidt J.,
RA   Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., Abril J.F.,
RA   Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., Poliakov A.,
RA   Huebner N., Ganten D., Goesele C., Hummel O., Kreitler T., Lee Y.-A.,
RA   Monti J., Schulz H., Zimdahl H., Himmelbauer H., Lehrach H., Jacob H.J.,
RA   Bromberg S., Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E.,
RA   Lazar J., Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M.,
RA   Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., Webber C.,
RA   Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., Elnitski L.,
RA   Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., Miller W.,
RA   Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., Zhang Y.,
RA   Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., Clarke L., Curwen V.,
RA   Durbin R.M., Eyras E., Searle S.M., Cooper G.M., Batzoglou S., Brudno M.,
RA   Sidow A., Stone E.A., Payseur B.A., Bourque G., Lopez-Otin C., Puente X.S.,
RA   Chakrabarti K., Chatterji S., Dewey C., Pachter L., Bray N., Yap V.B.,
RA   Caspi A., Tesler G., Pevzner P.A., Haussler D., Roskin K.M., Baertsch R.,
RA   Clawson H., Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J.,
RA   Rosenbloom K.R., Trumbower H., Weirauch M., Cooper D.N., Stenson P.D.,
RA   Ma B., Brent M., Arumugam M., Shteynberg D., Copley R.R., Taylor M.S.,
RA   Riethman H., Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S.,
RA   Mockrin S., Collins F.S.;
RT   "Genome sequence of the Brown Norway rat yields insights into mammalian
RT   evolution.";
RL   Nature 428:493-521(2004).
RN   [2] {ECO:0000313|Ensembl:ENSRNOP00000072033.2}
RP   IDENTIFICATION.
RC   STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000072033.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A0G2K1Y4; -.
DR   STRING; 10116.ENSRNOP00000072033; -.
DR   Ensembl; ENSRNOT00000085159.2; ENSRNOP00000072033.2; ENSRNOG00000055996.2.
DR   AGR; RGD:62001; -.
DR   RGD; 62001; Muc5ac.
DR   VEuPathDB; HostDB:ENSRNOG00000055996; -.
DR   GeneTree; ENSGT00940000156076; -.
DR   InParanoid; A0A0G2K1Y4; -.
DR   Reactome; R-RNO-913709; O-linked glycosylation of mucins.
DR   Reactome; R-RNO-977068; Termination of O-glycan biosynthesis.
DR   Proteomes; UP000002494; Chromosome 1.
DR   Bgee; ENSRNOG00000055996; Expressed in stomach and 1 other cell type or tissue.
DR   GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR   GO; GO:0005615; C:extracellular space; IDA:RGD.
DR   GO; GO:0070701; C:mucus layer; ISO:RGD.
DR   GO; GO:0071364; P:cellular response to epidermal growth factor stimulus; IEP:RGD.
DR   GO; GO:0071385; P:cellular response to glucocorticoid stimulus; IEP:RGD.
DR   GO; GO:0071300; P:cellular response to retinoic acid; IEP:RGD.
DR   GO; GO:0030855; P:epithelial cell differentiation; IEP:RGD.
DR   GO; GO:0036438; P:maintenance of lens transparency; ISO:RGD.
DR   GO; GO:0032496; P:response to lipopolysaccharide; IEP:RGD.
DR   GO; GO:0010193; P:response to ozone; IEP:RGD.
DR   GO; GO:0010477; P:response to sulfur dioxide; IEP:RGD.
DR   GO; GO:0033189; P:response to vitamin A; IEP:RGD.
DR   CDD; cd19941; TIL; 3.
DR   Gene3D; 2.10.25.10; Laminin; 4.
DR   InterPro; IPR006207; Cys_knot_C.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   InterPro; IPR025155; WxxW_domain.
DR   PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR   PANTHER; PTHR11339:SF404; MUCIN-5AC; 1.
DR   Pfam; PF08742; C8; 4.
DR   Pfam; PF13330; Mucin2_WxxW; 4.
DR   Pfam; PF01826; TIL; 2.
DR   Pfam; PF00094; VWD; 4.
DR   SMART; SM00832; C8; 4.
DR   SMART; SM00041; CT; 1.
DR   SMART; SM00214; VWC; 5.
DR   SMART; SM00215; VWC_out; 3.
DR   SMART; SM00216; VWD; 4.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 4.
DR   PROSITE; PS01185; CTCK_1; 1.
DR   PROSITE; PS01225; CTCK_2; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 2.
DR   PROSITE; PS51233; VWFD; 4.
PE   4: Predicted;
KW   Copper {ECO:0000256|ARBA:ARBA00023008};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000002494};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..31
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           32..3586
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5035306648"
FT   DOMAIN          82..252
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          435..610
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          904..1075
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          2875..3057
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          3212..3276
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          3313..3380
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          3458..3546
FT                   /note="CTCK"
FT                   /evidence="ECO:0000259|PROSITE:PS01225"
FT   REGION          1510..1590
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1752..2121
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2270..2585
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3547..3586
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3551..3569
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        3458..3508
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ   SEQUENCE   3586 AA;  382969 MW;  5412C95FB0E499F5 CRC64;
     MLHSMGVGRR KLAPFWVLAL ALTFNQHTGQ ALEDTRKSHL EHYSDLSQPQ GHVGTPLNRV
     TIIPPLKTIP VVRAFNPAHT RRVCSTWGNF HYKTFDGQVF YFPGLCNYVF SEHCGAAYED
     FNIQLRRGLE SNSTTLSRVI MKLDGLVVEL TKSSVLVNNH PVQLPFSQSG VLIELSNGYL
     KVVARLGLVF MWNDDDSLLL ELDTKYANKT CGLCGDFNGS PESSEFLSHN VRLTPLEFGN
     FQKMDGPTEQ CQDPLPVPQK NCSIRSSICE EILKGQLFSN CAALVDISSY LEACQQDLCL
     CESSDPSNCI CHTLAEYSRQ CAHAGGQPQN WRGPNLCPQT CLLNMEYQEC GSPCVDTCSN
     PQHSQVCEDH CVAGCFCPEG MVLDDSNQTG CVPVSQCACL YNGTLYAPGT SYSTDCTKCT
     CSGGQWSCQE VPCSGTCSVM GGSHISTFDE RQYTVHGDCS YVLCKPYDSN AFTVLAELRK
     CGLTESETCL KTVTLNLGGG KTVITVKATG EVFVNQIYTQ LPVSTANATF FRPSTFFIIG
     QTNLGLQLEI QLHPIMQVSV RIAPEFRGLT SGLCGNFNSM QADDFQTISG VVEGTAAAFF
     NTFKTQAACP NVKNIFEDPC SLSVENEKYA QHWCSQLTDA NGPFSQCHAT VNPSTFFSNC
     MFDTCNCEKS EDCLCAALSS YVRACAAKGV LLSDWREGIC EKPTITCPKS MTYQYHISTC
     QPTCRSLSEE DVTCHVNFIP VDGCTCPKGT FLDDSGKCVQ ATSCPCYYKG SPVPNGESVH
     DNGAICTCTQ GALTCIGGPV LTPVCDAPMI YFDCRNATPG DTGAGCQKSC HTLDMTCYSS
     ECVPGCVCPN GLVADGNGSC VVAEDCPCVH NEATYRPGET IQVGCNNCTC ENRMWQCTDK
     PCLATCAVYG DGHYITFDGQ RYSFSGDCEY TLLQDNCGGN GSSQDAFRVV TENIPCGTTG
     TTCSKGIKIF LGSYELKLSD SKMEVVQKGV GQEPPYFVHQ MGNYLVVETD IGLVLLWDKK
     TSIFLRLSPE FKGKVCGLCG NFDDNAINDF TTRSQSVVSD MLEFGNSWKL SPSCPDASVS
     KDPCTANPYR KSWAQKQCSI INSATFSACH AHVEPAKYYE ACVNDACACD SGGDCECFCT
     AVAAYAQACH EVGVCVSWRT PDICPLFCDY YNPEGQCEWH YQPCGAPCMR TCQNPTGQCL
     QDLRGLEGCY PKCPPTAPIF DEGTMQCVSN CTVSPSPCRV NGKLYRPGTP IPSDENCYSC
     VCTESGVNCT HDAGACVCTY NGQRYHPGDT IYHTTDGMGG CISAHCRDNG TIERIVDTCS
     STSPPPPTTF SFSTTLVMTS MQPSSTHSSP TPSVVYPGSP SKAVLTASSV SSVKTPETTS
     VLTTSTSAST LTMPACQEEC LWSPWMDISR PGRGIDSGDF DTLENLHAHG YQICPVPKAV
     ECRAEDNPGV PFHALQQHVE CSTTVGLICY NSDQVSGLCD NYQIKIQCCT PINCPTSTGP
     TQTTHLIVSR TSTMEDTTSS VPVTSTEHTY STVASSPSTH TPGPSPSSSV PSSSAPARST
     PTPVSSTTVK TTLPTTSPMP EPTSATSSVS ISTLGSTLAS PEITHGCRKE LCNWTDWIDG
     SYPEPGRSSG DFDTFVNLRA KGYKFCEKPW NVECRAQFFP NTPLQELGQD VTCSREVGLI
     CLNKNQLPPI CYNYEIRIEC CTIVNICSTT SATTQPTSHG VSIKTKTNWI TNTYSFSTEN
     TSGHSTVINT KTWVTGSTHT TPQPGTRPTP STVSTQDTST SSVQTDSTTS SSHTSSPNTG
     RVSTTHTTHT SSPPTGGTSP TSTTHTSSPP TGGTSPTSTT HTSSPPTGGT SPTSTTHTSS
     PPTGGTSPTS TTHTSSPPTG GTSPTSTTHT SSPPTGGTSP TSTTHTSSPN TGRVSTTHTT
     HTSSPPTGGT SPTSTTHTSS PNTGGTSPTS TTHTSSPNTG GTSPTSTTHT SSPPTGGTSP
     TSTTHTSSPN TGGTSPTSTT HTSSPPTGGT SPTSTTHTSS PPTGGTSPTS TTHTSLPPTG
     GTSPTSTTHT SSPPTGGTSP TSTTHTSSPP IGESSTISTT DIRTSSTQMA HTTFVGKTST
     ISGPGTSTTS IPTSTTGSSS HFEVTRSPVT AHCQPQCNWT KWLDTDFPVP GPHGGDLETY
     GNIKKSGERL CPWPEEITRL ECRAKDYPER AMEDLGQVVQ CDPSMGLVCK NSDQGPTFGM
     CLNYEVRLLC CHVPEDCLTT KCTTLSSTQS TTVTSSTPGS TSMPQTMTTI VQTDSTTSTS
     QTSSPITGRA STIHTTQTTS TSTGEAWTSS VQIASTTSTT HTSSPNTGRV STTHRTHTSS
     PPIGGTSPTP TTHTSSPPTG GTSPTSTTHT SSPPIGGTSP TSTTHTSSPP TGGTSPTSTT
     HTSSPPTGGT SPTHRTHTSS PPTGGTSPTS TTHTSSPPTG GTSPTSRTHT SSPPTGGTSP
     TSTTHTSSPP TGGTSPTSTT HTSSPPTGGT SPTHRTHTSS PPTGGTSPTS TTHTSSPNTG
     RVNTTHTTHT SSPPIGESST ISTTDIRTSS TQMAHTTFVG KTSTISGPGT STTSIPTSTT
     GSSSHFEVTR SPVTAHCQPQ CNWTKWLDTD FPVPGPHGGD LETYGNIKKS GERLCPWPEE
     ITRLECRAKD YPERAMEDLG QVVQCDPRVG LVCKNSDQGP TFGMCLNYEV RLLCCHIPED
     CLRTDRTPVT SHKTSFPVVS SSSVSTSSST SPRVHSTTRC FCTMSGQLYP LGTIIYNQTD
     LDGHCYYAMC SHDCQVVKGV SQDCPSTMPP RTPTLSTSTA PPVTERDWCN VFPPRLKGET
     WPMPNCSQAT CEGNNVVSLS PRQCPEVKEP SCANGYPPLK VDDQDGCCQH YQCQCVCSGW
     GDPHYITFDG TYYTFLDNCT YVLVQQIVPV FGDFRVLIDN YYCDLGDSVS CPQSIIVEYH
     QDRVVLTRRP VHGVMTNQII FNNKVVSPGF QQNGIIISRV GIKMYVTIQE IGVQVMFSGL
     IFSVEVPFNL FANNTEGQCG TCTNDKKDEC RLPGGSIASS CSEMSLHWKV PNQPSCQGPP
     PTPTSMVPRS TPTPCSPSPL CQLILSDVFK LCHDIIPPLQ FYEGCLFDYC HMLDLEVVCS
     GLELYASLCA AQGVCIPWRS HTNNTCPFTC PENQVYQPCG PSNPHYCYRN DDISLSLAIQ
     KAGPKSEGCF CPDDMTLFSS NDSICVPSCQ WCLGPHGEPV EPGHTISINC QDCICKEGTL
     TCQEKLCPQP TCPEPGFVPV SVALEAGQCC SQFSCVCNSS HCPPPLHCPE SSSLIVTYEE
     GTCCPSQNCS SQKGCDVNGT LYQPGDVVSS SLCERCLCEV SSNAFSDGFV VNCEIELCNT
     QCPKGFEYQT TPGHCCGHCI PKTCPFKNSN NSTSLYKPGE FWPEPGNPCV THKCEKFQDV
     LTVVTMKIEC PKINCPQDQA QLREDGCCYD CLVPQQKCTV HQRQQIIRQQ NCSSEGPVSL
     SYCQGNCGDS TSMYSLEANT VEHTCECCQE LQTSQRSVTL HCDDGSSRTF SYTQVEKCGC
     LGQRCHAPGD TSHSESSEQE FKSKESEEPG QQSASRVSED TLGPFQ
//