ID A0A452C8P0_BALAS Unreviewed; 3824 AA.
AC A0A452C8P0;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Mucin-5AC {ECO:0000313|RefSeq:XP_028019452.1};
GN Name=MUC5AC {ECO:0000313|RefSeq:XP_028019452.1};
OS Balaenoptera acutorostrata scammoni (North Pacific minke whale)
OS (Balaenoptera davidsoni).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Mysticeti;
OC Balaenopteridae; Balaenoptera.
OX NCBI_TaxID=310752 {ECO:0000313|Proteomes:UP000261681, ECO:0000313|RefSeq:XP_028019452.1};
RN [1] {ECO:0000313|RefSeq:XP_028019452.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_028019452.1};
RG RefSeq;
RL Submitted (JAN-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_028019452.1; XM_028163651.1.
DR STRING; 310752.A0A452C8P0; -.
DR InParanoid; A0A452C8P0; -.
DR OrthoDB; 2872912at2759; -.
DR Proteomes; UP000261681; Unplaced.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR InterPro; IPR025155; WxxW_domain.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF398; MUCIN-2-LIKE-RELATED; 1.
DR Pfam; PF08742; C8; 4.
DR Pfam; PF13330; Mucin2_WxxW; 6.
DR Pfam; PF01826; TIL; 2.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 4.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 6.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 4.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 4.
PE 4: Predicted;
KW Copper {ECO:0000256|ARBA:ARBA00023008};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000261681};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..3824
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019040857"
FT DOMAIN 78..248
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 431..606
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 900..1071
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 3103..3287
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 3437..3512
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 3548..3615
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 3697..3785
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 36..58
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1338..1388
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1485..1593
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1701..1760
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1872..2008
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2119..2408
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2518..2816
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3008..3032
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3795..3824
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1338..1363
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1364..1380
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 3697..3747
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 3824 AA; 406874 MW; 14CA388839F3D6BC CRC64;
MGAGRRKLAL LWALGLTLAC AQHTGQARDS AAELDYEYPD LPPAPQGPGG NSLRGVTIHP
PLRSSPLVRA LNAAHGGRVC STWGDFHYKT FDGDVFRFPG LCNYVFSAHC GAAYEDFNLQ
LRRGPGPNAT APSRVTMKLD GMVVELTKGS VLVNGRPAQL PFSHSGVFIE QSSSSVKVAA
KLGLVFMWNQ DDSLLLELDA KYANQTCGLC GDFNGVPIFN EFFSHNIKLT PTEFGNLQKM
DGPTERCQDP APEPPTNCLT GFGICEEMLS SELFSDCAAL VDSSSYLEAC RQDLCRCDQA
NPSSCLCHTL AEYSRQCSHA GGLPLDWRRP QLCPQTCPLN MQYRECGSPC ADTCSNPERS
PLCEDHCVAG CFCPEGTVLD DIGHAGCIPV PQCSCVYNGA TYTPGTGYST DCTNCTCSGG
RWRCQEVQCP GTCSVLGGAH FSTFDERHYT VHGDCSYVLA KPCDSSAFTV LAELRRCGLT
DSETCLKSLT LILGGGHMVF VVKASGEVFV NQIYTQLPLS AANVTLFRPS TFFIIAQTQL
GLQLDVQLVP VMQVFVRLGP QLRGQTCGLC GNFNRNQADD FRTISGVVEG TAAAFANTWK
TQAACPNVKN GFEDPCSLSV ENEKYAQRWC SRLTDPHGPF TRCHAAVNPS AYYSNCMFDT
CSCEKSEDCM CAALSSYVRA CAARGVLLGG WRDGVCTKPM TTCPKSLTYL YNISTCQPTC
RARSDADVTC SVSFVPVDGC TCPDGTFLDD MGKCVPATSC PCYRGGSVVP DGESTHEQGV
ICTCTQGTLT CIGGHAPAPV CTPPMVYFDC RNTTPGAAGA GCQKSCHTLD MDCYSSQCVP
GCVCPHGLVA NGDGGCIPVS DCPCVHNEAS YQPGQIIRVG CNTCTCKSRM WQCTDQPCPA
TCAVYGDGHY LTFDGQRYSF SGDCEYTLLQ DHCGGNGSAQ DGFRVITENV PCGTTGTTCS
KAIKIFLGSY ELKLSDGNVE VIEKEQGQLP PFSIRQMGIY LVVDTDVSLV LLWDKGTSIF
LRLSPEFKGR VCGLCGNFDD NALNDFTTRS QSVVGNALEF GNSWKFSPSC PDAQAPKDPC
AANPYRKSWA QKQCSIVNSA TFSACHAHVE SARYYEACVS DACACDSGGD CECFCTAVAA
YAQACHEAGV CVSWRTPDIC PLFCDYYNPQ GQCEWHYQPC GAPFLRTCRN PRGQYLHDVR
GLEGCYPKCP SEAPIFDEDQ MRCVASCPTP TPPTPCRVQG KSYRPGSTVP LEKNCHYCVC
TESGVHCTYD SEACVCTYDG KRFRPGEVIY HTTDGTGGCI SARCGANGTI ERRVEACSPT
PRTPQTTFIF STTPFVMSST LPPSTHPSPS ESTAHTPSSA PATTPGTSPG PTPPTASSPP
SSRPHCGEEC QWSPWLDVSR PGRSIDSGDF DTLENLRAYG YRVCRAPRAV ECRAEGAPEV
PLEVLGQHVE CSPRVGLTCY NRDQASGRCD NYQIRILCCS PRACPTSSTE TTPPVTFSTT
ETGPTPGTSR VTSVPTGSTG SSAPVTTPGP TTASTTSKTP GPGTSTPAPG TSTTGKTTLS
TPSRAPVSTT VTTTTSVSTT TPGPSKGTSS EPTQVTCLQE SCTWTMWIDG SYPGAGRNSG
DFDSFQNLRS KGYKFCAKPK NVECRAEVFP NTPLQALGQN LICDKNVGLI CWNKDQLPPI
CYNYQIRILC CEVVDVCRTK TTPPTTPGPT QTTTGETSTA RATATFSSST KHSTASSAHV
PPRTRVTTRT NERTSPGTTS CQPQCTWTKW FDADVPSPGP HGGDFETYSN ILRRGEKICH
RPEYISALEC RAENHPDVSI QTLGQVVQCS PEVGLVCRNR DQKGKFRMCL NYEVRVLCCE
PQKGCPVTHV TVPSTSSVSV TSHTRTTSHE ATSSATTVQT TSPTAVPITS TTSVETTSTT
SAPTTSTTTV ETASPTTAPT TSTTTQETTS PTTGPTISTT TVETTSTTSA PTTSTSTVET
TSLTSAPSSS TTFVPIPETT PSQPGTTSCQ PQCTWTKWFD ADVPSPGPHG GDFETYSNIL
RRGEKICHRP EYISALECRA ENHPDVSIQT LGQVVQCSPE VGLVCRNRDQ KGKFRMCLNY
EVRVLCCEPQ KGCPVTHVTV PSTSSVSVTS HTRTTSHEAT SSATTVQTTS PTAVPITSTT
SVETTSTTSA PPTSTTTVET ASPTTAPTTS TTTVEITSPT TAPTTSTTTQ ETTSPTPVPT
ISTTTVETTS PITGPKISTT TVETTSATTG PTISTTTVET TSATTGPTIS TTTVETTSAT
TGPTISTTTV ETTSATTGPT ISTTTVETTS ATTGPTISTT TVETTSTTTV ETTGPSTAPR
TSTITVETTS PTTAPTTSTT TVETTSTTSA PTTSTSTVET TSLTSAPSSS TTFVPIPETT
PSQPGTTSCQ PQCTWTKWFD ADVPSPGPHG GDFETYSNIL RRGEKICHRP EYISALECRA
ENHPDVSIQT LGQVVQCSPE VGLVCRNRDQ KGKFRMCLNY EVRVLCCEPQ KGCPVTHVTV
PSTSSVSVTS HTRTTSHEAT SSATTVQTTS PTAVPITSTT SVETTSTTSA PTTSTTTVET
ASPTTAPTAS TTTQETSSPT TGPTISTTTV ETTSATTGPT ISTTTVETTS ATTGPTISTT
TVETTSATTG PTISTTTVET TGPTTAPRTS TTTVETTSPT TAPTTSTTTV EITSPTTAPT
TSTTTQETTS PTPVPTISTT TVETTSPTTA QTTSTTTVET TGPSTAPRTS TITVETTSPT
TAPTTSTTTV ETTSTTLAPT TSTSTVETTS LTSAPSSSTT FVPIPETTPS QPGTTTCQPQ
CTWTKWFDAD FPSPGPHGGD FETYSNILRR GEKICLRPEY FSALECRAEN HPDVSIQTLG
QVVQCSPEVG LVCRNRDQKG KFRMCLNYEV RVLCCEPQKG CPVNSVTPFV TSSPQSSLSP
ESTTLMVSLL TSPSSAAPST STCYCSVAQG FYSAGSIIYQ QTDLSGHCYY AVCSLDCHVV
RRTDHTCPTS TPPPAPATST PASSPSASVP VTQQGCPNAV PPRMKGETWA MPNCSEATCE
GNGAITVRSR QCPKVQELTC ANGYPAVTVA NGEDCCPHYA CQCVCSGWGD PHYITFDGTY
YTFLDNCTYV LVQQILPVFG HFRVLIDNYF CDAEDRLSCP QSIIVEYQQD RVVLTRKPVR
GVMTNEIIFN GEVVRPGFQK DGIVVSQIGI KMYVAIPALG VQVMFSGLIF SVEVPFSKFS
NNTEGQCGTC TNDQKDECRL PGGAVVSSCS DMSGHWNVTY PGQPSCHRPP LMPTTAEPKK
SPTSCPPSPI CLLILSEVFK PCHSVIPPWP YYQGCTFDQC HMPRGDVVCS SLELYAALCA
SLSVCIDWRG LTNHTCSFTC PADKVYRPCG PSNPSYCYMN NSANTLALPE AIPITEGCFC
PEDMMLFSSG TEVCVPANCS SDQWPGGPMS SPFQPGHTVS VGCQECTCKG STRTLSCQTR
PCPLPPACPE PGFVPAPAAL QTGQCCPQYT CVCNATHCPA PADCPEGSGS TLTYKEGACC
PTQNCSWTVC SVNGTLYQPG AIVSSNLCEK CWCELAGGPQ SDTFAINCET QICSTRCPVG
FEYQTRSGQC CGECVQVACV MNTSGSSAHL FYPGESWSDP RSRCVTHECE KHQDGLVVVT
TKKACPPLSC PADTARLSED GCCLSCPQRP PQNRSTCTVY HKLQVLRQPG CSSVEPVRLT
YCQGNCGDTS SVYSLEARTV EHRCRCCREL RASLRNATLR CEDGSSQAFS YTEVEECGCV
GLQCDSHGGL GRSGEVQLEQ GAERSQETGL RHWRRGAPGP RPSQ
//