GenomeNet

Database: UniProt
Entry: A0A452C8P0_BALAS
LinkDB: A0A452C8P0_BALAS
Original site: A0A452C8P0_BALAS 
ID   A0A452C8P0_BALAS        Unreviewed;      3824 AA.
AC   A0A452C8P0;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   SubName: Full=Mucin-5AC {ECO:0000313|RefSeq:XP_028019452.1};
GN   Name=MUC5AC {ECO:0000313|RefSeq:XP_028019452.1};
OS   Balaenoptera acutorostrata scammoni (North Pacific minke whale)
OS   (Balaenoptera davidsoni).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Mysticeti;
OC   Balaenopteridae; Balaenoptera.
OX   NCBI_TaxID=310752 {ECO:0000313|Proteomes:UP000261681, ECO:0000313|RefSeq:XP_028019452.1};
RN   [1] {ECO:0000313|RefSeq:XP_028019452.1}
RP   IDENTIFICATION.
RC   TISSUE=Muscle {ECO:0000313|RefSeq:XP_028019452.1};
RG   RefSeq;
RL   Submitted (JAN-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_028019452.1; XM_028163651.1.
DR   STRING; 310752.A0A452C8P0; -.
DR   InParanoid; A0A452C8P0; -.
DR   OrthoDB; 2872912at2759; -.
DR   Proteomes; UP000261681; Unplaced.
DR   CDD; cd19941; TIL; 3.
DR   Gene3D; 2.10.25.10; Laminin; 4.
DR   InterPro; IPR006207; Cys_knot_C.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   InterPro; IPR025155; WxxW_domain.
DR   PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR   PANTHER; PTHR11339:SF398; MUCIN-2-LIKE-RELATED; 1.
DR   Pfam; PF08742; C8; 4.
DR   Pfam; PF13330; Mucin2_WxxW; 6.
DR   Pfam; PF01826; TIL; 2.
DR   Pfam; PF00094; VWD; 4.
DR   SMART; SM00832; C8; 4.
DR   SMART; SM00041; CT; 1.
DR   SMART; SM00214; VWC; 6.
DR   SMART; SM00215; VWC_out; 2.
DR   SMART; SM00216; VWD; 4.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 4.
DR   PROSITE; PS01185; CTCK_1; 1.
DR   PROSITE; PS01225; CTCK_2; 1.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 2.
DR   PROSITE; PS51233; VWFD; 4.
PE   4: Predicted;
KW   Copper {ECO:0000256|ARBA:ARBA00023008};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000261681};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..3824
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019040857"
FT   DOMAIN          78..248
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          431..606
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          900..1071
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          3103..3287
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          3437..3512
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          3548..3615
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          3697..3785
FT                   /note="CTCK"
FT                   /evidence="ECO:0000259|PROSITE:PS01225"
FT   REGION          36..58
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1338..1388
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1485..1593
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1701..1760
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1872..2008
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2119..2408
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2518..2816
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3008..3032
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3795..3824
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1338..1363
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1364..1380
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        3697..3747
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ   SEQUENCE   3824 AA;  406874 MW;  14CA388839F3D6BC CRC64;
     MGAGRRKLAL LWALGLTLAC AQHTGQARDS AAELDYEYPD LPPAPQGPGG NSLRGVTIHP
     PLRSSPLVRA LNAAHGGRVC STWGDFHYKT FDGDVFRFPG LCNYVFSAHC GAAYEDFNLQ
     LRRGPGPNAT APSRVTMKLD GMVVELTKGS VLVNGRPAQL PFSHSGVFIE QSSSSVKVAA
     KLGLVFMWNQ DDSLLLELDA KYANQTCGLC GDFNGVPIFN EFFSHNIKLT PTEFGNLQKM
     DGPTERCQDP APEPPTNCLT GFGICEEMLS SELFSDCAAL VDSSSYLEAC RQDLCRCDQA
     NPSSCLCHTL AEYSRQCSHA GGLPLDWRRP QLCPQTCPLN MQYRECGSPC ADTCSNPERS
     PLCEDHCVAG CFCPEGTVLD DIGHAGCIPV PQCSCVYNGA TYTPGTGYST DCTNCTCSGG
     RWRCQEVQCP GTCSVLGGAH FSTFDERHYT VHGDCSYVLA KPCDSSAFTV LAELRRCGLT
     DSETCLKSLT LILGGGHMVF VVKASGEVFV NQIYTQLPLS AANVTLFRPS TFFIIAQTQL
     GLQLDVQLVP VMQVFVRLGP QLRGQTCGLC GNFNRNQADD FRTISGVVEG TAAAFANTWK
     TQAACPNVKN GFEDPCSLSV ENEKYAQRWC SRLTDPHGPF TRCHAAVNPS AYYSNCMFDT
     CSCEKSEDCM CAALSSYVRA CAARGVLLGG WRDGVCTKPM TTCPKSLTYL YNISTCQPTC
     RARSDADVTC SVSFVPVDGC TCPDGTFLDD MGKCVPATSC PCYRGGSVVP DGESTHEQGV
     ICTCTQGTLT CIGGHAPAPV CTPPMVYFDC RNTTPGAAGA GCQKSCHTLD MDCYSSQCVP
     GCVCPHGLVA NGDGGCIPVS DCPCVHNEAS YQPGQIIRVG CNTCTCKSRM WQCTDQPCPA
     TCAVYGDGHY LTFDGQRYSF SGDCEYTLLQ DHCGGNGSAQ DGFRVITENV PCGTTGTTCS
     KAIKIFLGSY ELKLSDGNVE VIEKEQGQLP PFSIRQMGIY LVVDTDVSLV LLWDKGTSIF
     LRLSPEFKGR VCGLCGNFDD NALNDFTTRS QSVVGNALEF GNSWKFSPSC PDAQAPKDPC
     AANPYRKSWA QKQCSIVNSA TFSACHAHVE SARYYEACVS DACACDSGGD CECFCTAVAA
     YAQACHEAGV CVSWRTPDIC PLFCDYYNPQ GQCEWHYQPC GAPFLRTCRN PRGQYLHDVR
     GLEGCYPKCP SEAPIFDEDQ MRCVASCPTP TPPTPCRVQG KSYRPGSTVP LEKNCHYCVC
     TESGVHCTYD SEACVCTYDG KRFRPGEVIY HTTDGTGGCI SARCGANGTI ERRVEACSPT
     PRTPQTTFIF STTPFVMSST LPPSTHPSPS ESTAHTPSSA PATTPGTSPG PTPPTASSPP
     SSRPHCGEEC QWSPWLDVSR PGRSIDSGDF DTLENLRAYG YRVCRAPRAV ECRAEGAPEV
     PLEVLGQHVE CSPRVGLTCY NRDQASGRCD NYQIRILCCS PRACPTSSTE TTPPVTFSTT
     ETGPTPGTSR VTSVPTGSTG SSAPVTTPGP TTASTTSKTP GPGTSTPAPG TSTTGKTTLS
     TPSRAPVSTT VTTTTSVSTT TPGPSKGTSS EPTQVTCLQE SCTWTMWIDG SYPGAGRNSG
     DFDSFQNLRS KGYKFCAKPK NVECRAEVFP NTPLQALGQN LICDKNVGLI CWNKDQLPPI
     CYNYQIRILC CEVVDVCRTK TTPPTTPGPT QTTTGETSTA RATATFSSST KHSTASSAHV
     PPRTRVTTRT NERTSPGTTS CQPQCTWTKW FDADVPSPGP HGGDFETYSN ILRRGEKICH
     RPEYISALEC RAENHPDVSI QTLGQVVQCS PEVGLVCRNR DQKGKFRMCL NYEVRVLCCE
     PQKGCPVTHV TVPSTSSVSV TSHTRTTSHE ATSSATTVQT TSPTAVPITS TTSVETTSTT
     SAPTTSTTTV ETASPTTAPT TSTTTQETTS PTTGPTISTT TVETTSTTSA PTTSTSTVET
     TSLTSAPSSS TTFVPIPETT PSQPGTTSCQ PQCTWTKWFD ADVPSPGPHG GDFETYSNIL
     RRGEKICHRP EYISALECRA ENHPDVSIQT LGQVVQCSPE VGLVCRNRDQ KGKFRMCLNY
     EVRVLCCEPQ KGCPVTHVTV PSTSSVSVTS HTRTTSHEAT SSATTVQTTS PTAVPITSTT
     SVETTSTTSA PPTSTTTVET ASPTTAPTTS TTTVEITSPT TAPTTSTTTQ ETTSPTPVPT
     ISTTTVETTS PITGPKISTT TVETTSATTG PTISTTTVET TSATTGPTIS TTTVETTSAT
     TGPTISTTTV ETTSATTGPT ISTTTVETTS ATTGPTISTT TVETTSTTTV ETTGPSTAPR
     TSTITVETTS PTTAPTTSTT TVETTSTTSA PTTSTSTVET TSLTSAPSSS TTFVPIPETT
     PSQPGTTSCQ PQCTWTKWFD ADVPSPGPHG GDFETYSNIL RRGEKICHRP EYISALECRA
     ENHPDVSIQT LGQVVQCSPE VGLVCRNRDQ KGKFRMCLNY EVRVLCCEPQ KGCPVTHVTV
     PSTSSVSVTS HTRTTSHEAT SSATTVQTTS PTAVPITSTT SVETTSTTSA PTTSTTTVET
     ASPTTAPTAS TTTQETSSPT TGPTISTTTV ETTSATTGPT ISTTTVETTS ATTGPTISTT
     TVETTSATTG PTISTTTVET TGPTTAPRTS TTTVETTSPT TAPTTSTTTV EITSPTTAPT
     TSTTTQETTS PTPVPTISTT TVETTSPTTA QTTSTTTVET TGPSTAPRTS TITVETTSPT
     TAPTTSTTTV ETTSTTLAPT TSTSTVETTS LTSAPSSSTT FVPIPETTPS QPGTTTCQPQ
     CTWTKWFDAD FPSPGPHGGD FETYSNILRR GEKICLRPEY FSALECRAEN HPDVSIQTLG
     QVVQCSPEVG LVCRNRDQKG KFRMCLNYEV RVLCCEPQKG CPVNSVTPFV TSSPQSSLSP
     ESTTLMVSLL TSPSSAAPST STCYCSVAQG FYSAGSIIYQ QTDLSGHCYY AVCSLDCHVV
     RRTDHTCPTS TPPPAPATST PASSPSASVP VTQQGCPNAV PPRMKGETWA MPNCSEATCE
     GNGAITVRSR QCPKVQELTC ANGYPAVTVA NGEDCCPHYA CQCVCSGWGD PHYITFDGTY
     YTFLDNCTYV LVQQILPVFG HFRVLIDNYF CDAEDRLSCP QSIIVEYQQD RVVLTRKPVR
     GVMTNEIIFN GEVVRPGFQK DGIVVSQIGI KMYVAIPALG VQVMFSGLIF SVEVPFSKFS
     NNTEGQCGTC TNDQKDECRL PGGAVVSSCS DMSGHWNVTY PGQPSCHRPP LMPTTAEPKK
     SPTSCPPSPI CLLILSEVFK PCHSVIPPWP YYQGCTFDQC HMPRGDVVCS SLELYAALCA
     SLSVCIDWRG LTNHTCSFTC PADKVYRPCG PSNPSYCYMN NSANTLALPE AIPITEGCFC
     PEDMMLFSSG TEVCVPANCS SDQWPGGPMS SPFQPGHTVS VGCQECTCKG STRTLSCQTR
     PCPLPPACPE PGFVPAPAAL QTGQCCPQYT CVCNATHCPA PADCPEGSGS TLTYKEGACC
     PTQNCSWTVC SVNGTLYQPG AIVSSNLCEK CWCELAGGPQ SDTFAINCET QICSTRCPVG
     FEYQTRSGQC CGECVQVACV MNTSGSSAHL FYPGESWSDP RSRCVTHECE KHQDGLVVVT
     TKKACPPLSC PADTARLSED GCCLSCPQRP PQNRSTCTVY HKLQVLRQPG CSSVEPVRLT
     YCQGNCGDTS SVYSLEARTV EHRCRCCREL RASLRNATLR CEDGSSQAFS YTEVEECGCV
     GLQCDSHGGL GRSGEVQLEQ GAERSQETGL RHWRRGAPGP RPSQ
//
DBGET integrated database retrieval system