ID A0A0D9R813_CHLSB Unreviewed; 2117 AA.
AC A0A0D9R813;
DT 27-MAY-2015, integrated into UniProtKB/TrEMBL.
DT 27-MAY-2015, sequence version 1.
DT 08-NOV-2023, entry version 32.
DE RecName: Full=Mucin 4, cell surface associated {ECO:0008006|Google:ProtNLM};
OS Chlorocebus sabaeus (Green monkey) (Cercopithecus sabaeus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Chlorocebus.
OX NCBI_TaxID=60711 {ECO:0000313|Ensembl:ENSCSAP00000004752.1, ECO:0000313|Proteomes:UP000029965};
RN [1] {ECO:0000313|Ensembl:ENSCSAP00000004752.1, ECO:0000313|Proteomes:UP000029965}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAP00000004752.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AQIB01076854; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01076855; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01076856; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01076857; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01076858; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01076859; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01076860; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01076861; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 60711.ENSCSAP00000004752; -.
DR Ensembl; ENSCSAT00000006553.1; ENSCSAP00000004752.1; ENSCSAG00000008513.1.
DR eggNOG; ENOG502QUJ0; Eukaryota.
DR GeneTree; ENSGT00730000110943; -.
DR OMA; DRFSQDT; -.
DR Proteomes; UP000029965; Chromosome 15.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR InterPro; IPR005533; AMOP_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR13802; MUCIN 4-RELATED; 1.
DR PANTHER; PTHR13802:SF52; MUCIN-4; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00723; AMOP; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00216; VWD; 1.
DR PROSITE; PS50856; AMOP; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000029965};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2073..2099
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1258..1373
FT /note="AMOP"
FT /evidence="ECO:0000259|PROSITE:PS50856"
FT DOMAIN 1385..1585
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1823..1862
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2026..2065
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 1..62
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 121..228
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 274..311
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 324..358
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 433..659
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 726..829
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 849..891
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 903..932
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 971..1039
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1091..1127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 281..311
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 507..659
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 971..1032
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1852..1861
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2117 AA; 223980 MW; 5DE56AB470126D16 CRC64;
PGTTGETTTG SKNPTAVTSA GSVAMTLEGQ STATYSRTSN QDISASSQNH QTKSMETTRE
SETGTLTQMA TLTFSSSPSV HNVTDTVSQE TSTPDEMTTS FPSSVTNTLM MTSKTIAMTT
STDSTLGNIE ETSTPGTESS TPVISAVSIT AELEGQSHTT SSRTSIQDTS ASSQNHQTQS
TDTTRESQTS TLTQRTTSIP SFSPSVHNVK GTISQKTFPP GETTTSSLFS VTNTPMMISK
KITTTTSTDS TLGNTGETSV PVTRSLMPVT SAAPITAEPE GQLPATSSRT STQDTTGFSQ
NNHQTQSIET TRVSQINTLT PVTTSTVLSS PSGFNPSGTV SQETFPSGET TTSSPSSVSN
TFLVTSEVFR MPTSRDSTLG NTEEISLPIR GTISAITSRV STIWLSDTLS TALSPSSLPP
KISTTFHTQQ SEGAETIGRP HERSSVSPGV SHETFPPHET ITWPSSFSSE DRTTRSPKEL
PSTPTGAATR LVTGSPIVPG TAGTIPRVPS EVSTIGDSRQ PTTHPSHSTT LLETTGAGAQ
TQWTQETGTT GEVPINSPSY SVTQMINTTP SSSPMLNGHT SQQITMAPST NHSTIHSTNT
SPQESLAVSQ RGHTQAPQTT QESQTTRSIS PVTDTKTVTT PGSSFTASGH SSSEIAPQDA
PTISAATTFA PAPTGDGHTT QAPTTVLQAA PSSPDSTLAP SGGASLSITG ALPGVVSTPG
VLEGRWTSAS ASTSPDTAAA MTQTHQAEGT EASGETQTSE PASSGSGTTS EGTATLSSSR
ASGTTPSGSE GISITGVTTW FSPNPSRDSH TTQSTTELLS ASASHDATPV STGMVSAIVP
STFPVTLSEA STAGRPTGHS SPTSPSASPQ ETAAISQMAQ TQRTGTSRGS DTISLVSQVT
DTFSTVTPTP PSITSSGLTF PQTETHALSP SGSGTTFNMA LISNTFPGPA VTSNSTTLLT
GHATPLAVSS ASSASTVSSG SPLKTETSAT STPPTTSETA TSTTAPSPPP TTSQAATSAT
PSTATHTRST AAPVPVPSEK GASLLPYGAG AGDLEFVRRT VDFTSPPFRP ATGFPLGSSL
RDSLYVSPGG GLSSLGRFSP RLRDHPEEAG EEAASAGPLV RPTAPLMSGV CRGTRGARRG
RTPAGPSAAL RGLPRSKTLR LGNGAVPALP AFWPPGPCPL DEVGGAVLAP AALRGRLWDA
PLQLGSATLI PFTQIGDGYF ENSPLMSQPV WEKYRPDRFL NSSSGLRGLQ FYRLHREERP
NYRLECLQWL KSQPQWPSWG WNQVSCPCSW QQGRWDLRFQ PVSIGLLGLG SRQLCSFTSW
RGGVCCSYGP WGEFREGWHV QHPWQFAPEL EPQSWCCLWN DKPDLCALYQ QRRPRVGCAT
YRPPRPAWMF GDPHITTLDG VNYTFNGLGD FLLVRTQDGN SSFLLQGRTA QTGSAQATNF
IAFAAQYRSG SLGPVTVQWL LEPHDGIHVL LDNQTVTFEP GHGDGGGQET VNATGVVLSR
SGSVVSASFD GWAAVSVVAL SGILHASASL PPEYQDRTEG LLGVWNNNPE DDFRMPNGST
VPPGSSEETL FHYGMTWQIN GTGLLGKRND QLPSNFTPVF YSQLQKNSSW AEDLISSCNG
DSSCIYDTLA LRNASIGLHT RAVSKTYKQV NATLNQYPPS INGGRVIEAY KGQTTLIQYT
SNAEDATFTL RDSCTDFKLF ENGTLLWTPK SLEPFTLEIL ARSAKIGLAS ALQPRTVVCH
CNAESQCLYN QTSRVGNSSL EMAGCKCDAG TFGRYCERSK DACEEPCFPS VRCIPGKGCG
ACPPHLTGDG RHCAALGSSL LCRNQSCPVN YCYNQGRCYI SQTLGCQPTC TCPPAFTDSL
CFLAGNNFSP TVHLEPPLRV IQLLLSEEEN ASMAEVNASV AYILGTLDMR AFLRNSQVNR
IDSPAPASGS PIQHWRVISE FQYRPWGPVI DFLNNQLLAA VVEAFLYQVP RRSEVSRNHV
VFQPISREDV WDVTALNVST LKAYFECNGY KGYDLVYSPQ SGFTCVSPCS RGYCDHGGQC
QHLPSGPRCS CVSFSIYTAW GEHCEHLSMK LDAFFGIFFG SLGGLLLLGV GVFVVLRFWN
CSRTSFSYLL RSAEASP
//