ID A0A452FHW9_CAPHI Unreviewed; 906 AA.
AC A0A452FHW9;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=Mucin-2 {ECO:0008006|Google:ProtNLM};
GN Name=MUC2 {ECO:0000313|Ensembl:ENSCHIP00000023750.1};
OS Capra hircus (Goat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Capra.
OX NCBI_TaxID=9925 {ECO:0000313|Ensembl:ENSCHIP00000023750.1, ECO:0000313|Proteomes:UP000291000};
RN [1] {ECO:0000313|Ensembl:ENSCHIP00000023750.1, ECO:0000313|Proteomes:UP000291000}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T.,
RA Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M.,
RA Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.,
RA Schroeder S.G., Liu G.E., Dunham M., Shendure J., Sonstegard T.S.,
RA Phillippy A.M., Van Tassell C.P., Smith T.P.;
RT "Polished mammalian reference genomes with single-molecule sequencing and
RT chromosome conformation capture applied to the Capra hircus genome.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCHIP00000023750.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LWLT01000026; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A452FHW9; -.
DR STRING; 9925.ENSCHIP00000023750; -.
DR Ensembl; ENSCHIT00000031610.1; ENSCHIP00000023750.1; ENSCHIG00000021181.1.
DR GeneTree; ENSGT00940000156289; -.
DR OMA; AGICIHW; -.
DR Proteomes; UP000291000; Chromosome 29.
DR Bgee; ENSCHIG00000021181; Expressed in descending colon and 8 other cell types or tissues.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 2.10.90.10; Cystine-knot cytokines; 1.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR029034; Cystine-knot_cytokine.
DR InterPro; IPR006208; Glyco_hormone_CN.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF399; MUCIN-2; 1.
DR Pfam; PF08742; C8; 1.
DR Pfam; PF00007; Cys_knot; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00832; C8; 1.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 2.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 1.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000291000};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 217..400
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 543..612
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 650..717
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 801..886
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 124..147
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 824..878
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT DISULFID 828..880
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 906 AA; 98354 MW; F361BA267962470A CRC64;
MQALPSDSAA QAGGAWVPGA WGSLGGLHWE LAWEAGCWAW VDVSMASPTS CSHKSGGSGR
CGLLSPGLWG RQPECSGAAP ALVSPLAGEV VYSGTHGDTC YYVNCSLECT LEFFNWSCPS
TPTPTPSPST PTSQESGTST TPVPGVPGCP DLDPPRQVCN ESWWMCNCTK ATCKYNNTVE
LVKVPCEPPP MPTCTNGLAP VRVQDPDKCC WHWECDCYCT GWGDPHYVTF DGLYYSYQGN
CTYVLVEEVS PRVDNFGIYV DNYHCDVNDE VSCPRTLIVR HETQEVLIKM VQMAPIVVQV
QVNRQAVALP YTKFGLRVYE SGINYMVDIP ELGALVSYNG LSFSIRLPYR LFGNNTKGQC
GTCTNSTLDD CVLPSGESID NCEVAADSWV VDDPSKPRCP HTSFTTRRPA TSPSSCASPL
CELIKDSLFA HCHALAPPQH YYEACLFDSC YVPGSNLECA SLQTYAALCA QEGICVDWRN
HTGGACPVTC PAHREYRACG PVEEPTCNPS EPNSTRLVEG CFCPEGTTSY APGFDVCVDL
CGCVGPDNVP REYGEHFEFD CKDCICLEGG SGIICKPKTC RPEPRLECEE DGTYPFTEVD
PANTCCNLTS CKCNASLCRE KPPLCSLGFQ VKSEMVPGRC CPLYSCVPKG VCVLEHAEYQ
PGSPVYSSKC QNCVCTDRRD NATQLNVITC TYVPCNTTCS LGFELVDAPG ECCKKCEQTH
CIISRPGQHN LVLKPGDMKS DPLNNCTFFS CMKIHNQLIS SISNITCPEF DPSTCVQGSI
TLMPNGCCRK CIPRNETSVP CAAVPVTREI SHNGCTASVS MNDCSGSCGT FAMYSAEAQA
LDHRCSCCRE QRTSQREVTL RCPDGGTLRY TYTHVDSCLC QDTVCELPAQ RRARRSSALG
VAPGRG
//