GenomeNet

Database: UniProt
Entry: A0A452FHW9_CAPHI
LinkDB: A0A452FHW9_CAPHI
Original site: A0A452FHW9_CAPHI 
ID   A0A452FHW9_CAPHI        Unreviewed;       906 AA.
AC   A0A452FHW9;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   RecName: Full=Mucin-2 {ECO:0008006|Google:ProtNLM};
GN   Name=MUC2 {ECO:0000313|Ensembl:ENSCHIP00000023750.1};
OS   Capra hircus (Goat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC   Caprinae; Capra.
OX   NCBI_TaxID=9925 {ECO:0000313|Ensembl:ENSCHIP00000023750.1, ECO:0000313|Proteomes:UP000291000};
RN   [1] {ECO:0000313|Ensembl:ENSCHIP00000023750.1, ECO:0000313|Proteomes:UP000291000}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T.,
RA   Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M.,
RA   Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.,
RA   Schroeder S.G., Liu G.E., Dunham M., Shendure J., Sonstegard T.S.,
RA   Phillippy A.M., Van Tassell C.P., Smith T.P.;
RT   "Polished mammalian reference genomes with single-molecule sequencing and
RT   chromosome conformation capture applied to the Capra hircus genome.";
RL   Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCHIP00000023750.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LWLT01000026; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; A0A452FHW9; -.
DR   STRING; 9925.ENSCHIP00000023750; -.
DR   Ensembl; ENSCHIT00000031610.1; ENSCHIP00000023750.1; ENSCHIG00000021181.1.
DR   GeneTree; ENSGT00940000156289; -.
DR   OMA; AGICIHW; -.
DR   Proteomes; UP000291000; Chromosome 29.
DR   Bgee; ENSCHIG00000021181; Expressed in descending colon and 8 other cell types or tissues.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   CDD; cd19941; TIL; 1.
DR   Gene3D; 2.10.90.10; Cystine-knot cytokines; 1.
DR   InterPro; IPR006207; Cys_knot_C.
DR   InterPro; IPR029034; Cystine-knot_cytokine.
DR   InterPro; IPR006208; Glyco_hormone_CN.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR   PANTHER; PTHR11339:SF399; MUCIN-2; 1.
DR   Pfam; PF08742; C8; 1.
DR   Pfam; PF00007; Cys_knot; 1.
DR   Pfam; PF00094; VWD; 1.
DR   SMART; SM00832; C8; 1.
DR   SMART; SM00041; CT; 1.
DR   SMART; SM00214; VWC; 2.
DR   SMART; SM00216; VWD; 1.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 1.
DR   PROSITE; PS01185; CTCK_1; 1.
DR   PROSITE; PS01225; CTCK_2; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 2.
DR   PROSITE; PS51233; VWFD; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000291000};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT   DOMAIN          217..400
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          543..612
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          650..717
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          801..886
FT                   /note="CTCK"
FT                   /evidence="ECO:0000259|PROSITE:PS01225"
FT   REGION          124..147
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        824..878
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT   DISULFID        828..880
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ   SEQUENCE   906 AA;  98354 MW;  F361BA267962470A CRC64;
     MQALPSDSAA QAGGAWVPGA WGSLGGLHWE LAWEAGCWAW VDVSMASPTS CSHKSGGSGR
     CGLLSPGLWG RQPECSGAAP ALVSPLAGEV VYSGTHGDTC YYVNCSLECT LEFFNWSCPS
     TPTPTPSPST PTSQESGTST TPVPGVPGCP DLDPPRQVCN ESWWMCNCTK ATCKYNNTVE
     LVKVPCEPPP MPTCTNGLAP VRVQDPDKCC WHWECDCYCT GWGDPHYVTF DGLYYSYQGN
     CTYVLVEEVS PRVDNFGIYV DNYHCDVNDE VSCPRTLIVR HETQEVLIKM VQMAPIVVQV
     QVNRQAVALP YTKFGLRVYE SGINYMVDIP ELGALVSYNG LSFSIRLPYR LFGNNTKGQC
     GTCTNSTLDD CVLPSGESID NCEVAADSWV VDDPSKPRCP HTSFTTRRPA TSPSSCASPL
     CELIKDSLFA HCHALAPPQH YYEACLFDSC YVPGSNLECA SLQTYAALCA QEGICVDWRN
     HTGGACPVTC PAHREYRACG PVEEPTCNPS EPNSTRLVEG CFCPEGTTSY APGFDVCVDL
     CGCVGPDNVP REYGEHFEFD CKDCICLEGG SGIICKPKTC RPEPRLECEE DGTYPFTEVD
     PANTCCNLTS CKCNASLCRE KPPLCSLGFQ VKSEMVPGRC CPLYSCVPKG VCVLEHAEYQ
     PGSPVYSSKC QNCVCTDRRD NATQLNVITC TYVPCNTTCS LGFELVDAPG ECCKKCEQTH
     CIISRPGQHN LVLKPGDMKS DPLNNCTFFS CMKIHNQLIS SISNITCPEF DPSTCVQGSI
     TLMPNGCCRK CIPRNETSVP CAAVPVTREI SHNGCTASVS MNDCSGSCGT FAMYSAEAQA
     LDHRCSCCRE QRTSQREVTL RCPDGGTLRY TYTHVDSCLC QDTVCELPAQ RRARRSSALG
     VAPGRG
//
DBGET integrated database retrieval system