ID G5C6A1_HETGA Unreviewed; 885 AA.
AC G5C6A1;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Mucin-2 {ECO:0000313|EMBL:EHB17062.1};
DE Flags: Fragment;
GN ORFNames=GW7_17690 {ECO:0000313|EMBL:EHB17062.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB17062.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB17062.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH173516; EHB17062.1; -; Genomic_DNA.
DR AlphaFoldDB; G5C6A1; -.
DR STRING; 10181.G5C6A1; -.
DR eggNOG; KOG1216; Eukaryota.
DR InParanoid; G5C6A1; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR Gene3D; 2.10.90.10; Cystine-knot cytokines; 1.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR029034; Cystine-knot_cytokine.
DR InterPro; IPR006208; Glyco_hormone_CN.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF399; MUCIN-2; 1.
DR Pfam; PF08742; C8; 1.
DR Pfam; PF00007; Cys_knot; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00832; C8; 1.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 2.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 1.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS01208; VWFC_1; 2.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000006813};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 188..371
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 520..589
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 627..694
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 778..863
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 1..29
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 13..29
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 801..855
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT DISULFID 805..857
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EHB17062.1"
FT NON_TER 885
FT /evidence="ECO:0000313|EMBL:EHB17062.1"
SQ SEQUENCE 885 AA; 96244 MW; 48E03349CAB3DF2C CRC64;
ASATPTPPPV PTTTAVTSTQ TSFSSIEPTT LETTPIKSSS LGWETGASTA VYCCYLNGTY
YDPGDLVYNG THGNTCYYVN CSLDCTLQFF NWSCPSTPSP TPSTLSTPAS SKPVTTNPSG
CLDFIPPREE NESWWLCNCT MAICKHDNVV EIVEVECKPP PMPTCSNGLK PVQVMDPDGC
CWHWECDCYC TGWGDPHFVT FDGLYYSYQG NCTYVLVEEI TSTVDSFGVY IDNYHCDVND
KVSCPRTIIV RHETQEVLIK TLQMTPIKVQ VQVNKQVVAL PYRKYGLKVY ESGINFVVDI
PELGALISYN GLSFSIRLPF QRFGNNTKGQ CGTCTNSTAD DCVLPGGEVT SNCELAADQW
VVNDPSKPLC AHTDFTTQRP AVTPTLLENC TSSRICNLIK DSLFAQCHPW VPPQHYYEAC
VFDSCFVPDS GMECASVQAY AALCSQQGVC VDWRGHTDKA CAVTCPAHRR YQACGPTEER
APPPLSSPQN SSILVEGCFC PEGTTNYAPG YDICVKMCGC VGPDNVPREF GERFVFDCKD
CVCLEGGSGI ACQPKKCSQE AQPTCEAEGT YLVTEVNPAD TCCNVSSCKC NTSLCKKEPP
SCSLGFELKS NMVPGQCCPV YSCEPKQVCV HKNAEYQPGS PVYSSKCQNC MCTQEVNSST
QLNIIACTHV PCNVSCDPGF EPVEAPGECC RKCQQTHCVI SLPSGQPVIL KPGAIERNSD
NNCTFFSCVK IHNQLISSVS NITCPDFDPR SCVPGSITLM PNGCCRKCIL LNETRVPCST
IPVVKEISYA GCAKNVSTNY CSGSCGTFAM YSASAQALDH LCSCCKEETT VQRQVALDCP
NGGSLNHTYT HIESCLCQES VCGLPQAQQA QQARVRRASP QLLDR
//