ID A0A226MFN5_CALSU Unreviewed; 749 AA.
AC A0A226MFN5;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=Mucin-2 {ECO:0008006|Google:ProtNLM};
GN ORFNames=ASZ78_003200 {ECO:0000313|EMBL:OXB53829.1};
OS Callipepla squamata (Scaled quail).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Odontophoridae;
OC Callipepla.
OX NCBI_TaxID=9009 {ECO:0000313|EMBL:OXB53829.1, ECO:0000313|Proteomes:UP000198323};
RN [1] {ECO:0000313|EMBL:OXB53829.1, ECO:0000313|Proteomes:UP000198323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texas {ECO:0000313|EMBL:OXB53829.1,
RC ECO:0000313|Proteomes:UP000198323};
RC TISSUE=Leg muscle {ECO:0000313|EMBL:OXB53829.1};
RA Oldeschulte D.L., Halley Y.A., Bhattarai E.K., Brashear W.A., Hill J.,
RA Metz R.P., Johnson C.D., Rollins D., Peterson M.J., Bickhart D.M.,
RA Decker J.E., Seabury C.M.;
RT "Disparate Historic Effective Population Sizes Predicted by Modern Levels
RT of Genome Diversity for the Scaled Quail (Callipepla squamata) and the
RT Northern Bobwhite (Colinus virginianus): Inferences from First and Second
RT Generation Draft Genome Assemblies for Sympatric New World Quail.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXB53829.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFN01001056; OXB53829.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A226MFN5; -.
DR STRING; 9009.A0A226MFN5; -.
DR Proteomes; UP000198323; Unassembled WGS sequence.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF399; MUCIN-2; 1.
DR Pfam; PF08742; C8; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00832; C8; 1.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 2.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 1.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000198323};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 30..213
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 373..444
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 482..549
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 632..719
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 721..749
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 646..695
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT DISULFID 657..711
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT DISULFID 661..713
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 749 AA; 83144 MW; 9F310B815752CC88 CRC64;
MGLLLCKSSM MINAVGTGSV ILTWLYVFSG YCTGWGDPHY LTFDGLYYSY QGNCTYILVE
EIEKRVDNFG VYIDNYHCDT RDVVSCPRAL IVRHETQEVR IVTAKPNTLE VEVTVNKQAV
ALPYKKFGLS VYQSGINRVV EIPELKVNVT FNGLSFSIRM PYSLFGNNTQ GQCGTCNNNT
ADDCRLPNGN IAESCETMAD HWQVVDPSKP QCSPGLIPTK APSTTTGQPC KESSLCELLW
GSVFEKCHDV VKPDRYYAAC VFDSCTLPDL DLECSSLQIY ASVCADQNVC IDWRSHTNGV
CSYECPKHKE YRACGPIQEA TCKSSPQNGT SVKQVEGCFC PNGTMLFDSG VDVCVNTCGE
FYYFNWIYFD IFTGCVGLDM IPREFGEKFT ADCQDCICLE GGNGIVCEPH KCTEQNKRSC
TGKGFYEVSE VNSEDPCCPI FTCKCNTSLC TSKPPKCTLG FEVYTYIPSD ECCPQYQCVP
KNVCVHQNAE FLPNSSVFVD KCHNCFCTNE VNISTQLNVI SCEHIPCNTY CKPGYERQDV
EGECCGKCVQ TKCIVHTSHS SSLILNPGEF VNDPYNNCTI YSCTSLKNQL ISSTSEITCP
AFNEKSCKPG TVTFLPNGCC KTCALLDSPT PCSVRERKDF IVYKNCRSPE RVVLTECEGT
CGTFSLYSVE ASSMEHSCSC CKEVRTSMKE VELKCPSGDS IKHKYVYVES CGCQDTQCVT
SESSESQSTE ENDESTQNHR KRAISFTSK
//