ID F7EH21_ORNAN Unreviewed; 656 AA.
AC F7EH21;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 02-DEC-2020, sequence version 2.
DT 27-MAR-2024, entry version 86.
DE RecName: Full=Versican core protein {ECO:0000256|ARBA:ARBA00044099};
DE AltName: Full=Chondroitin sulfate proteoglycan core protein 2 {ECO:0000256|ARBA:ARBA00044230};
DE AltName: Full=Large fibroblast proteoglycan {ECO:0000256|ARBA:ARBA00044263};
DE AltName: Full=PG-M {ECO:0000256|ARBA:ARBA00044266};
OS Ornithorhynchus anatinus (Duckbill platypus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Monotremata; Ornithorhynchidae; Ornithorhynchus.
OX NCBI_TaxID=9258 {ECO:0000313|Ensembl:ENSOANP00000023253.2, ECO:0000313|Proteomes:UP000002279};
RN [1] {ECO:0000313|Ensembl:ENSOANP00000023253.2, ECO:0000313|Proteomes:UP000002279}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000023253.2,
RC ECO:0000313|Proteomes:UP000002279};
RX PubMed=18464734; DOI=10.1038/nature06936;
RA Warren W.C., Hillier L.W., Marshall Graves J.A., Birney E., Ponting C.P.,
RA Grutzner F., Belov K., Miller W., Clarke L., Chinwalla A.T., Yang S.P.,
RA Heger A., Locke D.P., Miethke P., Waters P.D., Veyrunes F., Fulton L.,
RA Fulton B., Graves T., Wallis J., Puente X.S., Lopez-Otin C., Ordonez G.R.,
RA Eichler E.E., Chen L., Cheng Z., Deakin J.E., Alsop A., Thompson K.,
RA Kirby P., Papenfuss A.T., Wakefield M.J., Olender T., Lancet D.,
RA Huttley G.A., Smit A.F., Pask A., Temple-Smith P., Batzer M.A.,
RA Walker J.A., Konkel M.K., Harris R.S., Whittington C.M., Wong E.S.,
RA Gemmell N.J., Buschiazzo E., Vargas Jentzsch I.M., Merkel A., Schmitz J.,
RA Zemann A., Churakov G., Kriegs J.O., Brosius J., Murchison E.P.,
RA Sachidanandam R., Smith C., Hannon G.J., Tsend-Ayush E., McMillan D.,
RA Attenborough R., Rens W., Ferguson-Smith M., Lefevre C.M., Sharp J.A.,
RA Nicholas K.R., Ray D.A., Kube M., Reinhardt R., Pringle T.H., Taylor J.,
RA Jones R.C., Nixon B., Dacheux J.L., Niwa H., Sekita Y., Huang X., Stark A.,
RA Kheradpour P., Kellis M., Flicek P., Chen Y., Webber C., Hardison R.,
RA Nelson J., Hallsworth-Pepin K., Delehaunty K., Markovic C., Minx P.,
RA Feng Y., Kremitzki C., Mitreva M., Glasscock J., Wylie T., Wohldmann P.,
RA Thiru P., Nhan M.N., Pohl C.S., Smith S.M., Hou S., Nefedov M.,
RA de Jong P.J., Renfree M.B., Mardis E.R., Wilson R.K.;
RT "Genome analysis of the platypus reveals unique signatures of evolution.";
RL Nature 453:175-183(2008).
RN [2] {ECO:0000313|Ensembl:ENSOANP00000023253.2}
RP IDENTIFICATION.
RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000023253.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: May play a role in intercellular signaling and in connecting
CC cells with the extracellular matrix. May take part in the regulation of
CC cell motility, growth and differentiation. Binds hyaluronic acid.
CC {ECO:0000256|ARBA:ARBA00043896}.
CC -!- SUBUNIT: Interacts with FBLN1. {ECO:0000256|ARBA:ARBA00044030}.
CC -!- SUBCELLULAR LOCATION: Cell projection, cilium, photoreceptor outer
CC segment {ECO:0000256|ARBA:ARBA00004504}. Secreted, extracellular space,
CC extracellular matrix, interphotoreceptor matrix
CC {ECO:0000256|ARBA:ARBA00004593}.
CC -!- SIMILARITY: Belongs to the aggrecan/versican proteoglycan family.
CC {ECO:0000256|ARBA:ARBA00006838}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F7EH21; -.
DR STRING; 9258.ENSOANP00000023253; -.
DR Ensembl; ENSOANT00000023257.3; ENSOANP00000023253.2; ENSOANG00000014761.3.
DR eggNOG; ENOG502QRBE; Eukaryota.
DR GeneTree; ENSGT00940000156102; -.
DR HOGENOM; CLU_000303_1_1_1; -.
DR InParanoid; F7EH21; -.
DR OMA; CFGDMMG; -.
DR TreeFam; TF332134; -.
DR Proteomes; UP000002279; Chromosome 1.
DR Bgee; ENSOANG00000014761; Expressed in fibroblast and 6 other cell types or tissues.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0072534; C:perineuronal net; IBA:GO_Central.
DR GO; GO:0045202; C:synapse; IBA:GO_Central.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR GO; GO:0005540; F:hyaluronic acid binding; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:InterPro.
DR GO; GO:0007417; P:central nervous system development; IBA:GO_Central.
DR GO; GO:0010001; P:glial cell differentiation; IBA:GO_Central.
DR GO; GO:0002052; P:positive regulation of neuroblast proliferation; IBA:GO_Central.
DR GO; GO:0001501; P:skeletal system development; IBA:GO_Central.
DR CDD; cd00033; CCP; 1.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd05901; Ig_Versican; 1.
DR CDD; cd03517; Link_domain_CSPGs_modules_1_3; 1.
DR CDD; cd03520; Link_domain_CSPGs_modules_2_4; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 3.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR018378; C-type_lectin_CS.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR013106; Ig_V-set.
DR InterPro; IPR000538; Link_dom.
DR InterPro; IPR035976; Sushi/SCR/CCP_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR PANTHER; PTHR22804; AGGRECAN/VERSICAN PROTEOGLYCAN; 1.
DR PANTHER; PTHR22804:SF6; VERSICAN CORE PROTEIN; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF00059; Lectin_C; 1.
DR Pfam; PF00084; Sushi; 1.
DR Pfam; PF07686; V-set; 1.
DR Pfam; PF00193; Xlink; 2.
DR PRINTS; PR01265; LINKMODULE.
DR SMART; SM00032; CCP; 1.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00409; IG; 1.
DR SMART; SM00406; IGv; 1.
DR SMART; SM00445; LINK; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 3.
DR SUPFAM; SSF57535; Complement control module/SCR domain; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00615; C_TYPE_LECTIN_1; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS01241; LINK_1; 1.
DR PROSITE; PS50963; LINK_2; 2.
DR PROSITE; PS50923; SUSHI; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Lectin {ECO:0000256|ARBA:ARBA00022734};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000002279};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP};
KW Sushi {ECO:0000256|ARBA:ARBA00022659, ECO:0000256|PROSITE-
KW ProRule:PRU00302}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..656
FT /note="Versican core protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5028288310"
FT DOMAIN 33..147
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 151..246
FT /note="Link"
FT /evidence="ECO:0000259|PROSITE:PS50963"
FT DOMAIN 252..348
FT /note="Link"
FT /evidence="ECO:0000259|PROSITE:PS50963"
FT DOMAIN 349..385
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 387..423
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 436..550
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 554..614
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DISULFID 197..218
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00323"
FT DISULFID 295..316
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00323"
FT DISULFID 375..384
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 413..422
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 556..599
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
FT DISULFID 585..612
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00302"
SQ SEQUENCE 656 AA; 74245 MW; DCC21FDB1A7696FB CRC64;
MLLNIKSILW ICSTLIVTNA LHKVQVEKSP PVKGSLSGKV SLPCHFSTMP TLPPSYNNTS
EFLRIKWSKV EEDKIGKDLK ETTVLVAQNG NIKIGQGYKG RVSVPTHPED VGDASLTVVK
LRASDAGFYR CDVMYGIEDT QDTVSLAVEG VVFHYRASSS RYTLNFKEAQ EVCLANGAVI
ASPEQLKAAY EDGFEQCDAG WLSDQTVRYP IRTPRAGCYG DMMGKEGVRT YGFRVPDERY
DVYCYVDHLE GEVFHITSPN KLTFEEAEEE CVNQDGRLAT VGELHAAWRN GFDKCNYGWL
ADGSVRHPVT VARAQCGGGL LGVRTLYRFE NQTGFPAPDS TFDAYCHKRL NLCKTSPCLN
GGTCYERDSS YVCTCLPGFS GDQCEYDFDE CQSNPCRNGA TCVDGINTFT CLCLPSYVGA
LCEQDTETCD YGWHKFQGQC YKYFAHRRTW DAAERECRLQ GAHLTSILSH EEQLFVNRVG
HDYQWIGLND KMFEHDFRWT DGSTLQYENW RPNQPDSFFS AGEDCVVIIW HENGQWNDVP
CNYHLTYTCK KGTVACGQPP VVENAKTFGK MKPRYEINSL IRYHCKDGFI QRHLPTIRCL
GNGRWALPKI TCLNPSANQR TYSKKHFKNS SSAKDNSINS PKHYHPWISR WQDSRR
//