ID A0A1S2ZA90_ERIEU Unreviewed; 1487 AA.
AC A0A1S2ZA90;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Collagen alpha-1(II) chain {ECO:0000313|RefSeq:XP_007516290.1};
GN Name=COL2A1 {ECO:0000313|RefSeq:XP_007516290.1};
OS Erinaceus europaeus (Western European hedgehog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Eulipotyphla; Erinaceidae; Erinaceinae;
OC Erinaceus.
OX NCBI_TaxID=9365 {ECO:0000313|Proteomes:UP000079721, ECO:0000313|RefSeq:XP_007516290.1};
RN [1] {ECO:0000313|RefSeq:XP_007516290.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_007516290.1; XM_007516228.1.
DR STRING; 9365.ENSEEUP00000009876; -.
DR GeneID; 103107428; -.
DR CTD; 1280; -.
DR eggNOG; KOG3544; Eukaryota.
DR InParanoid; A0A1S2ZA90; -.
DR OrthoDB; 2970887at2759; -.
DR Proteomes; UP000079721; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 7.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|RefSeq:XP_007516290.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000079721};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..1487
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5010342456"
FT DOMAIN 32..90
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1253..1487
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 96..1234
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 133..149
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 157..174
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 351..365
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 432..446
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 910..924
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1200..1217
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1487 AA; 142083 MW; 9218FD61EE15E522 CRC64;
MTRLGAPQKL VLLTLLVAAV LRCQGQDVQE AGSCVQDGQR YNDKDVWKPE PCQICVCDTG
TVLCDDIICE EAKDCLNPEI PFGECCPICP TELATASGKP GPKGQKGEPG DIKDIVGPKG
PPGPQGPAGE QGPRGDRGDK GEKGAPGPRG RDGEPGTPGN PGPPGPPGPP GPPGLGGNFA
AQMAGGFEDK AGGAQMGVMQ GPMGPMGPRG PPGPAGSPGP QGFQGNPGEP GEPGVSGPMG
PRGPPGPPGK AGDDGEAGKP GKSGERGPPG PQGARGFPGT PGLPGVKGHR GYPGLDGAKG
EAGAPGVKGE SGSPGENGSP GPMGPRGLPG ERGRTGPAGA AGARGNDGQP GPAGPPGPVG
PAGGPGFPGA PGAKGEAGPT GARGPEGAQG SRGEPGTPGS PGPAGASGNP GTDGIPGAKG
SAGAPGIAGA PGFPGPRGPP GPQGATGPLG PKGQTGEPGI AGFKGEQGPK GETGPAGPQG
APGPAGEEGK RGARGEPGGA GPIGPPGERG APGNRGFPGQ DGLAGPKGAP GERGPSGLAG
PKGANGDPGR PGEPGLPGAR GLTGRPGDAG PQGKLGPSGA PGEDGRPGPP GPQGARGQPG
VMGFPGPKGA NGEPGKAGEK GLAGAPGLRG LPGKDGETGA AGPPGPAGPA GERGEQGAPG
PSGFQGLPGP PGPPGEGGKP GDQGIPGEAG APGLVGPRGE RGFPGERGSP GAQGLQGARG
LPGTPGNDGP KGAAGAAGPP GAQGPPGLQG MPGERGAAGI AGPKGDRGDV GEKGPEGAPG
KDGGRGLTGP IGPPGPAGAN GEKGEVGPPG PSGTAGARGA PGERGETGPP GPAGFAGPPG
ADGQPGAKGE QGEAGQKGDA GAPGPQGPSG APGPQGPTGV TGPKGARGAQ GPPGATGFPG
AAGRVGPPGS NGNPGPPGPP GPSGKDGPKG VRGDIGPPGR AGDPGLQGPA GTPGEKGEPG
DDGPSGPDGP PGPQGLAGQR GIVGLPGQRG ERGFPGLPGP SGEPGKQGAP GSSGDRGPPG
PVGPPGLTGP AGEPGREGSP GADGPPGRDG AAGVKGDRGE TGPLGAPGAP GPPGSPGPAG
PTGKQGDRGE AGAQGPMGPS GPAGARGIPG PQGPRGDKGE TGEAGERGLK GHRGFTGLQG
LPGPPGPSGD QGASGPAGPS GPRGPPGPVG PSGKDGANGI PGPIGPPGPR GRSGETGPAG
PPGNPGPPGP PGPPGPGIDM SAFAGLGQRE KGPDPLQYMR ADQAAGDLRQ HDVEVDATLK
SLNNQIESIR SPEGSRKNPA RTCRDLQLCH PEWKSGDYWI DPNQGCTLDA MKVFCNMETG
ETCVYPNPAN VPKKNWWSSK SKDRKHVWFG ETISGGFHFS YGDDNLPPNT ANVQMTFLRL
LSTEGSQNIT YHCKNSIAYM DEAAGNLKKA LLIQGSNDVE IRAEGNSRFT YNVLRDGCTR
HTGKWGKTVI EYRSQKTSRL PIIDIAPMDI GGPEQEFGVD IGPVCFL
//