ID S4RWI7_PETMA Unreviewed; 888 AA.
AC S4RWI7;
DT 18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2013, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=Collagen, type II, alpha 1a {ECO:0000313|Ensembl:ENSPMAP00000009577.1};
OS Petromyzon marinus (Sea lamprey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Cyclostomata;
OC Hyperoartia; Petromyzontiformes; Petromyzontidae; Petromyzon.
OX NCBI_TaxID=7757 {ECO:0000313|Ensembl:ENSPMAP00000009577.1, ECO:0000313|Proteomes:UP000245300};
RN [1] {ECO:0000313|Ensembl:ENSPMAP00000009577.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; S4RWI7; -.
DR STRING; 7757.ENSPMAP00000009577; -.
DR Ensembl; ENSPMAT00000009617.1; ENSPMAP00000009577.1; ENSPMAG00000008685.1.
DR GeneTree; ENSGT00940000155224; -.
DR HOGENOM; CLU_001074_19_1_1; -.
DR OMA; HIRMGET; -.
DR Proteomes; UP000245300; Unplaced.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1108; ENDOSTATIN DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 9.
DR SMART; SM00038; COLFI; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000245300};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 653..888
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 1..622
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 60..74
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 603..617
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 888 AA; 86057 MW; 51AAF513893AE6DF CRC64;
MGFPGPKGAG GEPGKPGEKG IPGAPGLRGL PGKDGETGAQ GPPGPAGPAG ERGEGGPPGP
PGFQGLPGPS GPPGEAGKPG AQGVPGEAGK VGVSGARGER GFPGERGAPG PQGLQGARGL
PGTPGTDGPK GATGPGGPPG AQGPPGLQGM PGERGATGIS GAKGDRGDSG EKGPEGAPGK
DGGRGVTGPI GPPGPAGAHG DKGDAGPAGP TGPSGARGAP GERGETGPPG PAGFAGPPGS
DGQPGAKGEQ GETGQKGDAG APGPQGPSGA PGPQGPNGVS GPKGARGAQG PPPGARALCG
PAVDVAPVPF QGNPGSPGPA GPNGKDGPKG IRGDSGPVGR AGEPGLQGPI GAPGEKGDAG
EDGPPGPDGP PGAQGLSGQR GIVGLPGQRG ERGFPGLPGP AGEPGKQGST GASGERGPPG
PVGPPGLSGP PGEQGREGNA GSDGAPGRDG AAGAKGERGE AGAAGQPGAP GSPGAPGPVG
PTGKNGARGD QGPQGPMGPP GPSGARGLPG PQGPRGDKGE TGEAGERGIK GHRGFTGLQG
LPGPPGSSGD QGLAGSVGPS GPRGLSGPTG PPGKDGNNGQ SGPIGPPGPR GRTGETGPAG
PSGPAGLPGP PGPPGPGIDL SAIAGIGQTD KAPDPLRYYR SDQALPELRQ HDAEVDASIK
SLSGQIEGLR SPEGTRKNPA RTCRDLKLCH PEWESGNYWI DPNQGCTLDA LKVFCNMETG
ETCVYANPQT ISRKNWWQSK SADRKHVWFG EAMSGGLHFN YGDDNLPANT AAIQMTFLRL
LSMEAYDLPA AANITYHCKN SIAYMDEETG NLKKALLLQG SNDMEIRAEG NSRFTYNVLE
DSCTKHTGEW GRTVVEYRTQ KTSRLPFVDV APMDVGGPDQ EFGLDIGP
//