GenomeNet

Database: UniProt
Entry: S4RWI7_PETMA
LinkDB: S4RWI7_PETMA
Original site: S4RWI7_PETMA 
ID   S4RWI7_PETMA            Unreviewed;       888 AA.
AC   S4RWI7;
DT   18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT   18-SEP-2013, sequence version 1.
DT   27-MAR-2024, entry version 59.
DE   SubName: Full=Collagen, type II, alpha 1a {ECO:0000313|Ensembl:ENSPMAP00000009577.1};
OS   Petromyzon marinus (Sea lamprey).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Cyclostomata;
OC   Hyperoartia; Petromyzontiformes; Petromyzontidae; Petromyzon.
OX   NCBI_TaxID=7757 {ECO:0000313|Ensembl:ENSPMAP00000009577.1, ECO:0000313|Proteomes:UP000245300};
RN   [1] {ECO:0000313|Ensembl:ENSPMAP00000009577.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; S4RWI7; -.
DR   STRING; 7757.ENSPMAP00000009577; -.
DR   Ensembl; ENSPMAT00000009617.1; ENSPMAP00000009577.1; ENSPMAG00000008685.1.
DR   GeneTree; ENSGT00940000155224; -.
DR   HOGENOM; CLU_001074_19_1_1; -.
DR   OMA; HIRMGET; -.
DR   Proteomes; UP000245300; Unplaced.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1108; ENDOSTATIN DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 9.
DR   SMART; SM00038; COLFI; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000245300};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          653..888
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          1..622
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        60..74
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        603..617
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   888 AA;  86057 MW;  51AAF513893AE6DF CRC64;
     MGFPGPKGAG GEPGKPGEKG IPGAPGLRGL PGKDGETGAQ GPPGPAGPAG ERGEGGPPGP
     PGFQGLPGPS GPPGEAGKPG AQGVPGEAGK VGVSGARGER GFPGERGAPG PQGLQGARGL
     PGTPGTDGPK GATGPGGPPG AQGPPGLQGM PGERGATGIS GAKGDRGDSG EKGPEGAPGK
     DGGRGVTGPI GPPGPAGAHG DKGDAGPAGP TGPSGARGAP GERGETGPPG PAGFAGPPGS
     DGQPGAKGEQ GETGQKGDAG APGPQGPSGA PGPQGPNGVS GPKGARGAQG PPPGARALCG
     PAVDVAPVPF QGNPGSPGPA GPNGKDGPKG IRGDSGPVGR AGEPGLQGPI GAPGEKGDAG
     EDGPPGPDGP PGAQGLSGQR GIVGLPGQRG ERGFPGLPGP AGEPGKQGST GASGERGPPG
     PVGPPGLSGP PGEQGREGNA GSDGAPGRDG AAGAKGERGE AGAAGQPGAP GSPGAPGPVG
     PTGKNGARGD QGPQGPMGPP GPSGARGLPG PQGPRGDKGE TGEAGERGIK GHRGFTGLQG
     LPGPPGSSGD QGLAGSVGPS GPRGLSGPTG PPGKDGNNGQ SGPIGPPGPR GRTGETGPAG
     PSGPAGLPGP PGPPGPGIDL SAIAGIGQTD KAPDPLRYYR SDQALPELRQ HDAEVDASIK
     SLSGQIEGLR SPEGTRKNPA RTCRDLKLCH PEWESGNYWI DPNQGCTLDA LKVFCNMETG
     ETCVYANPQT ISRKNWWQSK SADRKHVWFG EAMSGGLHFN YGDDNLPANT AAIQMTFLRL
     LSMEAYDLPA AANITYHCKN SIAYMDEETG NLKKALLLQG SNDMEIRAEG NSRFTYNVLE
     DSCTKHTGEW GRTVVEYRTQ KTSRLPFVDV APMDVGGPDQ EFGLDIGP
//
DBGET integrated database retrieval system