ID S4RLD1_PETMA Unreviewed; 1684 AA.
AC S4RLD1;
DT 18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2013, sequence version 1.
DT 24-JAN-2024, entry version 59.
DE RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
OS Petromyzon marinus (Sea lamprey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Cyclostomata;
OC Hyperoartia; Petromyzontiformes; Petromyzontidae; Petromyzon.
OX NCBI_TaxID=7757 {ECO:0000313|Ensembl:ENSPMAP00000006017.1};
RN [1] {ECO:0000313|Ensembl:ENSPMAP00000006017.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 7757.ENSPMAP00000006017; -.
DR Ensembl; ENSPMAT00000006045.1; ENSPMAP00000006017.1; ENSPMAG00000005423.1.
DR GeneTree; ENSGT00940000163583; -.
DR HOGENOM; CLU_001074_2_1_1; -.
DR OMA; HQHIAVG; -.
DR Proteomes; UP000245300; Unplaced.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1064; COLLAGEN, TYPE XXVIII, ALPHA 1B; 1.
DR Pfam; PF01410; COLFI; 2.
DR Pfam; PF01391; Collagen; 8.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1485..1684
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 213..266
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 324..413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 456..1245
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1268..1290
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1323..1383
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1427..1449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 341..413
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 467..497
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 785..800
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 869..887
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1031..1060
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1077..1097
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1684 AA; 172569 MW; E4DECC262D94C895 CRC64;
VDVLQRLGLP NARSSSPGSH HAELSASAVS RGVIPSRSGL IFTRGAHVEA PADSVLPRTV
GASLALVASV RSHRVNNAFL FAVKSRRKKL QLGLQFIPGK VIIYLGNKKA VDFEYDVHDG
RWHNFAVDIG DREISFFTAC GRERSVKRLI FRKPEKFEAD AVFFLAKMNR KAVPFEGSIC
QLDILPVAEA AANYCSYIRK QCWHGDVYRQ RAPQPPFPDR SSAPRPLAVP PVVDRDSSSS
AGARSSQEDL DENHTEQVSP VTPLTPHIFA TQRSAAERGG SLWTAHVDEL PLDSMGTATT
ERIQTTLHPP DGDAALGVAF GEPRHTTASQ GHGTSRGPSD NTTEENDKRR RKVAAKNKET
LATIRQSRKK TEGLASRRKP DRRPRPDLLD QQLESRHRGL GRNGDDGSEG YEEAGRDNLT
LLYEWEDLIS GGTVDVEGYV DVHDPEGVVE LEVLPELRGP RGDPGPLGMV LPGPPGSPGP
PGRRGPRGMP GPHGNPGLPG FPGSRGEKGD PGISPPMQLP GEKGDSGSIG PPGLHGPLGS
KGSKGYQGQT GHPGEPGSPG LEGNPGVKGY PGRQGLPGPI GKPGPKGIRG FIGPAGIVGA
QGAEGPRGVP GAPGKRGTMG RPGFPGDYGE RGPPGPDGKP GEVGIEGPKG VLGLTGDPGP
QGNIGPPGFV GPKGFMGSLG EPGVKGDKGD EGAAGIAGEI GFPGDKGNLG FPGLPGPIGN
PGIAGKTGES GPPGPPGSSG PEGFPGAIGS PGINGLDGPK GKPGSRGPPG PTGQKGLDGE
EGPLGPPGIP GPPGNPGSQG FPGKPGPDGL KGETGDPGMP GKIGDTGLIG LTGPIGEMGK
AGDKGERGGV GLPGPPGEKG AMGYPGPPGE QGPVGPEGPP GRPGLPGRRG PLGPKGKRGP
RGEDGLPGEP GAEGNKGPVG DAGEVGIEGF IGKSGDPGDR GPLGFPGLQG TPGGVGIKGP
IGPPGPRGAK GEMGTVGDIG TQGLPGPTGE SGLQGEKGEK GDIGPLGGEG EPGLEGPRGL
PGPVGDDGPQ GKDGIKGEKG DLGLNGEDGE RGDIGDKGKE GAIGDPGIVG MRGMEGKLGK
LGERGKPGKK GDKGAMGHIG EPGKAGQSGS KGNQGPVGAR GSRGPVGAPG AMGAEGEDGL
PGYPGHQGPI GPSGPPGIKG EKGQAGEPGT QFGPPGPKGL MVQTIEVPDP GSLGEESSEK
HFSLSFTQGP VGPHGPRGLP GWPGPDGERG LEGAPGKDGK PGPPGDNYIN SLGCFAAFCG
FQNRQGDAGS VGQPGRTGLA GAQGKLGKPG QPGAAGLVGA KDPRWLPFQS GKMTEPIPGV
PQGDRGMDGQ PGIPGQRGIL GSDGPQGNKG ETGEKGQLRR GTVRDRGGSG VSEFPPPRGH
KSNAMLVGVE HPGGYIGRGG VWGGLGRAAC CLLALCATAP KWVPGGHVHK SPTKPPTHPA
THPPTLRQHP ARYRQHLFRT HIRLMSNIAS KCPTHSATVY PLNDAEFFLS LVERSELLAQ
VSGPMLTKPC NRLHYYRGFL TCNFHSAGNY WIDPNLGCSS DSIDVFCNFT VGGQTCLKPL
AMSKLDFGVG KIQMNFLHLL SSEAVQPVTV HCRGGPAWED PSSPHPHRHA ARFRAWSGRL
YEPGGLLEPR VLHDGCRMDD GKWHKTEFLF TTQDVNHLPI VDVHFTHQKP DSQYHLEVGP
VCFL
//