GenomeNet

Database: UniProt
Entry: S4RLD1_PETMA
ID   S4RLD1_PETMA            Unreviewed;      1684 AA.
AC   S4RLD1;
DT   18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT   18-SEP-2013, sequence version 1.
DT   24-JAN-2024, entry version 59.
DE   RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
OS   Petromyzon marinus (Sea lamprey).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Cyclostomata;
OC   Hyperoartia; Petromyzontiformes; Petromyzontidae; Petromyzon.
OX   NCBI_TaxID=7757 {ECO:0000313|Ensembl:ENSPMAP00000006017.1};
RN   [1] {ECO:0000313|Ensembl:ENSPMAP00000006017.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (JUL-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 7757.ENSPMAP00000006017; -.
DR   Ensembl; ENSPMAT00000006045.1; ENSPMAP00000006017.1; ENSPMAG00000005423.1.
DR   GeneTree; ENSGT00940000163583; -.
DR   HOGENOM; CLU_001074_2_1_1; -.
DR   OMA; HQHIAVG; -.
DR   Proteomes; UP000245300; Unplaced.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 2.
DR   Gene3D; 2.60.120.200; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR048287; TSPN-like_N.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1064; COLLAGEN, TYPE XXVIII, ALPHA 1B; 1.
DR   Pfam; PF01410; COLFI; 2.
DR   Pfam; PF01391; Collagen; 8.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          1485..1684
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          213..266
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          324..413
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          456..1245
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1268..1290
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1323..1383
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1427..1449
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        341..413
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        467..497
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        785..800
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        869..887
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1031..1060
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1077..1097
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1684 AA;  172569 MW;  E4DECC262D94C895 CRC64;
     VDVLQRLGLP NARSSSPGSH HAELSASAVS RGVIPSRSGL IFTRGAHVEA PADSVLPRTV
     GASLALVASV RSHRVNNAFL FAVKSRRKKL QLGLQFIPGK VIIYLGNKKA VDFEYDVHDG
     RWHNFAVDIG DREISFFTAC GRERSVKRLI FRKPEKFEAD AVFFLAKMNR KAVPFEGSIC
     QLDILPVAEA AANYCSYIRK QCWHGDVYRQ RAPQPPFPDR SSAPRPLAVP PVVDRDSSSS
     AGARSSQEDL DENHTEQVSP VTPLTPHIFA TQRSAAERGG SLWTAHVDEL PLDSMGTATT
     ERIQTTLHPP DGDAALGVAF GEPRHTTASQ GHGTSRGPSD NTTEENDKRR RKVAAKNKET
     LATIRQSRKK TEGLASRRKP DRRPRPDLLD QQLESRHRGL GRNGDDGSEG YEEAGRDNLT
     LLYEWEDLIS GGTVDVEGYV DVHDPEGVVE LEVLPELRGP RGDPGPLGMV LPGPPGSPGP
     PGRRGPRGMP GPHGNPGLPG FPGSRGEKGD PGISPPMQLP GEKGDSGSIG PPGLHGPLGS
     KGSKGYQGQT GHPGEPGSPG LEGNPGVKGY PGRQGLPGPI GKPGPKGIRG FIGPAGIVGA
     QGAEGPRGVP GAPGKRGTMG RPGFPGDYGE RGPPGPDGKP GEVGIEGPKG VLGLTGDPGP
     QGNIGPPGFV GPKGFMGSLG EPGVKGDKGD EGAAGIAGEI GFPGDKGNLG FPGLPGPIGN
     PGIAGKTGES GPPGPPGSSG PEGFPGAIGS PGINGLDGPK GKPGSRGPPG PTGQKGLDGE
     EGPLGPPGIP GPPGNPGSQG FPGKPGPDGL KGETGDPGMP GKIGDTGLIG LTGPIGEMGK
     AGDKGERGGV GLPGPPGEKG AMGYPGPPGE QGPVGPEGPP GRPGLPGRRG PLGPKGKRGP
     RGEDGLPGEP GAEGNKGPVG DAGEVGIEGF IGKSGDPGDR GPLGFPGLQG TPGGVGIKGP
     IGPPGPRGAK GEMGTVGDIG TQGLPGPTGE SGLQGEKGEK GDIGPLGGEG EPGLEGPRGL
     PGPVGDDGPQ GKDGIKGEKG DLGLNGEDGE RGDIGDKGKE GAIGDPGIVG MRGMEGKLGK
     LGERGKPGKK GDKGAMGHIG EPGKAGQSGS KGNQGPVGAR GSRGPVGAPG AMGAEGEDGL
     PGYPGHQGPI GPSGPPGIKG EKGQAGEPGT QFGPPGPKGL MVQTIEVPDP GSLGEESSEK
     HFSLSFTQGP VGPHGPRGLP GWPGPDGERG LEGAPGKDGK PGPPGDNYIN SLGCFAAFCG
     FQNRQGDAGS VGQPGRTGLA GAQGKLGKPG QPGAAGLVGA KDPRWLPFQS GKMTEPIPGV
     PQGDRGMDGQ PGIPGQRGIL GSDGPQGNKG ETGEKGQLRR GTVRDRGGSG VSEFPPPRGH
     KSNAMLVGVE HPGGYIGRGG VWGGLGRAAC CLLALCATAP KWVPGGHVHK SPTKPPTHPA
     THPPTLRQHP ARYRQHLFRT HIRLMSNIAS KCPTHSATVY PLNDAEFFLS LVERSELLAQ
     VSGPMLTKPC NRLHYYRGFL TCNFHSAGNY WIDPNLGCSS DSIDVFCNFT VGGQTCLKPL
     AMSKLDFGVG KIQMNFLHLL SSEAVQPVTV HCRGGPAWED PSSPHPHRHA ARFRAWSGRL
     YEPGGLLEPR VLHDGCRMDD GKWHKTEFLF TTQDVNHLPI VDVHFTHQKP DSQYHLEVGP
     VCFL
//