GenomeNet

Database: RefSeq
Entry: XP_031440237
LinkDB: XP_031440237
Original site: XP_031440237 
LOCUS       XP_031440237            1256 aa            linear   VRT 27-JUL-2021
DEFINITION  collagen alpha-1(XVIII) chain isoform X1 [Clupea harengus].
ACCESSION   XP_031440237
VERSION     XP_031440237.1
DBLINK      BioProject: PRJNA587748
DBSOURCE    REFSEQ: accession XM_031584377.2
KEYWORDS    RefSeq.
SOURCE      Clupea harengus (Atlantic herring)
  ORGANISM  Clupea harengus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Actinopterygii; Neopterygii; Teleostei; Clupei; Clupeiformes;
            Clupeoidei; Clupeidae; Clupea.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_045168.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Clupea harengus Annotation Release
                                           102
            Annotation Version          :: 102
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 9.0
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..1256
                     /organism="Clupea harengus"
                     /db_xref="taxon:7950"
                     /chromosome="17"
     Protein         1..1256
                     /product="collagen alpha-1(XVIII) chain isoform X1"
                     /calculated_mol_wt=128969
     Region          31..220
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cl22861"
                     /db_xref="CDD:473984"
     Region          <370..>529
                     /region_name="gly_rich_SclB"
                     /note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
                     NF038329"
                     /db_xref="CDD:468478"
     Region          <547..>794
                     /region_name="gly_rich_SclB"
                     /note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
                     NF038329"
                     /db_xref="CDD:468478"
     Region          <706..>912
                     /region_name="gly_rich_SclB"
                     /note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
                     NF038329"
                     /db_xref="CDD:468478"
     Region          981..1028
                     /region_name="Collagen_trimer"
                     /note="Collagen trimerization domain; pfam20010"
                     /db_xref="CDD:466257"
     Region          1062..1228
                     /region_name="Endostatin-like"
                     /note="Endostatin-like domain; the angiogenesis inhibitor
                     endostatin is a C-terminal fragment of collagen XV/XVIII,
                     a proteoglycan/collagen found in vessel walls and basement
                     membranes; this domain has a compact globular fold similar
                     to that of C-type lectins; cl23985"
                     /db_xref="CDD:474121"
     Site            order(1091,1100,1106,1115..1116,1119,1180..1181,1191)
                     /site_type="other"
                     /note="putative ligand binding site [chemical binding]"
                     /db_xref="CDD:238151"
     CDS             1..1256
                     /gene="col15a1a"
                     /coded_by="XM_031584377.2:342..4112"
                     /db_xref="GeneID:105890075"
ORIGIN      
        1 mmsrgslwsf alflwychpt tafleergtk gqldltelig vplppdvafi tgfegfpays
       61 fgpganvgrl arsfvpdpff rdfaiivtak pttrqggvlf aitdamqkvv hlgvmlaave
      121 dgtqrvvlyy tepgaattqe aasfkmgelt grwarftlav qghevrlymd ceeyhrvafq
      181 rsagqltfqp ssgifvgnag ntglerfvgs iqqlvltpdp rapddqceed dpyasgygsg
      241 ddlddtermd evkklveere ytmpedfltm pvqappteea ssddedleeg tsgqaidlpd
      301 eratehtarv dtvhretsln pglkgekgda gsagppgppg ppgrsspasg tssgggepgp
      361 rgpqgpsgtp gapgkegepg vqgrdgspgq agpqgfpglp gdagptgekg dpgvglpgpp
      421 gppgppgkst sprymdgldg sgfedfdsdt evvrgpagpp gppgnpgppg psqallpgqp
      481 glkgapgtdg kdgemgtpgl pgadgrpgnp gaqgekgdlg fpgspglkgd lgqegppglp
      541 gpvgpegptg pagqpgppgp pgppaqgfkf nledvegsge lsamgamlpk gppglpgqpg
      601 viglkgerga dgqpglsvkg dagerghege pglpglpgak gqlgdeghpg lkgepgrdal
      661 glvgppgppg lpgpiinlqd lmlndteglf nfsgilgpqg pmgpnglkge sgipgvqgps
      721 gvkgekgepg itmaadgslm tgpegpkgvk gdsgvpgplg ekgpigpvgp kgefglpgrp
      781 grpgmngfkg dqgqavfvhg ppgppgppgr pgmfncpkgt vfpvpprphc kkgvngeskg
      841 egatcltnss kgekgdrglq gipgipapai ailpkgfspn rgdqgyhgek gekgeaglpg
      901 lhgrsglvgp kgesvmgprg hpgipgppgv pgygrdgptg ppgppgprgp pgygsalatp
      961 gppgppgprg spgvsggstg vktypslqsm tqrsyldldg tmyfvtdagr lylkvpggwk
     1021 eiqlgkllev qspiipqdea rprpqspgss sssmpqiheg qalklvalnt pltgnigslt
     1081 aadqacraqa qamgirdqyr aflsnhlqdl vdvihpqyrr tlpivnlrge vlfdnyehif
     1141 tkssalphgi plfsfdgrdv msdpfwpqka iwhgsspqgr rlqekncesw ragdmaivgq
     1201 asflytglln qqsrscsnqf vvlcveaspe pssyqevrrg tryayyyrnp rsshrt
//
DBGET integrated database retrieval system