LOCUS XP_031440237 1256 aa linear VRT 27-JUL-2021
DEFINITION collagen alpha-1(XVIII) chain isoform X1 [Clupea harengus].
ACCESSION XP_031440237
VERSION XP_031440237.1
DBLINK BioProject: PRJNA587748
DBSOURCE REFSEQ: accession XM_031584377.2
KEYWORDS RefSeq.
SOURCE Clupea harengus (Atlantic herring)
ORGANISM Clupea harengus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Actinopterygii; Neopterygii; Teleostei; Clupei; Clupeiformes;
Clupeoidei; Clupeidae; Clupea.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NC_045168.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Clupea harengus Annotation Release
102
Annotation Version :: 102
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 9.0
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..1256
/organism="Clupea harengus"
/db_xref="taxon:7950"
/chromosome="17"
Protein 1..1256
/product="collagen alpha-1(XVIII) chain isoform X1"
/calculated_mol_wt=128969
Region 31..220
/region_name="LamG"
/note="Laminin G domain; Laminin G-like domains are
usually Ca++ mediated receptors that can have binding
sites for steroids, beta1 integrins, heparin, sulfatides,
fibulin-1, and alpha-dystroglycans. Proteins that contain
LamG domains serve a variety of...; cl22861"
/db_xref="CDD:473984"
Region <370..>529
/region_name="gly_rich_SclB"
/note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
NF038329"
/db_xref="CDD:468478"
Region <547..>794
/region_name="gly_rich_SclB"
/note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
NF038329"
/db_xref="CDD:468478"
Region <706..>912
/region_name="gly_rich_SclB"
/note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
NF038329"
/db_xref="CDD:468478"
Region 981..1028
/region_name="Collagen_trimer"
/note="Collagen trimerization domain; pfam20010"
/db_xref="CDD:466257"
Region 1062..1228
/region_name="Endostatin-like"
/note="Endostatin-like domain; the angiogenesis inhibitor
endostatin is a C-terminal fragment of collagen XV/XVIII,
a proteoglycan/collagen found in vessel walls and basement
membranes; this domain has a compact globular fold similar
to that of C-type lectins; cl23985"
/db_xref="CDD:474121"
Site order(1091,1100,1106,1115..1116,1119,1180..1181,1191)
/site_type="other"
/note="putative ligand binding site [chemical binding]"
/db_xref="CDD:238151"
CDS 1..1256
/gene="col15a1a"
/coded_by="XM_031584377.2:342..4112"
/db_xref="GeneID:105890075"
ORIGIN
1 mmsrgslwsf alflwychpt tafleergtk gqldltelig vplppdvafi tgfegfpays
61 fgpganvgrl arsfvpdpff rdfaiivtak pttrqggvlf aitdamqkvv hlgvmlaave
121 dgtqrvvlyy tepgaattqe aasfkmgelt grwarftlav qghevrlymd ceeyhrvafq
181 rsagqltfqp ssgifvgnag ntglerfvgs iqqlvltpdp rapddqceed dpyasgygsg
241 ddlddtermd evkklveere ytmpedfltm pvqappteea ssddedleeg tsgqaidlpd
301 eratehtarv dtvhretsln pglkgekgda gsagppgppg ppgrsspasg tssgggepgp
361 rgpqgpsgtp gapgkegepg vqgrdgspgq agpqgfpglp gdagptgekg dpgvglpgpp
421 gppgppgkst sprymdgldg sgfedfdsdt evvrgpagpp gppgnpgppg psqallpgqp
481 glkgapgtdg kdgemgtpgl pgadgrpgnp gaqgekgdlg fpgspglkgd lgqegppglp
541 gpvgpegptg pagqpgppgp pgppaqgfkf nledvegsge lsamgamlpk gppglpgqpg
601 viglkgerga dgqpglsvkg dagerghege pglpglpgak gqlgdeghpg lkgepgrdal
661 glvgppgppg lpgpiinlqd lmlndteglf nfsgilgpqg pmgpnglkge sgipgvqgps
721 gvkgekgepg itmaadgslm tgpegpkgvk gdsgvpgplg ekgpigpvgp kgefglpgrp
781 grpgmngfkg dqgqavfvhg ppgppgppgr pgmfncpkgt vfpvpprphc kkgvngeskg
841 egatcltnss kgekgdrglq gipgipapai ailpkgfspn rgdqgyhgek gekgeaglpg
901 lhgrsglvgp kgesvmgprg hpgipgppgv pgygrdgptg ppgppgprgp pgygsalatp
961 gppgppgprg spgvsggstg vktypslqsm tqrsyldldg tmyfvtdagr lylkvpggwk
1021 eiqlgkllev qspiipqdea rprpqspgss sssmpqiheg qalklvalnt pltgnigslt
1081 aadqacraqa qamgirdqyr aflsnhlqdl vdvihpqyrr tlpivnlrge vlfdnyehif
1141 tkssalphgi plfsfdgrdv msdpfwpqka iwhgsspqgr rlqekncesw ragdmaivgq
1201 asflytglln qqsrscsnqf vvlcveaspe pssyqevrrg tryayyyrnp rsshrt
//