LOCUS XP_019636314 1385 aa linear INV 28-DEC-2016
DEFINITION PREDICTED: collagen alpha-1(I) chain-like isoform X7 [Branchiostoma
belcheri].
ACCESSION XP_019636314
VERSION XP_019636314.1
DBLINK BioProject: PRJNA358734
DBSOURCE REFSEQ: accession XM_019780755.1
KEYWORDS RefSeq.
SOURCE Branchiostoma belcheri (Belcher's lancelet)
ORGANISM Branchiostoma belcheri
Eukaryota; Metazoa; Chordata; Cephalochordata; Leptocardii;
Amphioxiformes; Branchiostomatidae; Branchiostoma.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_017804087.1) annotated using gene prediction method: Gnomon,
supported by EST evidence.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Version :: Branchiostoma belcheri Annotation
Release 100
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 7.2
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..1385
/organism="Branchiostoma belcheri"
/isolate="BF01"
/isolation_source="seawater"
/db_xref="taxon:7741"
/chromosome="Unknown"
/sex="male"
/cell_type="sperm cells"
/tissue_type="gonad"
/dev_stage="adult"
/geo_loc_name="China: Xiamen Bay"
/collection_date="Aug-2008"
/breed="outbred"
Protein 1..1385
/product="collagen alpha-1(I) chain-like isoform X7"
/calculated_mol_wt=137771
Region 41..235
/region_name="LamG"
/note="Laminin G domain; Laminin G-like domains are
usually Ca++ mediated receptors that can have binding
sites for steroids, beta1 integrins, heparin, sulfatides,
fibulin-1, and alpha-dystroglycans. Proteins that contain
LamG domains serve a variety of...; cl22861"
/db_xref="CDD:473984"
Region <324..>705
/region_name="gly_rich_SclB"
/note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
NF038329"
/db_xref="CDD:468478"
Region <644..>883
/region_name="gly_rich_SclB"
/note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
NF038329"
/db_xref="CDD:468478"
Region 1088..1127
/region_name="Collagen_trimer"
/note="Collagen trimerization domain; pfam20010"
/db_xref="CDD:466257"
Region 1198..1372
/region_name="Endostatin-like"
/note="Endostatin-like domain; the angiogenesis inhibitor
endostatin is a C-terminal fragment of collagen XV/XVIII,
a proteoglycan/collagen found in vessel walls and basement
membranes; this domain has a compact globular fold similar
to that of C-type lectins; cl23985"
/db_xref="CDD:474121"
Site order(1226,1235,1241,1250..1251,1254,1323..1324,1334)
/site_type="other"
/note="putative ligand binding site [chemical binding]"
/db_xref="CDD:238151"
CDS 1..1385
/gene="LOC109478941"
/coded_by="XM_019780755.1:450..4607"
/db_xref="GeneID:109478941"
ORIGIN
1 mvclfageqg rlvwlvalva viatppghgq eiigesggqq didllqmigv plptpirfvs
61 gydgfpafef gseanigrla rtffpnmfyk dfsilvttrp nfqeggilfa vtnsfqtviq
121 lglkiadagk grsgeelqnl tfiytdtrns evtqevakft ipttagewlr fslsirgnav
181 tlyynceere tqffdrtvsq lefapaaavf vgqagaaeeg kylgsiqell irkdpnaaeq
241 qcsgdageeg avsgsgdgge egtiitipgt vipgrpntph atqetpvdgp dvnepypdts
301 nmerhdgdtt gwpelpeggg ntpglpglpg vpgpkgatgp qgppglrgqk gepgdttlve
361 gpigipgerg lqglpgtpge agpvgppger gpkgergeag lkgdpgvglt gppgppgppg
421 vvtvgeddqv isgtpgskga pgapglpglp gpagitgakg epgesiagva gppgppgppg
481 lpgppgpsng fvpetaiigl egsgyefagq gvsgppgppg ppgipglpgp pglpglpgkp
541 gtfgtgnitn giqgppgrdg ldgltgppgl pglpgqdgli gpkgeagapg iqgsagskge
601 pgqpglpgpp gppgpsgggg gifgfggssg gpgpagepgi pgnpgqkgeq gtagpegpqg
661 spglpgpvgp rgpkgdagea givgpqgpkg dmgprgppge agrdgvglpg ppgppgppgl
721 pgtisvlpgd demtvspgfp pvggegemtd gftggvigpa gpegprgppg iagppgpigr
781 pgepglpger gekgdsgeag rkgdrgepgp avtvdgdvlq ikgakgepgl egvagfpgkk
841 gepgdagvrg pegpvgpkgm igevgfpgrm gipgvrgqkg ergdtgtglp gppgppgppg
901 lpsgasfpag sfpvlqkgek gdigpsgppg ppgppgslas gpglsfggag vigpagpkge
961 kgmmgirgpl grqgrkgeig lpgrkgdtgl pgppgapggg ffggsgqvvq gptgppgpqg
1021 prgppgfpgr gpsgppgprg ppgpagigap glpgppgrpg qpgasigsgi mgppgppgpp
1081 graagiitfn sesqllrspp sssgtlafia dteqlylrvr dgwqiiltqp lrptvqvgki
1141 msiperppiq peasaenpgh anngfnngff gsevdaptvq lvgspddggl gkatgkmlhl
1201 ialnepmtgn mygirgadfk cfqqarqagl rgtyraflss kvqdlssvvs rgdrdgipiv
1261 nlkdeilfps wnsifegerq tnkykggafd intaiytfng tqpllnptwp hkriwhgtnm
1321 dgqrlgdhfc sawrendvsy vgmasslqtg lllgqeqysc sssyivlcie nthkrhhrly
1381 nfrrk
//