LOCUS XP_034276142 1847 aa linear VRT 03-NOV-2023
DEFINITION collagen alpha-1(XVIII) chain isoform X2 [Pantherophis guttatus].
ACCESSION XP_034276142
VERSION XP_034276142.1
DBLINK BioProject: PRJNA1031231
DBSOURCE REFSEQ: accession XM_034420251.2
KEYWORDS RefSeq.
SOURCE Pantherophis guttatus
ORGANISM Pantherophis guttatus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata;
Toxicofera; Serpentes; Colubroidea; Colubridae; Colubrinae;
Pantherophis.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_026844021) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI RefSeq
Annotation Status :: Full annotation
Annotation Name :: GCF_029531705.1-RS_2023_10
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 10.2
Annotation Method :: Gnomon; cmsearch; tRNAscan-SE
Features Annotated :: Gene; mRNA; CDS; ncRNA
Annotation Date :: 10/30/2023
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..1847
/organism="Pantherophis guttatus"
/isolate="1"
/db_xref="taxon:94885"
/chromosome="Unknown"
/sex="male"
/tissue_type="blood"
Protein 1..1847
/product="collagen alpha-1(XVIII) chain isoform X2"
/calculated_mol_wt=192817
Region <40..>161
/region_name="DUF959"
/note="Domain of Unknown Function (DUF959); pfam06121"
/db_xref="CDD:399255"
Region 404..527
/region_name="CRD_FZ"
/note="CRD_domain cysteine-rich domain, also known as Fz
(frizzled) domain; cl02447"
/db_xref="CDD:470581"
Site order(417,419..420,423..425)
/site_type="other"
/note="putative Wnt binding site [polypeptide binding]"
/db_xref="CDD:143549"
Region 531..715
/region_name="LamG"
/note="Laminin G domain; Laminin G-like domains are
usually Ca++ mediated receptors that can have binding
sites for steroids, beta1 integrins, heparin, sulfatides,
fibulin-1, and alpha-dystroglycans. Proteins that contain
LamG domains serve a variety of...; cl22861"
/db_xref="CDD:473984"
Region <799..>1079
/region_name="gly_rich_SclB"
/note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
NF038329"
/db_xref="CDD:468478"
Region <1038..>1257
/region_name="gly_rich_SclB"
/note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
NF038329"
/db_xref="CDD:468478"
Region <1151..>1441
/region_name="gly_rich_SclB"
/note="LPXTG-anchored collagen-like adhesin Scl2/SclB;
NF038329"
/db_xref="CDD:468478"
Region 1514..1561
/region_name="Collagen_trimer"
/note="Collagen trimerization domain; pfam20010"
/db_xref="CDD:466257"
Region 1673..1841
/region_name="Endostatin"
/note="Collagenase NC10 and Endostatin; pfam06482"
/db_xref="CDD:461931"
Site order(1701,1710,1716,1725..1726,1729,1791..1792,1802)
/site_type="other"
/note="putative ligand binding site [chemical binding]"
/db_xref="CDD:238151"
CDS 1..1847
/gene="COL18A1"
/coded_by="XM_034420251.2:79..5622"
/db_xref="GeneID:117667073"
ORIGIN
1 marvivsawl llmllcclan aqgwrnwfws gseettlspt kaaeaedets qdhstpaata
61 rpnapfestt dpeprgkagt iftlkqpdft ptvpvpaats psqeesrern itgvgveiln
121 vaegiqnlvq lldekttdrt ertevpatte tsaspapvte pgsiqnvtsn ltgdiqtslk
181 tkkpegatkl arlwnkdlal lwnktrvfpk kpgkprqgsa sfsfspdghf sgtmlvfqes
241 peagqevaft ppairatwga fskkqgilst akalklqesq ansssssgsn snsslhagmp
301 skvvgtldsw vlpyvtnpsq pvskdsgahl sfkshpffkh fgigvagakt nhsrdsvsns
361 satnvmdfpp annsdslefl ltyavqhsns ssglpsflpg ltpsagrclp lptklsycnh
421 lgtkhfrvpn ylhhgseeev waalhewegl lksrchryle wflclllvpg cnasfpvtpp
481 pcrgfcealk dlcwthwkag rlpisceslp eedgpyscvf vnvsaenfsr evglleligd
541 pppdqitkiy gpdkspayvf spdanagqva ryhlpspfyr dfsllfyiqs tsdnagvlfa
601 ltdasqsiiy vgvklsevkd gkqqiifyyt epgsqnsnva atftvpslvn lwtrfalsvr
661 dynvvlymdc eefkmvhler ssgkieleeg aglfvgqagg adpdkyqgii velkikdnpw
721 aagyqcveed ddcdtcggsg sgldikqpps ekesviplls klpvpppvts paiakkpvql
781 eeteyterpt yvpasgtkge kgdpgekgdr gpkgdpgtgv lstngdkgek gsagfgypgs
841 kgqkgepgtt glpgpigppg ppgtimrhsd gstvegpgam gppglpgkdg qpgkdgepgd
901 pgedgkpgdv gpqgfpgtpg epglkgekge psvgargppg ppgppgkpgl sskvdkltfi
961 dmegsgfgse leslrgprgp pgppgppgvp glpgqpgrfg tngtdfpglp glpgvpgrng
1021 nsgipgppgp lgppgkdgip gqpgekgapg epgemgfpga pgpegnqgvp gfpgtpgepg
1081 laglpgpmgp rglpgppgpg iaaefvdmeg sgfplvsggp gtrgpegppg lpglpglpgp
1141 pgpkgdegii glpglpgekg ypglpgldgr pglegfpgpq gqkgeegspg skgekgqdgi
1201 glpgppgppg qavylssedk tvpvlpgpeg pkgpkgdsgt pglqgypglk gekgepgvvt
1261 rpdgtilaae akgekgepgp sgpmgpggpp grygrkgelg fpgrpgrpgm nglkgekgdp
1321 adlsgalglr gppgppgppg ppgspvpvye nnafsdlgpp gppglpgyhg qkgekgeqgl
1381 pgppgqfpyd lsrfsstfrg ergdkgdpgm kgekgesggg snaaglpgpq gypglpgpkg
1441 esirglpgpp gpqgppgagf eghpgpqgpp gppgppgpps fpgphrqhis ipgppgppgp
1501 pgppgtsdpp slgvrilaty qnlmsrahev pegqllfire reelyirvhn gfrkilmeek
1561 isipgsgldn evyersssih yshggtassg shrpfqphlp vharpeysay stakpwrgde
1621 siidphhlpe qpavhpprqg aqqesldhff pnnrqtetap lavhthhdfq palhlialns
1681 pqsgsmrgir gadfqcfqqa rqvglpgtfr aflssrlqdl ysivrradrs tvpivnlrde
1741 vlfnnwenlf sgseapfrtg vrilsfdgrd vlrdsawpqk yvwhgsdskg rrltesycet
1801 wrtddtvvtg qasslasskl leqksnscrn afivlciens fmtsskk
//