LOCUS XP_036701352 1000 aa linear MAM 28-OCT-2020
DEFINITION LOW QUALITY PROTEIN: macrophage colony-stimulating factor 1
receptor [Balaenoptera musculus].
ACCESSION XP_036701352
VERSION XP_036701352.1
DBLINK BioProject: PRJNA607322
DBSOURCE REFSEQ: accession XM_036845457.1
KEYWORDS RefSeq; corrected model.
SOURCE Balaenoptera musculus (blue whale)
ORGANISM Balaenoptera musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha;
Cetacea; Mysticeti; Balaenopteridae; Balaenoptera.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NC_045787.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Balaenoptera musculus Annotation
Release 100
Annotation Version :: 100
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.5
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
frameshifts :: corrected 5 indels
internal stop codons :: corrected 1 genomic stop codons
##RefSeq-Attributes-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..1000
/organism="Balaenoptera musculus"
/isolate="JJ_BM4_2016_0621"
/db_xref="taxon:9771"
/chromosome="3"
/sex="male"
/cell_type="Fibroblast"
/tissue_type="Epidermis and Blubber"
/geo_loc_name="Pacific Ocean: Santa Barbara"
/lat_lon="34.41938 N 119.69905 W"
/collection_date="2016"
/collected_by="Jeff K. Jacobsen, Susanne Meyer, Li-Fang
Chu, Jessica Antosiewicz-Bourget"
Protein 1..1000
/product="LOW QUALITY PROTEIN: macrophage
colony-stimulating factor 1 receptor"
/calculated_mol_wt=110409
Region 62..135
/region_name="IG_like"
/note="Immunoglobulin like; smart00410"
/db_xref="CDD:214653"
Region 241..334
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 258..262
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 272..276
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 298..302
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 314..319
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 327..330
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409353"
Region 338..437
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 358..362
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 372..376
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 401..405
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 415..420
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 428..431
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409353"
Region 438..539
/region_name="IG_like"
/note="Immunoglobulin like; smart00410"
/db_xref="CDD:214653"
Region 453..456
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 465..469
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 500..504
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 518..523
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 579..951
/region_name="Protein Kinases, catalytic domain"
/note="The protein kinase superfamily is mainly composed
of the catalytic domains of serine/threonine-specific and
tyrosine-specific protein kinases. It also includes RIO
kinases, which are atypical serine protein kinases,
aminoglycoside phosphotransferases; cl21453"
/db_xref="CDD:473864"
Site order(624..627,630,632,650,652,683,699..702,815,819..820,
822,832..833)
/site_type="other"
/note="ATP binding site [chemical binding]"
/db_xref="CDD:133237"
CDS 1..1000
/gene="CSF1R"
/coded_by="XM_036845457.1:63..3065"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: inserted 3 bases in 3 codons;
deleted 4 bases in 2 codons; substituted 1 base at 1
genomic stop codon"
/db_xref="GeneID:118891756"
ORIGIN
1 mssrpgpcga svqessslpt pkpcigpaaf lprpwargtl lillvatawh gqgvrlieps
61 gpelvvepgt tvtlrclsng svewdgpist ywtldadaps gilttkqatf lntgtyrcte
121 rgdplxgsat ihlyvkgedc epcrpwrvlv devtvdxgsg raapcllxdp algagvslvr
181 vrgptflrqt nysflpshgf iihkakfies qdyecsarvd grmvkvpqhp lkvqkvipgp
241 ptltlkptel vriqgeaaqi vcsasdvdvn fdvflqhgdt kvtnnlsqsd fhdnryqkvl
301 tlnldhvgfq dagnytcmat naxgihstsm vfwvvdsayl nltseqnllq evtvgekiel
361 kvkveaypsl qsfnwtyrgp flgqqpklnf vtnkgtyryt stltllrlkp feagrysfqa
421 rnargeealt feltllyppe vevtwtling sktllceasg ypqpnvtwvq crghtnrcdk
481 tevlvlddpn pevlsqkpfh kvtvqsllat gtlehnrtye craynsvgns sqafgpvsvg
541 aymqpldepl ftpvlvacms imalllllll lllykykqkp kyqvrwkiie syggnsytfi
601 dptqlpynek wefprnnlqf gktlgagafg kvveatafgl gkedavlkva vkmlkstaha
661 dekealmsel kimshlgqhe nivnllgact hggpvlvite yccygdllnf lrrkaeamlg
721 pslspgqdpk agtgyknihl ekkyirrdsg fssqgvdtyv emrpvstsss ndsfseqdld
781 kedgrplelr dllhfssqva qgmaflaskn cihrdvaarn vlltsghvak igdfglardi
841 mndsnyivkg narlpvkwma pesifdcvyt vqsdvwsygi llweifslgl npypgilvns
901 kfyklvkdgy qmaqpvfapk niysimqscw aleptrrptf qqicsllqeq vqvdrreqdy
961 snlpssssep eeesssehla cceqgdiaqp llqpnnyqfc
//