ID F7FE47_MACMU Unreviewed; 1604 AA.
AC F7FE47;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2019, sequence version 3.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=Collagen type XVI alpha 1 chain {ECO:0000313|Ensembl:ENSMMUP00000030804.4};
GN Name=COL16A1 {ECO:0000313|Ensembl:ENSMMUP00000030804.4,
GN ECO:0000313|VGNC:VGNC:71293};
OS Macaca mulatta (Rhesus macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000030804.4, ECO:0000313|Proteomes:UP000006718};
RN [1] {ECO:0000313|Proteomes:UP000006718}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=17573 {ECO:0000313|Proteomes:UP000006718};
RX PubMed=17431167; DOI=10.1126/science.1139247;
RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M.,
RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., Wilson R.K.,
RA Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., Hardison R.C.,
RA Makova K.D., Miller W., Milosavljevic A., Palermo R.E., Siepel A.,
RA Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J.,
RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., Dinh H.H.,
RA Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., Godfrey J.,
RA Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., Jhangiani S.N.,
RA Joshi V., Khan Z.M., Kirkness E.F., Cree A., Fowler R.G., Lee S.,
RA Lewis L.R., Li Z., Liu Y.-S., Moore S.M., Muzny D., Nazareth L.V.,
RA Ngo D.N., Okwuonu G.O., Pai G., Parker D., Paul H.A., Pfannkoch C.,
RA Pohl C.S., Rogers Y.-H.C., Ruiz S.J., Sabo A., Santibanez J.,
RA Schneider B.W., Smith S.M., Sodergren E., Svatek A.F., Utterback T.R.,
RA Vattathil S., Warren W., White C.S., Chinwalla A.T., Feng Y., Halpern A.L.,
RA Hillier L.W., Huang X., Minx P., Nelson J.O., Pepin K.H., Qin X.,
RA Sutton G.G., Venter E., Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P.,
RA Jones S.M., Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L.,
RA Csuros M., Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H.,
RA Liu Y., Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E.,
RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J.,
RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J.,
RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A.,
RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., Denby A.,
RA Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., Marklein A.,
RA Nielsen R., Vallender E.J., Clark A.G., Ferguson B., Hernandez R.D.,
RA Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., Pu L.-L., Ren Y.,
RA Smith D.G., Wheeler D.A., Schenck I., Ball E.V., Chen R., Cooper D.N.,
RA Giardine B., Hsu F., Kent W.J., Lesk A., Nelson D.L., O'brien W.E.,
RA Pruefer K., Stenson P.D., Wallace J.C., Ke H., Liu X.-M., Wang P.,
RA Xiang A.P., Yang F., Barber G.P., Haussler D., Karolchik D., Kern A.D.,
RA Kuhn R.M., Smith K.E., Zwieg A.S.;
RT "Evolutionary and biomedical insights from the rhesus macaque genome.";
RL Science 316:222-234(2007).
RN [2] {ECO:0000313|Ensembl:ENSMMUP00000030804.4}
RP IDENTIFICATION.
RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000030804.4};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9544.ENSMMUP00000030804; -.
DR PaxDb; 9544-ENSMMUP00000030804; -.
DR Ensembl; ENSMMUT00000032922.4; ENSMMUP00000030804.4; ENSMMUG00000023398.4.
DR VEuPathDB; HostDB:ENSMMUG00000023398; -.
DR VGNC; VGNC:71293; COL16A1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000161782; -.
DR HOGENOM; CLU_001074_2_2_1; -.
DR InParanoid; F7FE47; -.
DR OMA; MGNSWQP; -.
DR Proteomes; UP000006718; Chromosome 1.
DR Bgee; ENSMMUG00000023398; Expressed in olfactory segment of nasal mucosa and 20 other cell types or tissues.
DR GO; GO:0005594; C:collagen type IX trimer; IBA:GO_Central.
DR GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR GO; GO:0005178; F:integrin binding; IEA:Ensembl.
DR GO; GO:0033627; P:cell adhesion mediated by integrin; IEA:Ensembl.
DR GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR GO; GO:0033622; P:integrin activation; IEA:Ensembl.
DR GO; GO:0051894; P:positive regulation of focal adhesion assembly; IEA:Ensembl.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF845; COLLAGEN ALPHA-1(XXII) CHAIN; 1.
DR Pfam; PF01391; Collagen; 9.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000006718};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..1604
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5023855542"
FT DOMAIN 50..231
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 309..510
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 602..625
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 637..919
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1002..1429
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1468..1552
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..473
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 905..919
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1040..1059
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1123..1137
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1159..1177
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1283..1303
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1604 AA; 157863 MW; 0958995EAA2FF1FB CRC64;
MWVSWAPGLW LLGLWATFSH GANTGAHCPP SQQEGLKLEH SSSLPANVTG FNLIRRLSLM
KMSAIKKIRN PKGPLILRLG AASVTQPTRR VFPRGLPEEF ALVLTLLLKK HTHQKTWYLF
QVTDADGYPQ ISLEVNNQER SLELRAQGQD GDFVSCIFPV PQLFDLRWHK LMLSVAGRVA
SVHVDCSSAS SQPLGPRRPL RPVGHVFLGL DAEQGKPVSF DLQQVHIYCD PELVLEEGCC
EILPAGCPPE TSKARRDTQS NELIEINPQS EGKVYTRCFC LEEPQNSQVD AQLTGRINQK
AERGAKVHRE TAADECPPCV HGAQDSNVTL APSGPKGGKG ERGLPGPPGS KGEKGARGND
CVRISPDAPL QCAEGPKGEK GQSGALGPSG LPGSTGEKGQ KGEKGDGGLK GVPGKPGRDG
RPGEICVIGP KGQKGDPGFV GPEGLAGEPG PPGLPGPPGI GLPGTPGDPG GPPGPKGDKG
SSGIPGKEGP GGKPGKPGVK GEKGDPCEVC PTLPEGFQSF VGLPGKPGPK GEPGDPVSAR
GDPGIQGIKG EKGESCLSCS SVVGAQHLAS STGASGDVGS PGFGLPGLPG KAGLPGLKGE
KGNFGEAGPA GSPGPPGPVG PVGIKGAKGE PCEPCPALSN LQDGDVRVVA LPGPSGEKGE
PGPPGFGLPG KQGKAGERGL KGQKGDAGNP GDPGTPGTTG RPGLSGEPGV RGPAGPKGEK
GDGCTACPSL QGTVTDVAGR PGQPGPKGEP GPEGVGRPGK PGQPGLPGVQ GPPGLKGMQG
EPGPPGRGVQ GPQGEPGAPG LPGIQGLPGP RGPPGPTGEK GAQGSPGVKG ATGPVGPPGT
SVSGPPGRDG QQGQTGPRGT PGEKGPRGEK GEPGECSCPS RGDLVFSGMP GAPGLWMGSS
WQPGPQGPPG IPGPPGPPGV PGLQGVPGNN GLPGQPGLTA ELGSLPIEQH LLKSICGDCV
QGQRAHPAYL VEKGEKGDQG IPGVLGLDNC AQCFLSLERP RAEEARGDNN EGDPGCIGSP
GLPGPPGLPG QRGEEGPPGM RGSPGPPGPI GPPGFPGAVG SPGLPGLQGE RGLTGLTGDK
GEPGPPGQPG YPGAMGPPGL PGIKGERGYT GSAGEKGEPG PPGSEGLPGP PGPAGPRGER
GPQGNSGEKG DQGFQGQPGF PGPPGPPGFP GKVGAPGPPG PQAEKGSEGI RGPSGMPGSP
GPPGPPGIQG PAGLDGLDGK DGKPGLRGDP GPAGPPGLMG PPGFKGKTGH PGLPGPKGDC
GKPGPPGSTG RPGAEGEPGA MGPQGRPGPP GHVGPPGPPG QPGPAGISAV GLKGDRGATG
ERGLAGLPGQ PGPPGHPGPP GEPGTDGAAG KEGPAGKQGL YGPPGPKGDP GAAGQKGQAG
EKGRAGMPGG PGKSGSMGPV GPPGPAGERG HPGAPGPSGS PGLPGVPGSM GDMVNYDEIK
RFIRQEIIKM FDERMAYYTS RMQFPMEMAA APGRPGPPGK DGAPGRPGAP GSPGLPGQIG
REGRQGLPGM RGLPGTKGEK GDIGVGIAGE NGLPGPPGPQ GPPGYGKMGA TGPMGQQGIP
GIPGPPGPMG QPGKAGHCSP SDCFGAMPME QQYPPMKTMK GPFG
//