ID F7I9X6_CALJA Unreviewed; 439 AA.
AC F7I9X6; U3CDB4;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 20-JUN-2018, sequence version 2.
DT 27-MAR-2024, entry version 45.
DE SubName: Full=Collagen alpha-1(XXVI) chain {ECO:0000313|EMBL:JAB23528.1};
DE SubName: Full=Collagen type XXVI alpha 1 chain {ECO:0000313|Ensembl:ENSCJAP00000029560.3};
GN Name=COL26A1 {ECO:0000313|EMBL:JAB23528.1,
GN ECO:0000313|Ensembl:ENSCJAP00000029560.3};
OS Callithrix jacchus (White-tufted-ear marmoset).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Platyrrhini; Cebidae;
OC Callitrichinae; Callithrix; Callithrix.
OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000029560.3, ECO:0000313|Proteomes:UP000008225};
RN [1] {ECO:0000313|Ensembl:ENSCJAP00000029560.3}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.;
RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:JAB23528.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Cerebral cortex {ECO:0000313|EMBL:JAB23528.1};
RX PubMed=25243066; DOI=10.1186/2047-217X-3-14;
RA Maudhoo M.D., Ren D., Gradnigo J.S., Gibbs R.M., Lubker A.C.,
RA Moriyama E.N., French J.A., Norgren R.B.Jr.;
RT "De novo assembly of the common marmoset transcriptome from NextGen mRNA
RT sequences.";
RL Gigascience 3:14-14(2014).
RN [3] {ECO:0000313|Ensembl:ENSCJAP00000029560.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GAMR01010404; JAB23528.1; -; mRNA.
DR STRING; 9483.ENSCJAP00000029560; -.
DR Ensembl; ENSCJAT00000031235.4; ENSCJAP00000029560.3; ENSCJAG00000016061.5.
DR GeneTree; ENSGT00940000161716; -.
DR HOGENOM; CLU_045268_2_0_1; -.
DR OMA; TCKVHNG; -.
DR OrthoDB; 3062338at2759; -.
DR Proteomes; UP000008225; Chromosome 2.
DR Bgee; ENSCJAG00000016061; Expressed in frontal cortex and 2 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005783; C:endoplasmic reticulum; IEA:Ensembl.
DR GO; GO:0031012; C:extracellular matrix; IEA:Ensembl.
DR GO; GO:0005794; C:Golgi apparatus; IEA:Ensembl.
DR GO; GO:0010811; P:positive regulation of cell-substrate adhesion; IEA:Ensembl.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR011489; EMI_domain.
DR PANTHER; PTHR15427:SF23; EMI DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR15427; EMILIN ELASTIN MICROFIBRIL INTERFACE-LOCATED PROTEIN ELASTIN MICROFIBRIL INTERFACER; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF07546; EMI; 1.
DR PROSITE; PS51041; EMI; 1.
PE 2: Evidence at transcript level;
KW Collagen {ECO:0000313|EMBL:JAB23528.1};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000008225};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..439
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5041117711"
FT DOMAIN 52..128
FT /note="EMI"
FT /evidence="ECO:0000259|PROSITE:PS51041"
FT REGION 160..361
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 388..439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 196..216
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 233..265
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..287
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 306..326
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 346..361
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 439 AA; 45409 MW; 23E694B5180367A3 CRC64;
MKLALLLPWA CCCLCGSALA TGFLYPFPAA ALQQHGYPEP GAGSPGSGYA SRRHWCHHTV
TRTVSCQVQN GSETVVQRVY QSCRWPGPCA NLVSYRTLIR PTYRVSYRTV TALEWRCCPG
FTGSNCDEEC MNCTRLSDMS ERLTTLEAKV LLLEAAERPL SPDNDLPVPQ STSPPWNEDF
LPDAIPLAHP GPRQRRPTGP AGPPGQTGPP GPAGPPGSKG DRGQTGEKGP AGPPGLLGPP
GPRGLPGEMG RPGPPGPPGP AGSLGPSPNS SPQGALYSLQ PPTDKDNGDS RLASAIVDTV
LAGIPGPRGP PGPPGPPGPR GPPGPPGSQG LAGERGTVGP SGEPGMKGEE GEKAPTAESE
GVKQLREALK ILAERVLILE HMIGVHDPLA SPEGGSGQDA ALRANLKMKR GGPQPEGVLA
ALLGPDPEQK SVDRASSRK
//