ID A0A2I3TX02_PANTR Unreviewed; 1596 AA.
AC A0A2I3TX02;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE SubName: Full=Collagen type XXII alpha 1 chain {ECO:0000313|Ensembl:ENSPTRP00000093512.1};
GN Name=COL22A1 {ECO:0000313|Ensembl:ENSPTRP00000093512.1,
GN ECO:0000313|VGNC:VGNC:50312};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000093512.1, ECO:0000313|Proteomes:UP000002277};
RN [1] {ECO:0000313|Ensembl:ENSPTRP00000093512.1, ECO:0000313|Proteomes:UP000002277}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16136131; DOI=10.1038/nature04072;
RG Chimpanzee sequencing and analysis consortium;
RT "Initial sequence of the chimpanzee genome and comparison with the human
RT genome.";
RL Nature 437:69-87(2005).
RN [2] {ECO:0000313|Ensembl:ENSPTRP00000093512.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACZ04067425; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AACZ04067426; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR Ensembl; ENSPTRT00000077974.1; ENSPTRP00000093512.1; ENSPTRG00000020613.6.
DR VGNC; VGNC:50312; COL22A1.
DR GeneTree; ENSGT00940000159308; -.
DR Proteomes; UP000002277; Chromosome 8.
DR Bgee; ENSPTRG00000020613; Expressed in cerebellar cortex and 5 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR37456:SF3; COLLAGEN ALPHA-1(XXV) CHAIN; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 10.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..1596
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014140901"
FT DOMAIN 38..213
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 493..1073
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1089..1428
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1461..1580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 593..608
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 656..674
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 699..719
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 728..757
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 807..830
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1018..1039
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1190..1204
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1288..1309
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1405..1422
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1558..1575
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1596 AA; 158136 MW; A4E55B9D7F7AE47B CRC64;
MAGLRGNAMA GLLWMLLLWS GGGSCQAQRA GCKSVHYDLV FLLDTSSSVG KEDFEKVRQW
VANLVDTFEV GPDRTRVGVV RYSDRPTTAF ELGLFGSQEE VKAAARRLAY HGGNTNTGDA
LRYITARSFS PRAGGRPRDR AYKQVAILLT DGRSQDLVLD AAAAAHRAGI RIFAVGVGEA
LKEELEEIAS EPKSAHVFHV SDFNAIDKIR GKLRRRLCEN VLCPSVRVEG DRFKHTNGGT
KEITGFDLMD LFSVKEILGK RENGAQSSYV RMGSFPVVQS TEDVFPQGLP DEYAFVTTFR
FRKTSRKEDW YIWQVIDQYG IPQVSIRLDG ENKAVEYNAV GAMKDAVRVV FRGSRVNDLF
DRDWHKMALS IQAQNVSLHI DCALVQTLPI EERENIDIQG KTVIGKRLYD SVPIDFDLQR
IVIYCDSRHA ELETCCDIPS GPCQVTVVTQ PPPPPPPQRP PTPGSEQIGF LKTINCSCPA
GEKGEMGVAG PMGLPGPKGD IGATGPVGAP GPKGEKGDVG IGPFGQGEKG EKGSLGLPGP
PGRDGSKGMR GEPGELGEPG LPGEVGMRGP QGPPGLPGPP GHVSAPGLQG ERGEKGTRGE
KGERGLDGIP GKPGDTGQQG RPGPSGVAGP QGEKGDVGPA GPPGVPGSVG APGPRGHQGP
PGPPGAPGPI GPEGRDGPPG LQGLRGKKGD MGPPGIPGLL GLQGPPGPPG VPGPPGLGGS
PGLPGEIGFP GKPGPPGPTG PPGKDGPNGP PGPPGTKGEP GERGEDGLPG KPGLRGEIGE
QGLAGRPGEK GEAGLPGAPG FPGVRGEKGD QGEKGELGLP GLKGDRGEKG EAGPAGPPGL
PGTTSLFTPH PRMPGEQGPK GEKGDPGLPG EPGLQGRPGE LGPRGPAGPP GAKGQEGAHG
APGAAGNPGA PGHVGPPGPS GPPGSVGAPG LRGPPGKDGE RGEKGAAGEE GSPGPVGPRG
DPGAPGLPGP PGKGKDGEPG LRGSPGLPGT LGTKGDRGAP GIPGSPGSRG DPGIGVAGPP
GPSGPPGDKG PPGSRGLPGF PGPQGPAGRD GAPGNPGERG PPGKPGLSSL LSPGDINLLA
KDVCNDCPPG PPGLPGLPGF KGDKGVPGKP GREGTGGKKG EAGPPGLPGP PGIAGPQGSQ
GERGADGEVG QKGDQGHPGV PGFMGPPGNP GPPGADGIAG AAGPPGIQGS PGKEGPPGPQ
GPSGLPGIPG EEGKEGRDGK PGPPGEPGKA GEPGLPGPEG ARGPPGFKGH TGDSGAPGPR
GEPGAMGPPG QEGLPGKDGD TGPTGPQGPQ GPRGPPGKNG SPGSPGEPGP SGTPGQKGSK
GENGSPGLPG FLGPRGPPGE PGEKGVPGKE GVPGKPGEPG FKGERGDPGI KGDKGPPGGK
GQPGDPGIPG HKGHTGLMGP QGPPGENGPV GPPGPPGQPG FPGLRGESPS METLRRLIQE
ELGKQLETRL AYLLAQMPPA YMKSSQGRPG PPGPPGKDGL PGRAGPMGEP GRPGQRGLEG
PSGPIGPKGE RGAKGDPGAP GVGLRGEMGP PGIPGQPGEP GYAKDGLPGI PGPQGETGPA
GHPGPPGPPG PPGQCDPSQC AYFASLAARP GNVKGP
//