Database: UniProt
Entry: A0A2I3TX02_PANTR
ID   A0A2I3TX02_PANTR        Unreviewed;      1596 AA.
AC   A0A2I3TX02;
DT   28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT   28-FEB-2018, sequence version 1.
DT   27-MAR-2024, entry version 20.
DE   SubName: Full=Collagen type XXII alpha 1 chain {ECO:0000313|Ensembl:ENSPTRP00000093512.1};
GN   Name=COL22A1 {ECO:0000313|Ensembl:ENSPTRP00000093512.1,
GN   ECO:0000313|VGNC:VGNC:50312};
OS   Pan troglodytes (Chimpanzee).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Pan.
OX   NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000093512.1, ECO:0000313|Proteomes:UP000002277};
RN   [1] {ECO:0000313|Ensembl:ENSPTRP00000093512.1, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136131; DOI=10.1038/nature04072;
RG   Chimpanzee sequencing and analysis consortium;
RT   "Initial sequence of the chimpanzee genome and comparison with the human
RT   genome.";
RL   Nature 437:69-87(2005).
RN   [2] {ECO:0000313|Ensembl:ENSPTRP00000093512.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AACZ04067425; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04067426; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   Ensembl; ENSPTRT00000077974.1; ENSPTRP00000093512.1; ENSPTRG00000020613.6.
DR   VGNC; VGNC:50312; COL22A1.
DR   GeneTree; ENSGT00940000159308; -.
DR   Proteomes; UP000002277; Chromosome 8.
DR   Bgee; ENSPTRG00000020613; Expressed in cerebellar cortex and 5 other cell types or tissues.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR37456:SF3; COLLAGEN ALPHA-1(XXV) CHAIN; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01391; Collagen; 10.
DR   Pfam; PF00092; VWA; 1.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF53300; vWA-like; 1.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR   PROSITE; PS50234; VWFA; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..1596
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5014140901"
FT   DOMAIN          38..213
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          493..1073
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1089..1428
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1461..1580
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        593..608
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        656..674
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        699..719
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        728..757
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        807..830
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1018..1039
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1190..1204
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1288..1309
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1405..1422
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1558..1575
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1596 AA;  158136 MW;  A4E55B9D7F7AE47B CRC64;
     MAGLRGNAMA GLLWMLLLWS GGGSCQAQRA GCKSVHYDLV FLLDTSSSVG KEDFEKVRQW
     VANLVDTFEV GPDRTRVGVV RYSDRPTTAF ELGLFGSQEE VKAAARRLAY HGGNTNTGDA
     LRYITARSFS PRAGGRPRDR AYKQVAILLT DGRSQDLVLD AAAAAHRAGI RIFAVGVGEA
     LKEELEEIAS EPKSAHVFHV SDFNAIDKIR GKLRRRLCEN VLCPSVRVEG DRFKHTNGGT
     KEITGFDLMD LFSVKEILGK RENGAQSSYV RMGSFPVVQS TEDVFPQGLP DEYAFVTTFR
     FRKTSRKEDW YIWQVIDQYG IPQVSIRLDG ENKAVEYNAV GAMKDAVRVV FRGSRVNDLF
     DRDWHKMALS IQAQNVSLHI DCALVQTLPI EERENIDIQG KTVIGKRLYD SVPIDFDLQR
     IVIYCDSRHA ELETCCDIPS GPCQVTVVTQ PPPPPPPQRP PTPGSEQIGF LKTINCSCPA
     GEKGEMGVAG PMGLPGPKGD IGATGPVGAP GPKGEKGDVG IGPFGQGEKG EKGSLGLPGP
     PGRDGSKGMR GEPGELGEPG LPGEVGMRGP QGPPGLPGPP GHVSAPGLQG ERGEKGTRGE
     KGERGLDGIP GKPGDTGQQG RPGPSGVAGP QGEKGDVGPA GPPGVPGSVG APGPRGHQGP
     PGPPGAPGPI GPEGRDGPPG LQGLRGKKGD MGPPGIPGLL GLQGPPGPPG VPGPPGLGGS
     PGLPGEIGFP GKPGPPGPTG PPGKDGPNGP PGPPGTKGEP GERGEDGLPG KPGLRGEIGE
     QGLAGRPGEK GEAGLPGAPG FPGVRGEKGD QGEKGELGLP GLKGDRGEKG EAGPAGPPGL
     PGTTSLFTPH PRMPGEQGPK GEKGDPGLPG EPGLQGRPGE LGPRGPAGPP GAKGQEGAHG
     APGAAGNPGA PGHVGPPGPS GPPGSVGAPG LRGPPGKDGE RGEKGAAGEE GSPGPVGPRG
     DPGAPGLPGP PGKGKDGEPG LRGSPGLPGT LGTKGDRGAP GIPGSPGSRG DPGIGVAGPP
     GPSGPPGDKG PPGSRGLPGF PGPQGPAGRD GAPGNPGERG PPGKPGLSSL LSPGDINLLA
     KDVCNDCPPG PPGLPGLPGF KGDKGVPGKP GREGTGGKKG EAGPPGLPGP PGIAGPQGSQ
     GERGADGEVG QKGDQGHPGV PGFMGPPGNP GPPGADGIAG AAGPPGIQGS PGKEGPPGPQ
     GPSGLPGIPG EEGKEGRDGK PGPPGEPGKA GEPGLPGPEG ARGPPGFKGH TGDSGAPGPR
     GEPGAMGPPG QEGLPGKDGD TGPTGPQGPQ GPRGPPGKNG SPGSPGEPGP SGTPGQKGSK
     GENGSPGLPG FLGPRGPPGE PGEKGVPGKE GVPGKPGEPG FKGERGDPGI KGDKGPPGGK
     GQPGDPGIPG HKGHTGLMGP QGPPGENGPV GPPGPPGQPG FPGLRGESPS METLRRLIQE
     ELGKQLETRL AYLLAQMPPA YMKSSQGRPG PPGPPGKDGL PGRAGPMGEP GRPGQRGLEG
     PSGPIGPKGE RGAKGDPGAP GVGLRGEMGP PGIPGQPGEP GYAKDGLPGI PGPQGETGPA
     GHPGPPGPPG PPGQCDPSQC AYFASLAARP GNVKGP
//