GenomeNet

Database: UniProt
Entry: A0A096NYZ3_PAPAN
LinkDB: A0A096NYZ3_PAPAN
Original site: A0A096NYZ3_PAPAN 
ID   A0A096NYZ3_PAPAN        Unreviewed;      1626 AA.
AC   A0A096NYZ3;
DT   26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT   25-MAY-2022, sequence version 2.
DT   27-MAR-2024, entry version 55.
DE   SubName: Full=Collagen type XXII alpha 1 chain {ECO:0000313|Ensembl:ENSPANP00000018309.2};
GN   Name=COL22A1 {ECO:0000313|Ensembl:ENSPANP00000018309.2};
OS   Papio anubis (Olive baboon).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC   Cercopithecidae; Cercopithecinae; Papio.
OX   NCBI_TaxID=9555 {ECO:0000313|Ensembl:ENSPANP00000018309.2, ECO:0000313|Proteomes:UP000028761};
RN   [1] {ECO:0000313|Ensembl:ENSPANP00000018309.2, ECO:0000313|Proteomes:UP000028761}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Liu Y.L., Abraham K.A., Akbar H.A., Ali S.A., Anosike U.A., Aqrawi P.A.,
RA   Arias F.A., Attaway T.A., Awwad R.A., Babu C.B., Bandaranaike D.B.,
RA   Battles P.B., Bell A.B., Beltran B.B., Berhane-Mersha D.B., Bess C.B.,
RA   Bickham C.B., Bolden T.B., Carter K.C., Chau D.C., Chavez A.C.,
RA   Clerc-Blankenburg K.C., Coyle M.C., Dao M.D., Davila M.L.D.,
RA   Davy-Carroll L.D., Denson S.D., Dinh H.D., Fernandez S.F., Fernando P.F.,
RA   Forbes L.F., Francis C.F., Francisco L.F., Fu Q.F., Garcia-Iii R.G.,
RA   Garrett T.G., Gross S.G., Gubbala S.G., Hirani K.H., Hogues M.H.,
RA   Hollins B.H., Jackson L.J., Javaid M.J., Jhangiani S.J., Johnson A.J.,
RA   Johnson B.J., Jones J.J., Joshi V.J., Kalu J.K., Khan N.K., Korchina V.K.,
RA   Kovar C.K., Lago L.L., Lara F.L., Le T.-K.L., Lee S.L., Legall-Iii F.L.,
RA   Lemon S.L., Liu J.L., Liu Y.-S.L., Liyanage D.L., Lopez J.L.,
RA   Lorensuhewa L.L., Mata R.M., Mathew T.M., Mercado C.M., Mercado I.M.,
RA   Morales K.M., Morgan M.M., Munidasa M.M., Ngo D.N., Nguyen L.N.,
RA   Nguyen T.N., Nguyen N.N., Obregon M.O., Okwuonu G.O., Ongeri F.O.,
RA   Onwere C.O., Osifeso I.O., Parra A.P., Patil S.P., Perez A.P., Perez Y.P.,
RA   Pham C.P., Pu L.-L.P., Puazo M.P., Quiroz J.Q., Rouhana J.R., Ruiz M.R.,
RA   Ruiz S.-J.R., Saada N.S., Santibanez J.S., Scheel M.S., Schneider B.S.,
RA   Simmons D.S., Sisson I.S., Tang L.-Y.T., Thornton R.T., Tisius J.T.,
RA   Toledanes G.T., Trejos Z.T., Usmani K.U., Varghese R.V., Vattathil S.V.,
RA   Vee V.V., Walker D.W., Weissenberger G.W., White C.W., Williams A.W.,
RA   Woodworth J.W., Wright R.W., Zhu Y.Z., Han Y.H., Newsham I.N.,
RA   Nazareth L.N., Worley K.W., Muzny D.M., Rogers J.R., Gibbs R.G.;
RT   "Whole Genome Assembly of Papio anubis.";
RL   Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSPANP00000018309.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 9555.ENSPANP00000018309; -.
DR   Ensembl; ENSPANT00000007142.3; ENSPANP00000018309.2; ENSPANG00000007983.3.
DR   eggNOG; KOG1217; Eukaryota.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000159308; -.
DR   HOGENOM; CLU_003584_0_0_1; -.
DR   OMA; PQVNCSC; -.
DR   OrthoDB; 2883115at2759; -.
DR   Proteomes; UP000028761; Chromosome 8.
DR   Bgee; ENSPANG00000007983; Expressed in arcuate nucleus of hypothalamus and 36 other cell types or tissues.
DR   ExpressionAtlas; A0A096NYZ3; baseline.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR37456:SF3; COLLAGEN ALPHA-1(XXV) CHAIN; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01391; Collagen; 14.
DR   Pfam; PF00092; VWA; 1.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF53300; vWA-like; 1.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR   PROSITE; PS50234; VWFA; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000028761};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..1626
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5035233926"
FT   DOMAIN          38..213
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          492..1000
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1018..1458
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1490..1610
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        593..608
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        666..684
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        708..731
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        738..767
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        817..840
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1048..1069
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1220..1234
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1318..1339
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1435..1452
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1588..1605
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1626 AA;  161256 MW;  0A2E2CFFAE8762F2 CRC64;
     MVGLRGNAVA GLLWMLLLWS GGGGCQAQRA GCKSVHYDLV FLLDTSSSVG KEDFEKVRQW
     VANLVDTFEV GPDRTRVGVV RYSDRPTTAF ELGLFGSREE VKAAARRLSY HGGNTNTGDA
     LRYITALSFS PRAGGRPGDR AYKQVAILLT DGRSQDLVLD AAAAAHRAGI RIFAVGVGEA
     LKEELEEIAS EPKSAHVFHV SDFNAIDKIR GKLRRRLCEN VLCPSVRVEG DRFKHTNGGT
     KEITGFDLMD LFSVKEILGK RENGVQSSYV RMGSFPVVQS TEEVFPQGLP DEYAFVTTFR
     FRRTSRKEDW YIWQVIDQYG IPQVSIRLDG ENKAVEYNAV GAMKDAVRVV FRGSRVNDLF
     DRDWHKIALS IQAQNVSLYI DCALVQTLPI EERENIDIQG KTVIGKRLYD SVPIDFDLQR
     IVIYCDSRHA ELETCCDIPS GPCQVTVVTE PPPPPPPQRP PTPGSEQIGF LKTINCSCPA
     GEKGEMGFAG PIGLPGPKGD TGATGPVGAP GPKGEKGDVG IGPFGRGEKG EKGSLGLPGP
     PGRDGSKGMR GEPGELGEPG LPGEVGMRGP QGPPGLPGPP GHVGAPGLQG ERGEKGTRGE
     KGERGLDGFP GKPGDAGQQG RPGPPGVAGP QGEKGDVGPA GLPGVPGSVV QREGLKGEQG
     APGPRGHQGP PGPPGAPGPI GPEGRDGPPG LQGLRGKKGD VGPPGIPGSL GPQGPPGPPG
     VPGPPGPGGS PGLPGEIGFP GKPGPPGPAG HPGKDGPNGP PGPPGTKGEP GERGEDGLPG
     KPGFRGETGE QGLAGRPGEK GEAGLPGAPG FPGVRGEKGD QGEKGELGLP GLKGDRGEKG
     EAGPAGPPGL PGTTSLFTPH PRMPGEQGPK GEKGDPGMPG EPGPQGRPGE LGPRGPAGPP
     GAKGQEGAHG APGAAGNPGA PGHVGPPGPS GPPGSVGAPG LRGPPGKDGE RGEKGSAGEE
     GSPGPVGPRG DPGVPGLPGP PGKGKDGEPG LRGSPGLPGP LGIKAACRKF RGSDNCALGG
     QHVKGDRGAP GIPGSPGSRG DPGVGVAGPP GPSGPPGDKG PPGSRGLPGF PGPQGPAGRD
     GTPGNPGERG PPGKPGLSSL LSPGDINLLA KDVCSDCPPG PPGLPGLPGF KGDKGLPGKP
     GKEGTEGKKG EPGPPGLPGP PGIAGPQGSQ GERGTDGEVG QKGDQGHPGV PGFMGPPGNP
     GPPGADGIAG AAGPPGIQGS PGKEGPPGPQ GPSGLPGIPG EEGKEGRDGK PGPPGEPGKA
     GEPGLPGPEG ARGPPGFKGH TGDSGAPGPR GEPGAMGPPG QEGLPGKDGD TGPTGPQGPQ
     GPRGPPGKNG SPGSPGEPGP SGTPGQKGSK GENGSPGLPG FLGPRGPPGE PGEKGVPGKE
     GVPGKPGEPG FKGERGDPGI KGDKGPPGGK GQPGDPGIPG HKGHTGLMGP QGPPGENGPA
     GPPGPPGQPG FPGLRGESPS METLRRLIQE ELGKQLETKL AYLLAQMPPA HMKASQGRPG
     PPGPPGKDGL PGRAGPMGEP GRPGQGGLEG PSGPIGPKGE RGAKGDPGAP GVGLRGEMGP
     PGIPGQPGEP GYAKDGLPGI PGPQGETGPA GHPGPPGPPG PPGQCDPSQC AYFASLAARP
     GNVKGP
//
DBGET integrated database retrieval system