ID A0A096NYZ3_PAPAN Unreviewed; 1626 AA.
AC A0A096NYZ3;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 2.
DT 27-MAR-2024, entry version 55.
DE SubName: Full=Collagen type XXII alpha 1 chain {ECO:0000313|Ensembl:ENSPANP00000018309.2};
GN Name=COL22A1 {ECO:0000313|Ensembl:ENSPANP00000018309.2};
OS Papio anubis (Olive baboon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Papio.
OX NCBI_TaxID=9555 {ECO:0000313|Ensembl:ENSPANP00000018309.2, ECO:0000313|Proteomes:UP000028761};
RN [1] {ECO:0000313|Ensembl:ENSPANP00000018309.2, ECO:0000313|Proteomes:UP000028761}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Liu Y.L., Abraham K.A., Akbar H.A., Ali S.A., Anosike U.A., Aqrawi P.A.,
RA Arias F.A., Attaway T.A., Awwad R.A., Babu C.B., Bandaranaike D.B.,
RA Battles P.B., Bell A.B., Beltran B.B., Berhane-Mersha D.B., Bess C.B.,
RA Bickham C.B., Bolden T.B., Carter K.C., Chau D.C., Chavez A.C.,
RA Clerc-Blankenburg K.C., Coyle M.C., Dao M.D., Davila M.L.D.,
RA Davy-Carroll L.D., Denson S.D., Dinh H.D., Fernandez S.F., Fernando P.F.,
RA Forbes L.F., Francis C.F., Francisco L.F., Fu Q.F., Garcia-Iii R.G.,
RA Garrett T.G., Gross S.G., Gubbala S.G., Hirani K.H., Hogues M.H.,
RA Hollins B.H., Jackson L.J., Javaid M.J., Jhangiani S.J., Johnson A.J.,
RA Johnson B.J., Jones J.J., Joshi V.J., Kalu J.K., Khan N.K., Korchina V.K.,
RA Kovar C.K., Lago L.L., Lara F.L., Le T.-K.L., Lee S.L., Legall-Iii F.L.,
RA Lemon S.L., Liu J.L., Liu Y.-S.L., Liyanage D.L., Lopez J.L.,
RA Lorensuhewa L.L., Mata R.M., Mathew T.M., Mercado C.M., Mercado I.M.,
RA Morales K.M., Morgan M.M., Munidasa M.M., Ngo D.N., Nguyen L.N.,
RA Nguyen T.N., Nguyen N.N., Obregon M.O., Okwuonu G.O., Ongeri F.O.,
RA Onwere C.O., Osifeso I.O., Parra A.P., Patil S.P., Perez A.P., Perez Y.P.,
RA Pham C.P., Pu L.-L.P., Puazo M.P., Quiroz J.Q., Rouhana J.R., Ruiz M.R.,
RA Ruiz S.-J.R., Saada N.S., Santibanez J.S., Scheel M.S., Schneider B.S.,
RA Simmons D.S., Sisson I.S., Tang L.-Y.T., Thornton R.T., Tisius J.T.,
RA Toledanes G.T., Trejos Z.T., Usmani K.U., Varghese R.V., Vattathil S.V.,
RA Vee V.V., Walker D.W., Weissenberger G.W., White C.W., Williams A.W.,
RA Woodworth J.W., Wright R.W., Zhu Y.Z., Han Y.H., Newsham I.N.,
RA Nazareth L.N., Worley K.W., Muzny D.M., Rogers J.R., Gibbs R.G.;
RT "Whole Genome Assembly of Papio anubis.";
RL Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPANP00000018309.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9555.ENSPANP00000018309; -.
DR Ensembl; ENSPANT00000007142.3; ENSPANP00000018309.2; ENSPANG00000007983.3.
DR eggNOG; KOG1217; Eukaryota.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000159308; -.
DR HOGENOM; CLU_003584_0_0_1; -.
DR OMA; PQVNCSC; -.
DR OrthoDB; 2883115at2759; -.
DR Proteomes; UP000028761; Chromosome 8.
DR Bgee; ENSPANG00000007983; Expressed in arcuate nucleus of hypothalamus and 36 other cell types or tissues.
DR ExpressionAtlas; A0A096NYZ3; baseline.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR37456:SF3; COLLAGEN ALPHA-1(XXV) CHAIN; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 14.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000028761};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..1626
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5035233926"
FT DOMAIN 38..213
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 492..1000
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1018..1458
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1490..1610
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 593..608
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 666..684
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 708..731
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 738..767
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 817..840
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1048..1069
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1220..1234
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1318..1339
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1435..1452
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1588..1605
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1626 AA; 161256 MW; 0A2E2CFFAE8762F2 CRC64;
MVGLRGNAVA GLLWMLLLWS GGGGCQAQRA GCKSVHYDLV FLLDTSSSVG KEDFEKVRQW
VANLVDTFEV GPDRTRVGVV RYSDRPTTAF ELGLFGSREE VKAAARRLSY HGGNTNTGDA
LRYITALSFS PRAGGRPGDR AYKQVAILLT DGRSQDLVLD AAAAAHRAGI RIFAVGVGEA
LKEELEEIAS EPKSAHVFHV SDFNAIDKIR GKLRRRLCEN VLCPSVRVEG DRFKHTNGGT
KEITGFDLMD LFSVKEILGK RENGVQSSYV RMGSFPVVQS TEEVFPQGLP DEYAFVTTFR
FRRTSRKEDW YIWQVIDQYG IPQVSIRLDG ENKAVEYNAV GAMKDAVRVV FRGSRVNDLF
DRDWHKIALS IQAQNVSLYI DCALVQTLPI EERENIDIQG KTVIGKRLYD SVPIDFDLQR
IVIYCDSRHA ELETCCDIPS GPCQVTVVTE PPPPPPPQRP PTPGSEQIGF LKTINCSCPA
GEKGEMGFAG PIGLPGPKGD TGATGPVGAP GPKGEKGDVG IGPFGRGEKG EKGSLGLPGP
PGRDGSKGMR GEPGELGEPG LPGEVGMRGP QGPPGLPGPP GHVGAPGLQG ERGEKGTRGE
KGERGLDGFP GKPGDAGQQG RPGPPGVAGP QGEKGDVGPA GLPGVPGSVV QREGLKGEQG
APGPRGHQGP PGPPGAPGPI GPEGRDGPPG LQGLRGKKGD VGPPGIPGSL GPQGPPGPPG
VPGPPGPGGS PGLPGEIGFP GKPGPPGPAG HPGKDGPNGP PGPPGTKGEP GERGEDGLPG
KPGFRGETGE QGLAGRPGEK GEAGLPGAPG FPGVRGEKGD QGEKGELGLP GLKGDRGEKG
EAGPAGPPGL PGTTSLFTPH PRMPGEQGPK GEKGDPGMPG EPGPQGRPGE LGPRGPAGPP
GAKGQEGAHG APGAAGNPGA PGHVGPPGPS GPPGSVGAPG LRGPPGKDGE RGEKGSAGEE
GSPGPVGPRG DPGVPGLPGP PGKGKDGEPG LRGSPGLPGP LGIKAACRKF RGSDNCALGG
QHVKGDRGAP GIPGSPGSRG DPGVGVAGPP GPSGPPGDKG PPGSRGLPGF PGPQGPAGRD
GTPGNPGERG PPGKPGLSSL LSPGDINLLA KDVCSDCPPG PPGLPGLPGF KGDKGLPGKP
GKEGTEGKKG EPGPPGLPGP PGIAGPQGSQ GERGTDGEVG QKGDQGHPGV PGFMGPPGNP
GPPGADGIAG AAGPPGIQGS PGKEGPPGPQ GPSGLPGIPG EEGKEGRDGK PGPPGEPGKA
GEPGLPGPEG ARGPPGFKGH TGDSGAPGPR GEPGAMGPPG QEGLPGKDGD TGPTGPQGPQ
GPRGPPGKNG SPGSPGEPGP SGTPGQKGSK GENGSPGLPG FLGPRGPPGE PGEKGVPGKE
GVPGKPGEPG FKGERGDPGI KGDKGPPGGK GQPGDPGIPG HKGHTGLMGP QGPPGENGPA
GPPGPPGQPG FPGLRGESPS METLRRLIQE ELGKQLETKL AYLLAQMPPA HMKASQGRPG
PPGPPGKDGL PGRAGPMGEP GRPGQGGLEG PSGPIGPKGE RGAKGDPGAP GVGLRGEMGP
PGIPGQPGEP GYAKDGLPGI PGPQGETGPA GHPGPPGPPG PPGQCDPSQC AYFASLAARP
GNVKGP
//