ID H2PZI3_PANTR Unreviewed; 1806 AA.
AC H2PZI3; A0A2J8MHR2;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Collagen type XI alpha 1 chain {ECO:0000313|Ensembl:ENSPTRP00000001750.3};
GN Name=COL11A1 {ECO:0000313|Ensembl:ENSPTRP00000001750.3,
GN ECO:0000313|VGNC:VGNC:10666};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000001750.3, ECO:0000313|Proteomes:UP000002277};
RN [1] {ECO:0000313|Ensembl:ENSPTRP00000001750.3, ECO:0000313|Proteomes:UP000002277}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16136131; DOI=10.1038/nature04072;
RG Chimpanzee sequencing and analysis consortium;
RT "Initial sequence of the chimpanzee genome and comparison with the human
RT genome.";
RL Nature 437:69-87(2005).
RN [2] {ECO:0000313|Ensembl:ENSPTRP00000001750.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACZ04027422; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AACZ04027423; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AACZ04027424; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AACZ04027425; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_001140143.1; XM_001140143.3.
DR PaxDb; 9598-ENSPTRP00000001750; -.
DR Ensembl; ENSPTRT00000001907.4; ENSPTRP00000001750.3; ENSPTRG00000001017.6.
DR GeneID; 457065; -.
DR CTD; 1301; -.
DR VGNC; VGNC:10666; COL11A1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000154535; -.
DR HOGENOM; CLU_001074_2_1_1; -.
DR OrthoDB; 2970887at2759; -.
DR TreeFam; TF323987; -.
DR Proteomes; UP000002277; Chromosome 1.
DR Bgee; ENSPTRG00000001017; Expressed in bone marrow and 11 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR CDD; cd00110; LamG; 1.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF42; COLLAGEN ALPHA-1(XI) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 6.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1577..1805
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 438..508
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 527..1545
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 458..484
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 680..714
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 772..786
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 907..943
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1126..1140
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1217..1231
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1291..1324
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1342..1363
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1493..1509
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1529..1543
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1806 AA; 181078 MW; A2A2A6B1A518D374 CRC64;
MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV
DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESITEGPTV TEETIAQTEA
NIVDDFQEYN YGTMESYQTE APRRVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG
AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP
GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP
GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGYPG PRGVKGADGV RGLKGSKGEK
GEDGFPGFKG DMGLKGDRGE VGQIGPRGED GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG KDGLPGHPGQ RGETGFQGKT
GPPGPGGVVG PQGPTGETGP IGERGHPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD
GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLPGRPGPQ
GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK
GDKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ
GLPGPPGEKG ENGDVGPMGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP
GEPGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP
GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG
MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG
EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL
LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE
EMSYDNNPFI KALYDGCASR KGYEKTVIEI NTPKIDQVPI VDVMINDFGD QNQKFGFEVG
PVCFLG
//