GenomeNet

Database: UniProt
Entry: H2PZI3_PANTR
LinkDB: H2PZI3_PANTR
Original site: H2PZI3_PANTR 
ID   H2PZI3_PANTR            Unreviewed;      1806 AA.
AC   H2PZI3; A0A2J8MHR2;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   27-MAR-2024, entry version 71.
DE   SubName: Full=Collagen type XI alpha 1 chain {ECO:0000313|Ensembl:ENSPTRP00000001750.3};
GN   Name=COL11A1 {ECO:0000313|Ensembl:ENSPTRP00000001750.3,
GN   ECO:0000313|VGNC:VGNC:10666};
OS   Pan troglodytes (Chimpanzee).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Pan.
OX   NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000001750.3, ECO:0000313|Proteomes:UP000002277};
RN   [1] {ECO:0000313|Ensembl:ENSPTRP00000001750.3, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136131; DOI=10.1038/nature04072;
RG   Chimpanzee sequencing and analysis consortium;
RT   "Initial sequence of the chimpanzee genome and comparison with the human
RT   genome.";
RL   Nature 437:69-87(2005).
RN   [2] {ECO:0000313|Ensembl:ENSPTRP00000001750.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AACZ04027422; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04027423; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04027424; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04027425; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_001140143.1; XM_001140143.3.
DR   PaxDb; 9598-ENSPTRP00000001750; -.
DR   Ensembl; ENSPTRT00000001907.4; ENSPTRP00000001750.3; ENSPTRG00000001017.6.
DR   GeneID; 457065; -.
DR   CTD; 1301; -.
DR   VGNC; VGNC:10666; COL11A1.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000154535; -.
DR   HOGENOM; CLU_001074_2_1_1; -.
DR   OrthoDB; 2970887at2759; -.
DR   TreeFam; TF323987; -.
DR   Proteomes; UP000002277; Chromosome 1.
DR   Bgee; ENSPTRG00000001017; Expressed in bone marrow and 11 other cell types or tissues.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   CDD; cd00110; LamG; 1.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF42; COLLAGEN ALPHA-1(XI) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   Pfam; PF02210; Laminin_G_2; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          1577..1805
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          438..508
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          527..1545
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        458..484
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        680..714
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        772..786
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        907..943
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1126..1140
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1217..1231
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1291..1324
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1342..1363
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1493..1509
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1529..1543
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1806 AA;  181078 MW;  A2A2A6B1A518D374 CRC64;
     MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF
     CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN
     EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV
     DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS
     PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESITEGPTV TEETIAQTEA
     NIVDDFQEYN YGTMESYQTE APRRVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY
     ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG
     AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP
     GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP
     GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR
     GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP
     GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP
     GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGYPG PRGVKGADGV RGLKGSKGEK
     GEDGFPGFKG DMGLKGDRGE VGQIGPRGED GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP
     GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP
     GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG KDGLPGHPGQ RGETGFQGKT
     GPPGPGGVVG PQGPTGETGP IGERGHPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD
     GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLPGRPGPQ
     GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK
     GDKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ
     GLPGPPGEKG ENGDVGPMGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA
     GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP
     GEPGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK
     GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP
     GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP
     GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG
     MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG
     EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL
     LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE
     EMSYDNNPFI KALYDGCASR KGYEKTVIEI NTPKIDQVPI VDVMINDFGD QNQKFGFEVG
     PVCFLG
//
DBGET integrated database retrieval system