ID A0A091W5P3_OPIHO Unreviewed; 1789 AA.
AC A0A091W5P3;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Collagen alpha-1(XI) chain {ECO:0000313|EMBL:KFR10902.1};
DE Flags: Fragment;
GN ORFNames=N306_09561 {ECO:0000313|EMBL:KFR10902.1};
OS Opisthocomus hoazin (Hoatzin) (Phasianus hoazin).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Opisthocomiformes; Opisthocomidae;
OC Opisthocomus.
OX NCBI_TaxID=30419 {ECO:0000313|EMBL:KFR10902.1, ECO:0000313|Proteomes:UP000053605};
RN [1] {ECO:0000313|EMBL:KFR10902.1, ECO:0000313|Proteomes:UP000053605}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N306 {ECO:0000313|EMBL:KFR10902.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK734785; KFR10902.1; -; Genomic_DNA.
DR STRING; 30419.A0A091W5P3; -.
DR PhylomeDB; A0A091W5P3; -.
DR Proteomes; UP000053605; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR CDD; cd00110; LamG; 1.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 10.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KFR10902.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000053605};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1560..1788
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 248..293
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 407..480
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 499..574
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 602..1545
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 250..269
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 430..447
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 677..699
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 754..768
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 898..925
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1109..1123
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1200..1214
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1274..1307
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1325..1346
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1409..1423
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1478..1492
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1512..1528
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFR10902.1"
FT NON_TER 1789
FT /evidence="ECO:0000313|EMBL:KFR10902.1"
SQ SEQUENCE 1789 AA; 178822 MW; 375DB8BDBD920D86 CRC64;
FSLSAEPVDV LKALEFHNSP EGVSRTAGFC TNRRDSKGSD VAYRVSKTAQ LSAPTKQLYP
GGDFPEDFSI LITVKPKKGI QSFLLSVYNE QGIQQVGVEV GRSPVFLFED QNGKPAPEEY
PLFRTVNIAD GKWHRVAISV EKKSVTMIVD CNKKTTKPLE RSDKTVVDTN GITVFGTRIL
DEEVFEGEIQ QLLIIADPRA AYDYCEHYSP DCDSPVPSAA QAQDPQVDET RYKEDIFYRE
GGETLEQIAQ RGSGGPSPGN IQGQVGQDSE QPGLVEDGPA HCREPQNGRD WKGPLWVIES
NSPAKAESPR AVFAEEYITG EDYDKKNEET EYGSRGVDLS ESDLLVDGDL GEYDFYEYKE
YEEKPTDSTN EEFGPGVPAE TDITETSVTG HGAYGEKGQK GEPAVIEPGM LIEGPPGPRG
PAGLTGPPGL QGPVGPPGDP GERGPPGRPG LPGADGLAGP PGTMLMLPFR FGGDGEKGPT
ISAQEAQAQA ILQQARIAMR GPPGPMGLTG RPGPVGGPGA AGAKGESGEP GPQGPRGVQG
APGPSGKAGK RGRPGADGAR GIPGEPGAKV NNNETGGFIS VLVWARKILC RCIILITSLI
LQGDRGPQGP PGLPGEDGSR GEDGEVGPRG LPGEAGPRGL LGPRGTPGPP GQPGIAGVDG
PSGPKGNMVR PPGQQGIPGP QGLPGPQGPI GPPGEKGPQG KPGLPGLPGS DGPPGHPGKE
GQSGDKGALG PPGPQGPIGY PGPRGVKGAD GVRGLKGSKG EKGEDGFPGF KGDMGLKGDR
GEVGQPGPRG EDGPEGPKGR AGPSGDPGAA GPAGEKGKLG VPGLPGYPGR QGPKGSTGFP
GFPGANGEKG ARGLHGKPGP RGQRGPTGPR GSRGPRGPTG KPGPKGTAGN DGPAGPPGER
GPQGPQGPVG FPGPKGPPGP PGKDGLPGHP GQRGETGFQG KTGPPGPGGV VGPQGPTGET
GPIGERGHPG PPGPPGEQGL PGAAGKEGAK DEGGSPSPLH CPDGPAGLRG FPGERGLPGA
QGPAGLKGGE GPQGPPGPVG SPGERGTAGT AGPIGLPGRP GPQGPPGPAG EKGAPGEKGP
QGPAGRDGVQ GPVGLPGPAG PSGSPGEDGD KGEIGEPGQK GSKGDKGENG PPGPPGLQGP
VGAPGIAVRD GEPGPRGQQG MFGQKGDEGP RGFPGPPGPI GLQGLPGPPG EKGENGDVGP
MGPPGPPGPR GPQGPNGADG PQGPPGSIGS VGGVGEKGEP GEAGNPGPPG ESGTSGPKGE
RGEKGEAGPP GAAGPPGAKG PPGDDGPKGN PGPVGFPGDP GPPGEPGPAG QDGVGGEKGE
DGDPGQPGPP GPSGEAGPPG PPGKRGPPGA TGAEGRQGEK GAKGEPGAEG APGKTGPVGP
QGPAGKPGPE GLRGIPGPVG EQGLPGAPGQ DGPPGPLGPP GLPGLKGDPG SKGEKGHPGL
IGLIGPPGEQ GEKGDRGLPG PQGSPGAKGD AGISGPAGPL GPPGPPGLPG PQGPKGSKGS
SGPAGQKGDS GLPGPPGPPG PPGEVIQPLP IQSPKKTRRS PDYMLSDAGD NILDYSDGME
EIFGSLNSLK QDIEHMKYPM GTQNNPARTC KDLQLCHPDF PDGEYWIDPN QGCSGDSFKV
YCNFTAGGET CIYPDKKSEG VRISSWPKEN PGSWFSEFKR GKLLSYLDVE GNSINMVQMT
FLKLLSASAR QNFTYNCHQS VAWHDASSDS YDKALRFLGS NDEEMSYDNN PYIKALHDGC
AARKGYAKTV IEINTPKIDQ VPIVDVMIND FGDQNQKFGF EVSPVCFLG
//