ID A0A087XJE9_POEFO Unreviewed; 1797 AA.
AC A0A087XJE9;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 27-MAR-2024, entry version 53.
DE SubName: Full=Collagen, type XI, alpha 2 {ECO:0000313|Ensembl:ENSPFOP00000005902.2};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000005902.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000005902.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01010108; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 48698.ENSPFOP00000005902; -.
DR Ensembl; ENSPFOT00000005911.2; ENSPFOP00000005902.2; ENSPFOG00000003376.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000159762; -.
DR OMA; TCLYPSV; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF509; COLLAGEN ALPHA-2(XI) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 5.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1568..1796
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 249..489
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 505..1536
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..263
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 305..320
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..440
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 631..646
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 751..765
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 907..922
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1049..1065
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1105..1120
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1196..1210
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1508..1524
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1797 AA; 178936 MW; ED8674C410C21470 CRC64;
SSSSDPVDVL RALQVPSLPE GVKKVPGFCT SRRSSSPDHA YRITKKAQIS APTKQLFSGR
FPENFSIMTL IKPQAGLQAF LLSIYSEQGI QQLGIELGRS PVFLYEDQHG KPAPEDYPLF
RGINLADGKW HRIAFSVSKK NVTLLLDCKK KMTRPLARGN NAEVDTNGIT VFGARLLDEE
VFQGDIQQLL IASNPQAAYD FCEHYSPDCD SPLPKTQAQD PNTYYFDEEQ DDHEYPYYYE
VNANGKATTE NAATVTSKPK TTKAPKPTKP APTKMAKPAA TKGSVGKSKD VVNKVPLVKP
TIGKPQKVSN PTKATSPSPA IRPTSPKPTK PKTTDNDVKS QHRPFPPFDK AALSGSEVKK
VTNGFQQKDY SDPVPETREA DGYMGPALSA VTDEGGTGIS GQKGEKGEPA VLEPGMLIEG
PPGPEGPAGL PGPSGPSGPP GSVGDPGERG PPGRAGLPGA DGVPGPPGTS VMLPFRFGQS
GGDKGPVVSA QEAQAAAILS QARMALKGPP GPMGFTGRPG PLGNPGSPGL KGESGDPGSQ
GPRGPQGLMG PPGKSGRRGR AGADGARGMP GEPGTKGDRG FDGLPGLPGD KGHRGDPGPM
GLQGSPGEDG ERGDDGDVGP RGLPGEPGPR GLLGPKGPPG IPGPPGVRGN DGPHGPKGNL
GPQGEPGPPG QQGTPGTQGM PGPQGAIGPP GEKGPTGKPG LPGMPGADGP PGHPGKEGPS
GTKGNQGPNG PQGAIGYPGP RGIKGAQGIR GLKGHKGEKG EDGFPGIKGD FGVKGERGEI
GVPGPRGEDG PEGPKGRVGP PGELGPLGLA GEKGKLGVPG LPGYPGRQGI KGSLGFPGFP
GSNGEKGTRG VSGKQGPRGQ RGPTGPRGQR GPRGATGKPG AKGTSGSDGP PGPLGERGLP
GPQGANGFPG PKGPPGPPGK DGLPGHPGQR GEVGFQGKVG PPGPPGVVGP QGPSGETGPM
GERGHPGPPG PPGEQGLPGQ SGKEGTKGDP GPPGGPGKDG PPGLRGFPGE RGLPGTPGGG
GLKGGEGPAG PPGPAGSPGE RGPAGTAGPV GPPGRPGPQG PPGPAGEKGV PGEKGPIGPA
GRDGVQGPVG LPGPAGSPGV PGEDGDKGEV GEHGQKGGKG AKGEHGPPGP PGPMGPVGQP
GPAGADGELG PRGQQGPFGA KGDDGTRGFP GAPGPIGLQG LPGPPGEKGE TGDVGPMGPP
GPPGPRGPAG PNGADGPQGP PGGLGNPGPL GEKGEPGEAG PPGVGGEPGK KGPRGERGEK
GEAGQPGTAG PAGGRGRPGD DGPKGNPGPV GFPGDPGPPG EVGPRGQDGA KGERGEDGEQ
GESGSPGPPG ENGPPGPPGK RGPAGTKGAE GRQGEKGTKG DPGAVGPPGK TGPVGPQGQP
GKPGTEGLRG LPGSVGEQGA PGAAGQKGPP GPMGPPGLPG LRGEPGAKGE KGHQGLIGLI
GPPGEQGEKG DRGLPGPQGS SGPKGEPGMA GGTGPLGPAG PPGLPGPQGV KGAKGATGGS
GPKGEKGVQG PPGPPGPPGD VIQPMPIQRS PKSKRSIDAS QLLPEFDPDM PASDTAGAEF
LMGSEGMEEI FGSLNSLRQE IETMRFPLGT QDSPARTCQD LHLSQPELKD GEYWIDPNQG
CSRDSFKVLC NFTSGETCLQ PRTGIDSVKM STWTTETPGS WYSQFSSGSK FSYVDSNGAP
VGVVQLGFLR LLSVQARQNL TYHCHRSVAW ADRSAKNNHK RALHFRGAND EELSYETNPY
IKALIDGCSY RKGFDRTVLE INTPQLEHLP LLDIKVTDFG ESNQQFGFEV GPVCFQG
//