ID L5KB07_PTEAL Unreviewed; 744 AA.
AC L5KB07;
DT 06-MAR-2013, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE SubName: Full=Collagen alpha-1(VIII) chain {ECO:0000313|EMBL:ELK08537.1};
GN ORFNames=PAL_GLEAN10021348 {ECO:0000313|EMBL:ELK08537.1};
OS Pteropus alecto (Black flying fox).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Megachiroptera; Pteropodidae;
OC Pteropodinae; Pteropus.
OX NCBI_TaxID=9402 {ECO:0000313|EMBL:ELK08537.1, ECO:0000313|Proteomes:UP000010552};
RN [1] {ECO:0000313|Proteomes:UP000010552}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23258410; DOI=10.1126/science.1230835;
RA Zhang G., Cowled C., Shi Z., Huang Z., Bishop-Lilly K.A., Fang X.,
RA Wynne J.W., Xiong Z., Baker M.L., Zhao W., Tachedjian M., Zhu Y., Zhou P.,
RA Jiang X., Ng J., Yang L., Wu L., Xiao J., Feng Y., Chen Y., Sun X.,
RA Zhang Y., Marsh G.A., Crameri G., Broder C.C., Frey K.G., Wang L.F.,
RA Wang J.;
RT "Comparative analysis of bat genomes provides insight into the evolution of
RT flight and immunity.";
RL Science 339:456-460(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB030886; ELK08537.1; -; Genomic_DNA.
DR RefSeq; XP_006916606.1; XM_006916544.2.
DR AlphaFoldDB; L5KB07; -.
DR STRING; 9402.L5KB07; -.
DR GeneID; 102890586; -.
DR KEGG; pale:102890586; -.
DR CTD; 1295; -.
DR eggNOG; ENOG502QRFR; Eukaryota.
DR InParanoid; L5KB07; -.
DR OrthoDB; 5267071at2759; -.
DR Proteomes; UP000010552; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.40; -; 1.
DR InterPro; IPR001073; C1q_dom.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR PANTHER; PTHR15427:SF33; EMI DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR15427; EMILIN ELASTIN MICROFIBRIL INTERFACE-LOCATED PROTEIN ELASTIN MICROFIBRIL INTERFACER; 1.
DR Pfam; PF00386; C1q; 1.
DR Pfam; PF01391; Collagen; 4.
DR PRINTS; PR00007; COMPLEMNTC1Q.
DR SMART; SM00110; C1Q; 1.
DR SUPFAM; SSF49842; TNF-like; 1.
DR PROSITE; PS50871; C1Q; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:ELK08537.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000010552};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..744
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003968952"
FT DOMAIN 611..744
FT /note="C1q"
FT /evidence="ECO:0000259|PROSITE:PS50871"
FT REGION 97..587
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 125..139
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 192..206
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 255..269
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 284..307
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 385..407
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 501..531
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 554..583
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 744 AA; 73359 MW; 6D138A38B93387A1 CRC64;
MAVPPAPLQL LGVLLTISLG SVRLVQAGAY YGIKPLPQIP AQIPPQIPQY QPLGQQVPHM
PLSKDGLTMG KELPHMQYGK EYPHLPQYRK EIQPVPRMGK EAAPKKGKVE TPLASLRGEQ
GPRGEPGPRG PPGPPGLPGH GIPGIKGKPG PQGYPGIGKP GMPGMPGKPG AMGMPGAKGE
IGPKGEIGPM GIPGPQGPPG PHGLPGIGKP GGPGLPGQPG AKGERGPKGP AGPPGLQGPK
GEKGFGMPGL PGLKGPPGMH GPPGPVGLPG VGKPGVTGFP GPQGPLGKPG PPGEPGPQGP
IGIPGVQGPP GIPGVGKPGQ DGIPGQPGFP GGKGEQGLPG LPGPPGLPGI GKPGFPGPKG
DRGIGGVPGA LGPRGEKGPV GAPGMGGPPG EPGLPGIPGP MGPPGAIGFP GPKGEGGVVG
PQGPPGPKGE PGLQGFPGKP GFPGEVGPPG IRGLPGPIGP KGEEGHKGLP GLPGAPGMLG
PKGEPGIPGD QGLQGPPGIP GIVGPSGPIG PPGIPGPKGE PGIPGPPGFP GVGKPGVAGL
HGPPGKPGAL GPQGQPGLPG PPGPPGPPGP PAVMPPTPPP HGEYLPDMGL GIDGVKPPHA
YGAKKGKNGG PAQEMPAFTA ELTAPFPPVG APVKFDKLLY NGRQNYNPQT GIFTCEVPGV
YYFVYHVHCK GGNVWVALFK NNEPMMYTYD EYKKGFLDQA SGSAVLLLRP GDRVFLQMPS
EQAAGLYAGQ YVHSSFSGYL LYPM
//