ID L5JRU9_PTEAL Unreviewed; 1459 AA.
AC L5JRU9;
DT 06-MAR-2013, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE SubName: Full=Collagen alpha-1(I) chain {ECO:0000313|EMBL:ELK01657.1};
GN ORFNames=PAL_GLEAN10019718 {ECO:0000313|EMBL:ELK01657.1};
OS Pteropus alecto (Black flying fox).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Megachiroptera; Pteropodidae;
OC Pteropodinae; Pteropus.
OX NCBI_TaxID=9402 {ECO:0000313|EMBL:ELK01657.1, ECO:0000313|Proteomes:UP000010552};
RN [1] {ECO:0000313|Proteomes:UP000010552}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23258410; DOI=10.1126/science.1230835;
RA Zhang G., Cowled C., Shi Z., Huang Z., Bishop-Lilly K.A., Fang X.,
RA Wynne J.W., Xiong Z., Baker M.L., Zhao W., Tachedjian M., Zhu Y., Zhou P.,
RA Jiang X., Ng J., Yang L., Wu L., Xiao J., Feng Y., Chen Y., Sun X.,
RA Zhang Y., Marsh G.A., Crameri G., Broder C.C., Frey K.G., Wang L.F.,
RA Wang J.;
RT "Comparative analysis of bat genomes provides insight into the evolution of
RT flight and immunity.";
RL Science 339:456-460(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB031150; ELK01657.1; -; Genomic_DNA.
DR RefSeq; XP_006924855.1; XM_006924793.2.
DR STRING; 9402.L5JRU9; -.
DR GeneID; 102886463; -.
DR KEGG; pale:102886463; -.
DR CTD; 1277; -.
DR eggNOG; KOG3544; Eukaryota.
DR InParanoid; L5JRU9; -.
DR OrthoDB; 2970887at2759; -.
DR Proteomes; UP000010552; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 11.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:ELK01657.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000010552};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1459
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003968430"
FT DOMAIN 34..92
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1224..1459
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 96..1212
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 118..152
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 178..221
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..428
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 883..897
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1171..1190
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1459 AA; 138385 MW; 798D1541C7A80311 CRC64;
MFSFVDLRLL LLLAATALLT HGQEEGQEED IPPVTCVQNG LRYHDREVWK PEVCRICVCD
NGNVLCDDVI CEDTKNCPGA VAPPGECCPV CPDGEVSPTD QETTGVEGPK GDTGPRGPRG
PSGPPGRDGI PGQPGLPGPP GPPGPPGPPG LGGNFAPQMS YGYDEKSAGV SVPGPMGPSG
PRGLPGPPGA PGPQGFQGPP GEPGEPGASG PMGPRGPPGP PGKNGDDGEA GKPGRPGERG
PPGPQGARGL PGTAGLPGMK GHRGFSGLDG AKGDSGPAGP KGEPGSPGEN GAPGQMGPRG
LPGERGRPGA PGPAGARGND GATGAAGPPG PTGPAGPPGF PGAVGAKGEA GPQGSRGSEG
PQGVRGEPGP PGPAGAAGPA GNPGADGQPG AKGANGAPGI AGAPGFPGAR GPSGPQGPGG
PPGPKGNSGE PGAPGNKGDA GAKGEPGPTG IQGPPGPAGE EGKRGARGEP GPSGLPGPPG
ERGGPGSRGF PGADGVAGPK GPAGERGSPG PAGPKGSPGE AGRPGEAGLP GAKGLTGSPG
SPGPDGKTGP AGPAGQDGRP GPPGPPGARG QAGVMGFPGP KGAAGEPGKA GERGVPGPPG
AVGAAGKDGE AGAQGPPGPA GPAGERGEQG PAGSPGFQGL PGPSGPPGEA GKPGEQGVPG
DLGAPGPSGA RGERGFPGER GVQGPPGPAG PRGANGAPGN DGAKGDAGAP GAPGSQGAPG
LQGMPGERGA AGLPGPKGDR GDAGPKGADG APGKDGVRGL TGPIGPPGPA GAPGDKGESG
PSGPAGPTGA RGAPGDRGEP GPPGPAGFAG PPGADGQPGA KGEPGDAGAK GDAGPAGPAG
PAGPPGPIGN VGAPGPKGAR GSAGPPGATG FPGAAGRVGP PGPSGNAGPP GPPGPVGKEG
GKGPRGETGP AGRPGEAGPP GPPGPAGEKG SPGADGPAGA PGTPGPQGIA GQRGVVGLPG
QRGERGFPGL PGPSGEPGKQ GPSGTSGERG PPGPMGPPGL AGPPGESGRE GSPGAEGSPG
RDGSPGPKGD RGETGPAGAP GAPGAPGAPG PVGPAGKSGD RGETGPAGPA GPVGPVGARG
PTGPQGPRGD KGETGEQGDR GIKGHRGFSG LQGPPGPPGS PGEQGPSGAS GPAGPRGPPG
SAGAAGKDGL NGLPGPIGPP GPRGRTGDAG PVGPPGPPGP PGPPGPPSGG FDFSFLPQPP
QEKAHDGGRY YRADDANVVR DRDLEVDTTL KSLSQQIENI RSPEGSRKNP ARTCRDLKMC
HSDWNSGEYW IDPNQGCNLD AIKVFCNMET GETCVYPTQP TVAQKNWYIS KNPKEKKHVW
YGESMTGGFQ FEYGGQGSDP ADVAIQLTFL RLMSTEASQN ITYHCKNSVA YMDQHTGNLK
KSLLLQGSNE IELRAEGNSR FTYTVTYDGC TSHTGAWGKT VIEYKTTKTS RLPIIDVAPL
DVGAPDQEFG IDIAPVCFL
//