ID A0A3Q1EE34_9TELE Unreviewed; 1484 AA.
AC A0A3Q1EE34;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Collagen alpha-1(II) chain-like {ECO:0000313|Ensembl:ENSAPOP00000001858.1};
OS Acanthochromis polyacanthus (spiny chromis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Pomacentridae; Acanthochromis.
OX NCBI_TaxID=80966 {ECO:0000313|Ensembl:ENSAPOP00000001858.1, ECO:0000313|Proteomes:UP000257200};
RN [1] {ECO:0000313|Ensembl:ENSAPOP00000001858.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 80966.ENSAPOP00000001858; -.
DR Ensembl; ENSAPOT00000014440.1; ENSAPOP00000001858.1; ENSAPOG00000003326.1.
DR GeneTree; ENSGT00940000155224; -.
DR InParanoid; A0A3Q1EE34; -.
DR Proteomes; UP000257200; Unplaced.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0007411; P:axon guidance; IEA:Ensembl.
DR GO; GO:0048706; P:embryonic skeletal system development; IEA:Ensembl.
DR GO; GO:0030903; P:notochord development; IEA:Ensembl.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 9.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000257200};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..1484
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018700547"
FT DOMAIN 34..92
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1251..1484
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 102..1232
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 159..173
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 347..361
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 428..442
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1198..1215
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1484 AA; 140684 MW; 73ABA590B039F846 CRC64;
MFSFVDSRTV LLLVASQVVL LSVVRCQGED DEAGGCIQDG QQYSDKDVWK PEPCRICVCD
SGAVLCDEII CEEIKECANP IIPSGECCPI CPADASSPIG TIGAKGQKGE PGDIADVVGP
KGPPGPMGPP GEQGARGETG AKGDKGNPGP RGRDGEPGTP GNPGPPGPPG PPGLGGNFAA
QMAGGFDEKA GGAQMGVMQG PMGPMGPRGP PGPTGSPGPQ GFQGGPGEAG EPGQAGPRGP
PGPSGKPGSD GEAGKPGKAG ERGPAGPQGA RGFPGTPGLP GIKGHRGHSG LDGAKGETGA
AGAKGEAGAA GENGAPGPMG PRGLPGERGR PGAAGAAGAR GNDGLPGPAG PPGPVGPAGA
PGFPGSPGAK GEAGPTGVRG AEGAQGPRGE AGTPGSPGPA GASGNPGTDG IPGAKGSAGA
PGIAGAPGFP GPRGPPGPQG ATGPLGPKGQ SGDPGLPGFK GEAGPKGELG PAGPQGAPGP
AGEEGKRGAR GEPGAAGPLG PPGERGAPGN RGFPGQDGLA GAKGVPGERG VAGAAGPKGG
SGDPGRTGEP GLPGARGLTG RPGDAGPQGK VGPSGAAGED GRPGPPGPQG ARGQPGVMGF
PGPKGANGEP GKPGEKGLVG RPGLRGLSGK DGETGPAGPP GPAGPVGERG EQGQPGPSGF
QGLPGPSGAP GEAGKPGDQG VPGEGGAPGA VGPRGERGFP GERGGAGAQG LQGPRGLPGT
PGSDGPKGAI GPAGAAGPQG PPGLQGMPGE RGAGGIPGPK GDRGDNGAKG LEGAPGKDGA
RGLTGPIGPP GPSGPNGAKG ETGPTGPIGT PGARGAPGDR GEGGPPGPAG FAGPPGADGQ
PGAKGEVGEG GQKGEAGAPG PQGPSGAPGP VGPTGVSGPK GARGAQGAPG ATGFPGAAGR
VGPPGPNGNP GAAGPAGPAG KDGPKGTRGD AGPPGRQGDG GLRGPAGPQG EKGEPGVDGP
PGADGPSGPQ GLAGSRGIVG LPGQRGERGF PGLPGPSGEP GKQGAPGSGG DRGPPGPVGP
PGLTGPAGEP GREGTPGSDG PPGRDGAVGN KLQGERGNTG PAGAPGAPGA PGAPGPVGPL
GKQGDRGEAG AQGPGGPPGP AGARGMAGPQ GPRGDKGEAG ETGERGQKGH RGFTGLQGLP
GPPGPAGDAG AAGPAGPSGA KGPPGPAGPA GKDGTNGQLG PIGPPGPRGR SGESGPAGPP
GNPGPPGPPG PPGPGIDMSA FAGLGQTEKS PDPLRYMRAD EASSSLRQHD VEVDSTLKSL
NNKIENLRSP DGSQKNPART CRDLKLCHPE WKSDYWVDPN IGSTADAIKV FCNMETGETC
VYPSIAQVPK KNWWTSKSKD RKHVWFGETM NGGFHFSYAE DGPAANAASV QLTFLRLLST
EASQNLTYHC KNSIGYMDGA TGNLKKALLL QGSNDVEIRA EGNSRFTYSV LEDGCKRHTG
RWGKTVFEYK TQKTSRLPIV DIAPMDIGGA DQEFGVHVGA VCFL
//