ID A0A3Q3X2N0_MOLML Unreviewed; 1415 AA.
AC A0A3Q3X2N0;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMOP00000016179.1};
OS Mola mola (Ocean sunfish) (Tetraodon mola).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Molidae; Mola.
OX NCBI_TaxID=94237 {ECO:0000313|Ensembl:ENSMMOP00000016179.1, ECO:0000313|Proteomes:UP000261620};
RN [1] {ECO:0000313|Ensembl:ENSMMOP00000016179.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSMMOT00000016449.1; ENSMMOP00000016179.1; ENSMMOG00000011805.1.
DR Proteomes; UP000261620; Unplaced.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 6.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000261620};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..1415
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018666657"
FT DOMAIN 35..93
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1181..1415
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 185..1151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 287..301
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1129..1145
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1415 AA; 136067 MW; E690AD330EC43917 CRC64;
MFSFVDSRTV LLLVASQVVL LSLVRCQAED DQEAGGCIQD GQQYNDKDVW KPEPCRICVC
DSGSVLCDEI ICEEIKECAN PIISSGECCP ICPADASAPI GCNLFIHIPI NKLLRITGRN
GEPGIPGFPG PPGPPGPSGL GRNFAAQLAG GFDEKSGGAQ MGVMQGPMVR LEECSLMINP
YFDVSDRSPC GEAGKPGKSG ERGPAGPQGA RGFPGTPGLP GIKGHRGHSG LDGAKGETGA
AGAKGESGAA GENGAPGPMG PRGLPGERGR PGAAGAAGAR GNDGLPGPAG PPGPVGPAGA
PGFPGSPGSK GEAGPTGVRG AEGPQGPRGE GGTPGSPGPA GASGNPGTDG IPGAKGSAGA
PGIAGAPGFP GPRGPPGPQG ATGSLGPKGQ SGDPGLPGFK GEAGPKGELG LTGPQGAPGP
AGEEGKRGAR GEPGAAGPLG PPGERGAPGN RGFPGQDGLA GAKGAPGDRG VPGASGPKGA
TGDSGRAGEP GLPGARGLTG RPGDAGAQGK VGPTGAAGED GRPGPPGPQG TRGQPGVMGF
PGPKGAHGEP GKPGEKGLVG RPGLRGLSGK DGETGPTGPS GPVGPAGERG EQGQSGPPGF
QGLPGPTGAP GEAGKPGDQG VSGEAGAPGA AGPRGERGFP GERGNAGAQG LQGPRGLPGT
PGSDGPKGAI GPAGAAGSQG PPGLQGMPGE RGTSGISGPK GDRGDSGQKG LEGAPGKDGA
RGLTGPIGPP GPAGPNGAKV STSSSHLGSG DRGEVGPPGP AGFAGPPGAD GQPGVKGELG
ETGQKGESGA HGPQGPSGAP GPVGPTGVTG LKGARGAQGA PGATGFPGAA GRVGPPGPNG
NHGSAGPPGP AGKDGPKGVR GDAGPPGRHG DAGLRGPPGQ QGAKGEAGED GPPGPDGPSG
PQGLAGSRGI VGLPGQRGER GFPGLPGPSG EPGKQGSTGS SGDRGPPGPV GPPGLSGPAG
EPGREGTPGS DGPPGRDGAS GVKGERGNTG PVGAPGAPGA PGAPGPVGPL GKQGDRGEAG
AQGPAGPAGP AGARGMAGAQ GPRGDKGEAG ESGERGQKGH RGFTGLQGLP GPPGPAGDAG
ASGPAGPSGA KGPAGPSGPA GKDGSNGQSG PIGPPGPRGR SGESGPAGPP GNTGPPGPPG
PPGPGIDMSA FAGLGQTEKS TDPLRYMRAD EASNSLRQHD VEVDSSLKSL NSQIENLRSP
DGSQKNPART CRDLKLCHPE WKSGDYWVDP NIGSTADAMK VFCNMETGET CVYPSIAKVP
KKNWWTSKSK DRKHVWFGET MNGGFHFSYA QDGPAANAAG IQLTFLRLLS NEASQNLTYH
CKNSVAYMDL GTGNLKKALL LQGSNDVEIR AEGNSRFTYS VMEDGCKKHT GRWGKTVFEY
KTQKTSRLPI VDIAPMDIGG ADQEFGVDVG PVCFL
//