GenomeNet

Database: UniProt
Entry: A0A3Q3X2N0_MOLML
LinkDB: A0A3Q3X2N0_MOLML
Original site: A0A3Q3X2N0_MOLML 
ID   A0A3Q3X2N0_MOLML        Unreviewed;      1415 AA.
AC   A0A3Q3X2N0;
DT   10-APR-2019, integrated into UniProtKB/TrEMBL.
DT   10-APR-2019, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMOP00000016179.1};
OS   Mola mola (Ocean sunfish) (Tetraodon mola).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Tetraodontiformes; Molidae; Mola.
OX   NCBI_TaxID=94237 {ECO:0000313|Ensembl:ENSMMOP00000016179.1, ECO:0000313|Proteomes:UP000261620};
RN   [1] {ECO:0000313|Ensembl:ENSMMOP00000016179.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSMMOT00000016449.1; ENSMMOP00000016179.1; ENSMMOG00000011805.1.
DR   Proteomes; UP000261620; Unplaced.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   Pfam; PF00093; VWC; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   4: Predicted;
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Reference proteome {ECO:0000313|Proteomes:UP000261620};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..28
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           29..1415
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018666657"
FT   DOMAIN          35..93
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          1181..1415
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          185..1151
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        287..301
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1129..1145
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1415 AA;  136067 MW;  E690AD330EC43917 CRC64;
     MFSFVDSRTV LLLVASQVVL LSLVRCQAED DQEAGGCIQD GQQYNDKDVW KPEPCRICVC
     DSGSVLCDEI ICEEIKECAN PIISSGECCP ICPADASAPI GCNLFIHIPI NKLLRITGRN
     GEPGIPGFPG PPGPPGPSGL GRNFAAQLAG GFDEKSGGAQ MGVMQGPMVR LEECSLMINP
     YFDVSDRSPC GEAGKPGKSG ERGPAGPQGA RGFPGTPGLP GIKGHRGHSG LDGAKGETGA
     AGAKGESGAA GENGAPGPMG PRGLPGERGR PGAAGAAGAR GNDGLPGPAG PPGPVGPAGA
     PGFPGSPGSK GEAGPTGVRG AEGPQGPRGE GGTPGSPGPA GASGNPGTDG IPGAKGSAGA
     PGIAGAPGFP GPRGPPGPQG ATGSLGPKGQ SGDPGLPGFK GEAGPKGELG LTGPQGAPGP
     AGEEGKRGAR GEPGAAGPLG PPGERGAPGN RGFPGQDGLA GAKGAPGDRG VPGASGPKGA
     TGDSGRAGEP GLPGARGLTG RPGDAGAQGK VGPTGAAGED GRPGPPGPQG TRGQPGVMGF
     PGPKGAHGEP GKPGEKGLVG RPGLRGLSGK DGETGPTGPS GPVGPAGERG EQGQSGPPGF
     QGLPGPTGAP GEAGKPGDQG VSGEAGAPGA AGPRGERGFP GERGNAGAQG LQGPRGLPGT
     PGSDGPKGAI GPAGAAGSQG PPGLQGMPGE RGTSGISGPK GDRGDSGQKG LEGAPGKDGA
     RGLTGPIGPP GPAGPNGAKV STSSSHLGSG DRGEVGPPGP AGFAGPPGAD GQPGVKGELG
     ETGQKGESGA HGPQGPSGAP GPVGPTGVTG LKGARGAQGA PGATGFPGAA GRVGPPGPNG
     NHGSAGPPGP AGKDGPKGVR GDAGPPGRHG DAGLRGPPGQ QGAKGEAGED GPPGPDGPSG
     PQGLAGSRGI VGLPGQRGER GFPGLPGPSG EPGKQGSTGS SGDRGPPGPV GPPGLSGPAG
     EPGREGTPGS DGPPGRDGAS GVKGERGNTG PVGAPGAPGA PGAPGPVGPL GKQGDRGEAG
     AQGPAGPAGP AGARGMAGAQ GPRGDKGEAG ESGERGQKGH RGFTGLQGLP GPPGPAGDAG
     ASGPAGPSGA KGPAGPSGPA GKDGSNGQSG PIGPPGPRGR SGESGPAGPP GNTGPPGPPG
     PPGPGIDMSA FAGLGQTEKS TDPLRYMRAD EASNSLRQHD VEVDSSLKSL NSQIENLRSP
     DGSQKNPART CRDLKLCHPE WKSGDYWVDP NIGSTADAMK VFCNMETGET CVYPSIAKVP
     KKNWWTSKSK DRKHVWFGET MNGGFHFSYA QDGPAANAAG IQLTFLRLLS NEASQNLTYH
     CKNSVAYMDL GTGNLKKALL LQGSNDVEIR AEGNSRFTYS VMEDGCKKHT GRWGKTVFEY
     KTQKTSRLPI VDIAPMDIGG ADQEFGVDVG PVCFL
//
DBGET integrated database retrieval system