GenomeNet

Database: UniProt
Entry: A0A3S2NSX4_ORYJA
ID   A0A3S2NSX4_ORYJA        Unreviewed;       602 AA.
AC   A0A3S2NSX4;
DT   10-APR-2019, integrated into UniProtKB/TrEMBL.
DT   10-APR-2019, sequence version 1.
DT   28-JAN-2026, entry version 23.
DE   RecName: Full=Collagenase NC10/endostatin domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=OJAV_G00195340 {ECO:0000313|EMBL:RVE58546.1};
OS   Oryzias javanicus (Javanese ricefish) (Aplocheilus javanicus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC   Oryzias.
OX   NCBI_TaxID=123683 {ECO:0000313|EMBL:RVE58546.1, ECO:0000313|Proteomes:UP000283210};
RN   [1] {ECO:0000313|EMBL:RVE58546.1, ECO:0000313|Proteomes:UP000283210}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=RS831 {ECO:0000313|EMBL:RVE58546.1};
RC   TISSUE=Whole body {ECO:0000313|EMBL:RVE58546.1};
RA   Lopez-Roques C., Donnadieu C., Bouchez O., Klopp C., Cabau C., Zahm M.;
RL   Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:RVE58546.1, ECO:0000313|Proteomes:UP000283210}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=RS831 {ECO:0000313|EMBL:RVE58546.1};
RC   TISSUE=Whole body {ECO:0000313|EMBL:RVE58546.1};
RA   Herpin A., Takehana Y., Naruse K., Ansai S., Kawaguchi M.;
RT   "A chromosome length genome reference of the Java medaka (Oryzias
RT   javanicus).";
RL   Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM012456; RVE58546.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A3S2NSX4; -.
DR   OrthoDB; 10060752at2759; -.
DR   Proteomes; UP000283210; Chromosome 20.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1105; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01391; Collagen; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000283210}.
FT   DOMAIN          353..400
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          432..598
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..45
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          61..188
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          209..349
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..19
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        28..39
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        124..135
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        169..178
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        243..257
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        291..317
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        330..343
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   602 AA;  61537 MW;  7C665D1BC81B2A85 CRC64;
     MSRFQGPKGD PGDPGQKGEP GADGLAVPGP PGPPGPPGPV FNLHDLPLNI TDAAFNLSGI
     VEALGPPGPQ GPKGDSGLPG FHGSPGVKGQ KGEPGGLVAA DGSTMAHLAG PVGPKGVKGD
     GGLPGPPGVP GPVGPAGPKG DIGIPGRSGR PGLMGPKGEK GDPRGLPGLP GPPGPPGRPG
     IFNCPKGTVF PIPPRPHCKM PLHPNGSIAA GNCRSGLKGE KGETGPPGSS AAPLSFLPRG
     RSSSREDPGI KGEKGQKGDP GSPGQPGIPG RSGLVGPKGD SVLGPAGLPG VPGPPGVPGF
     GRPGPPGPPG PPGPPLFPSR YGSALSNAGP PGPPGPPGPP GPPGASSDAA AIKTFSTRES
     MMQQTLRDAD GTLAFVTASG SLFLKVSQGW KEIQLGNLIY LSSNIIPQDE PQAAFQVRGE
     TLQRIRSPSE RLTLVALNQP HSGDMMGLDM ADRMCYEQAK AMGLAPHYRA FISSHKQDLV
     HVVYPGFREA LPVTNLRGDV MFWNWRSIFN GNGGSFSPRI PIYSFDGRDV LADPFWPKKS
     IWHGSTGRGL RMVDKHCETW RADDLSVTGQ SSSLTSGLLL GQQTRSCSNE YVVLCIETHK
     NL
//