ID A0A7J5Y0H9_DISMA Unreviewed; 670 AA.
AC A0A7J5Y0H9;
DT 07-APR-2021, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 1.
DT 18-JUN-2025, entry version 16.
DE RecName: Full=Collagenase NC10/endostatin domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=F7725_001689 {ECO:0000313|EMBL:KAF3842840.1};
OS Dissostichus mawsoni (Antarctic cod).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Notothenioidei; Nototheniidae; Dissostichus.
OX NCBI_TaxID=36200 {ECO:0000313|EMBL:KAF3842840.1, ECO:0000313|Proteomes:UP000518266};
RN [1] {ECO:0000313|EMBL:KAF3842840.1, ECO:0000313|Proteomes:UP000518266}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DM0001 {ECO:0000313|EMBL:KAF3842840.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:KAF3842840.1};
RA Park H.;
RT "Dissostichus mawsoni Genome sequencing and assembly.";
RL Submitted (MAR-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAF3842840.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JAAKFY010000018; KAF3842840.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A7J5Y0H9; -.
DR OrthoDB; 10060752at2759; -.
DR Proteomes; UP000518266; Unassembled WGS sequence.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000518266}.
FT DOMAIN 376..422
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 496..664
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 18..46
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 183..210
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 273..345
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 450..483
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 288..305
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 469..479
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 670 AA; 73276 MW; 4B4502E19989C155 CRC64;
MPPYRGLEET QGYRVFQDRP DWLVCPDPSD QSDSRGLPGP LDPATASDLM TWRPLVEVSA
TDFLVPEDLK EDRVNLGCLV SPVRRAVKGP SEETDSPGWT ASPDLRDQRV TVVTKETGVS
QVGMDPDSLA PPAHLDPRDK SSTRLMAMLV MLGGRVLLGQ GVTEENRGEK GEPGLVIGPD
GSLLHLDGLS GQKGPYGPSG IKGEFGMPGR PGRPGVNGYK GEKGDTSGGS GYGYPCLANQ
DNPDHLDPPD QPSPLIASIA MTTVQGITQQ LKETKESKVT MDCQESQNEL KGERGDVGVK
GEKGEQIGGY YDQRFGGVQG QSGPPGSKGR LCNGTSWTPG ATRNPRDWLR WASSRQYSWT
SRPPGQPGSP ALSSGVTVLR SYETMVATAR RQSEGSLIYI IDKADLYLRV RDGLRQVMLG
DYNPFFRDLE NEVAEVQPPP VIVYPDTQDQ SQNNGAGHYS HGGPVIRPIE PPPQPPVKPD
TPHSMIPDSQ TRDRQLHIIA LNAPQTGNMR GIRGADFLCF QQARAVGLKG TFRAFLSSKL
QDLYTIVRRA DRDNFSIVNL KDQVLFDSWE SMFGDNTNKM RENVPIYSFD GRDTLRDSAW
PEKMVWHGSN NKGHRQTDHY CETWRTGDHA VSGLASSLQS GQLLQQSSSS CSGSYIVLCI
ENAFTTHSKK
//