ID A0A401RS92_CHIPU Unreviewed; 804 AA.
AC A0A401RS92;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE RecName: Full=Collagenase NC10/endostatin domain-containing protein {ECO:0000259|Pfam:PF06482};
GN ORFNames=chiPu_0019506 {ECO:0000313|EMBL:GCC21002.1};
OS Chiloscyllium punctatum (Brownbanded bambooshark) (Hemiscyllium punctatum).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC Elasmobranchii; Galeomorphii; Galeoidea; Orectolobiformes; Hemiscylliidae;
OC Chiloscyllium.
OX NCBI_TaxID=137246 {ECO:0000313|EMBL:GCC21002.1, ECO:0000313|Proteomes:UP000287033};
RN [1] {ECO:0000313|EMBL:GCC21002.1, ECO:0000313|Proteomes:UP000287033}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=30297745; DOI=.1038/s41559-018-0673-5;
RA Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA Kuratani S, Sato K, Hyodo S Kuraku.S.;
RT "Shark genomes provide insights into elasmobranch evolution and the origin
RT of vertebrates.";
RL Nat. Ecol. Evol. 2:1761-1771(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GCC21002.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BEZZ01002023; GCC21002.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A401RS92; -.
DR STRING; 137246.A0A401RS92; -.
DR OMA; YSHERPY; -.
DR Proteomes; UP000287033; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd00247; Endostatin-like; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000287033}.
FT DOMAIN 630..798
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 1..100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 113..263
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 340..623
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 77..97
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 341..357
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 485..504
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 517..541
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 575..592
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 804 AA; 80649 MW; 99D03A0B88C1F7C5 CRC64;
MNGTAIPGLP GLPGPPGADG RPGVSGPTGP PGPPGQDGRP GLPGEKGLQG EPGDLGLPGA
QGPKGLVGQA GPPGQPGLAG LPGPVGPRGP PGPPGPSAAV LRAGFEDAEG SALPIMTGVP
GERGLEGAQG VPGVPGLPGV PGLKGEPGIP GELGLQGPQG YPGEIGLTGP PGPQGPKGAQ
GDPGMKGEPG RDGVGLPGPP GPPGITYSGG SAVDGVTFVP GAQGPRGIPG QAGLPGPSGP
KGEQGPPGVT GPSGPKGDRG EPGFIISPDG TTLNSLLASG IGSKGDVGIP GPIGPQGLPG
YPGQKGEIGL PGRPGRPGIN GLKGEKGEPG DFSGGFGYRN IPGPPGPPGP PGPPGMPASD
HVFRGAAEGF PGIHGPQGPK GDRGNPGYMG PPGQKGDIGE PGLPGPPGTI PHEVYDFTSS
LRGAKGETGD VGQKGEKGAP GDGYGSGVAG PQGPPGHPGP PGLQGPKGDS IVGPRGLPGN
AGPPGYGYPG APGPPGPPGP PGLGEYIPYH GTGINGNNVE AVGSPPVVQY STDSESAQTV
EGAAPDSGAP EPRTESPWGP EHRWPVHHRP ETPQLNYPRH QQPQSVPRWL TTSAPPRSAA
PPRHSHHGLS HRPHVPPSGH AESLHGVPML HLMALNTPLT GNMQGIRGVD FQCFQQARGV
GLMGTFRGFL SSKLQDLYSV VRRADRHSVP VVNSKGQVLF DSWSSLFSGS QARIKSNTPI
YSFDGRDVLQ DHAWPEKFVW HGSDSRGQRL SQSYCEAWRE AQDNKVGMAS SLLSGQLLEQ
RPSPCSHSFI ILCIENSYLP HTRK
//