GenomeNet

Database: UniProt
Entry: A0A5E4QPU7_9NEOP
LinkDB: A0A5E4QPU7_9NEOP
Original site: A0A5E4QPU7_9NEOP 
ID   A0A5E4QPU7_9NEOP        Unreviewed;       404 AA.
AC   A0A5E4QPU7;
DT   13-NOV-2019, integrated into UniProtKB/TrEMBL.
DT   13-NOV-2019, sequence version 1.
DT   28-JAN-2026, entry version 18.
DE   RecName: Full=Collagenase NC10/endostatin domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=LSINAPIS_LOCUS10971 {ECO:0000313|EMBL:VVD00300.1};
OS   Leptidea sinapis.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC   Pieridae; Dismorphiinae; Leptidea.
OX   NCBI_TaxID=189913 {ECO:0000313|EMBL:VVD00300.1, ECO:0000313|Proteomes:UP000324832};
RN   [1] {ECO:0000313|EMBL:VVD00300.1, ECO:0000313|Proteomes:UP000324832}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Talla V., Backstrom N.;
RL   Submitted (JUL-2017) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; FZQP02004601; VVD00300.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A5E4QPU7; -.
DR   Proteomes; UP000324832; Unassembled WGS sequence.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 2.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24637; COLLAGEN; 1.
DR   PANTHER; PTHR24637:SF421; CUTICLE COLLAGEN DPY-2; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000324832}.
FT   DOMAIN          165..210
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          247..327
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   DOMAIN          329..391
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..106
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..12
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        15..25
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        28..37
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        38..63
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        95..106
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   404 AA;  42646 MW;  2173BE4485E3B3B8 CRC64;
     MTKPGPRGKR GHPGAPGPRG PPGGPPGVAG QPGAQGPKGE KGDPGLTKAE LDKIKGEKGD
     RGLDGTPGTP GKDGPRGPPG PPGSLTGNVQ YVSVPGPPGP PGPPGPAIAI ANEHPMDTLT
     DSPGINRHEP TAGKSRDALQ ILRSLNHYMS RQGQYDGRTI IGTILFKTTD SLLRLGTKSP
     RGTLAYVIQE QALLVRVNNG WQYVAMGTLL AIHSSPVGGP TRTPQQNILE TSSLVHHKNP
     AGGGPVLRLA ALNEPHTGDM HGVSSTNYEC HRQAQRAGLD GTFRAFISSR VQTIDSIVSW
     VDREIPVVNT RGDVLFNSWG EMFDGSGALP IKAVWHGAIP NGEPAMDAYC DAWHSSNPEK
     LGLASSLRSN KLLDQETYSC SSRLIVLCIE ATPVDTARRK KRSK
//
DBGET integrated database retrieval system