ID A0A154PPI0_DUFNO Unreviewed; 795 AA.
AC A0A154PPI0;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 18-JUN-2025, entry version 33.
DE SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:KZC13791.1};
GN ORFNames=WN55_05694 {ECO:0000313|EMBL:KZC13791.1};
OS Dufourea novaeangliae (Sweat bee).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea;
OC Anthophila; Halictidae; Rophitinae; Dufourea.
OX NCBI_TaxID=178035 {ECO:0000313|EMBL:KZC13791.1, ECO:0000313|Proteomes:UP000076502};
RN [1] {ECO:0000313|EMBL:KZC13791.1, ECO:0000313|Proteomes:UP000076502}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=0120121106 {ECO:0000313|EMBL:KZC13791.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:KZC13791.1};
RA Pan H., Kapheim K.;
RT "The genome of Dufourea novaeangliae.";
RL Submitted (JUL-2015) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ435012; KZC13791.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A154PPI0; -.
DR STRING; 178035.A0A154PPI0; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000076502; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF914; OTOLIN-1; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KZC13791.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000076502}.
FT DOMAIN 511..559
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 595..760
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 28..437
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 37..46
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 166..175
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 199..228
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 304..315
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 392..402
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 410..422
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 795 AA; 82065 MW; B1F578FBAA903376 CRC64;
MPQGPPGKKG DPGTCSCNAT ALMASFTMPK MIQGPKGEPG VGGQEGKQGQ MGLTGAAGPP
GERGLEGPAG ARGDKGDMGI PGPEGPQGQK GEPGRDGIPG EKGSQGPPGP PGKGEFSGYD
TEGIAMRPGL PGEKGEGGTP GSPGPKGTPG IPGPKGNRGE PGLKGARGDH GKEGSRGIQG
FKGEPGAPGS PGLPGAPGLR EKGDKGEVGG RGYKGDKGTK GEKGDKGDSG PAGIPGVNGI
QGPQGNKGDP GKDGVPGTSG IVGVKGEKGE RGPPGATAIS SSGDYITIKG EKGMEGKRGR
RGRPGPPGPV GPPGKPGVMG EIGLPGWVNS MKGRPGTPGL PGPVGLGGPK GEKGEPGTPS
PYGVSVGIKG DKGDDGFPGI PGQPGRDGQR GPPGPPGPPG PPSQGNYIPV PGPPGPPGPP
GPSGMSLSGQ KEDTGIGRSH IFGERDYYGV RQAFNEKIHF LFILQGLRTS LDELKALREL
KQLKELKEHL GAVTAATRGP LETTTKIVPG AVTFQNTEAM TKMSAVSPVG TLAYIIDEQA
LLVRVNNGWQ YIALGSLLPI TTPAPPTTSP PPANPPFEAS NLINQIPVKA DGTGLRMAAL
NEPFTGDMHG VRGADYACYR QARRAGLRGT FRAFLSSRVQ NVDSIVRLGD RDLPIVNVKG
DVLFNSWKEM FNGNGAYFSQ IPRIYSFNGK NILTDFAWPE KVAWHGSHKL GDRAMDTYCD
SWHSSSSDRY GLGSPLTGGR LLEQVRYSCD NKFALLCIEV TSEITRRRRS VEVTDDEEEM
SENDYKEYLD ALMEN
//