ID E2B553_HARSA Unreviewed; 876 AA.
AC E2B553;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 28-JAN-2026, entry version 59.
DE SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:EFN89188.1};
GN ORFNames=EAI_10165 {ECO:0000313|EMBL:EFN89188.1};
OS Harpegnathos saltator (Jerdon's jumping ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Ponerinae; Ponerini; Harpegnathos.
OX NCBI_TaxID=610380 {ECO:0000313|Proteomes:UP000008237};
RN [1] {ECO:0000313|EMBL:EFN89188.1, ECO:0000313|Proteomes:UP000008237}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=R22 G/1 {ECO:0000313|EMBL:EFN89188.1,
RC ECO:0000313|Proteomes:UP000008237};
RX PubMed=20798317; DOI=10.1126/science.1192428;
RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G.,
RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D.,
RA Wang J., Liebig J.;
RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos
RT saltator.";
RL Science 329:1068-1071(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL445712; EFN89188.1; -; Genomic_DNA.
DR AlphaFoldDB; E2B553; -.
DR InParanoid; E2B553; -.
DR OMA; WWKVSAS; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000008237; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EFN89188.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000008237}.
FT DOMAIN 591..639
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 675..840
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 24..537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 24..36
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 71..83
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 134..146
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..231
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 258..270
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 284..300
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 301..327
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 328..343
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 403..414
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..444
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 505..518
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 876 AA; 90292 MW; 421E4C6F5970982B CRC64;
MRAFPFVDRY SSYYFMLYAH KGPKGEKGDK GDKGESVRGP PGPPGPPGRD EDWLMKIPQG
PPGQKGDPGS CTCNATALMS SFTMPKMIQG PKGEPGVSGQ EGKQGLMGLT GAAGPPGERG
LQGPSGAKGD KGDIGIPGPE GPQGQKGEPG RDGIPGEKGA QGPPGPPGKG EFSGYDTEGI
TMRPGLPGQK GEPGTSGSPG PKGEAGITGS KGIKGEPGHK GAKGDHGKEG PRGTQGFKGE
PGAPGAPGLP GAPGENGRPA EKGDKGDIGP EGKSGPPGPP GPSGTSGPGG INVGDLGFGT
KGDKGELGTR GYRGDKGTKG EKGDKGDAGP AGIPGINGIQ GPQGDKGEPG TDGVSGSPGT
PGAKGERGER GPPGATTVAS SGDYVTIKGE KGAEGKRGRR GRPGPPGPVG PPGKPGIMGE
IGLPGWMGRP GNPGIPGSIG PMGPKGEKGE PGAPSPYGVS VGIKGDKGND GLPGIPGQTG
RDGQRGAPGP PGPPGPSSQG KYMPVPGPPG PPGPPGPPGV SLVGQKGEPG IGRNSYGEKN
LYYGLRQGSR SNTDELKALR ELKQLREQLD IVATVATKLP LESTTKIVPG AVTFQNTEIM
RKMSSVSPVG TLAYIIDEQA LLVRVNNGWQ YIALGSLLPI TTPAPPTTAP PPANPPFEAS
NLINQIPMKA DGTGLRMAAL NEPFTGDMHG IRGADYACYR QARRAGLRGT FRAFLSSRVQ
NVDSIVRLGD RDLPIVNIKG DVLFNSWKEM FNGNGAYFSQ NPRIYSFNGK NILTDFTWSE
KVAWHGSHKL GDRAMDTYCD AWHSSNSDRY GLGSPLTGGR LLEQVRYSCD NKFALLCIEV
TSESVRRRRS ADDRPEDDLE MTENDYMEYL EELMQY
//