GenomeNet

Database: UniProt
Entry: E2B553_HARSA
ID   E2B553_HARSA            Unreviewed;       876 AA.
AC   E2B553;
DT   30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT   30-NOV-2010, sequence version 1.
DT   28-JAN-2026, entry version 59.
DE   SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:EFN89188.1};
GN   ORFNames=EAI_10165 {ECO:0000313|EMBL:EFN89188.1};
OS   Harpegnathos saltator (Jerdon's jumping ant).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC   Formicidae; Ponerinae; Ponerini; Harpegnathos.
OX   NCBI_TaxID=610380 {ECO:0000313|Proteomes:UP000008237};
RN   [1] {ECO:0000313|EMBL:EFN89188.1, ECO:0000313|Proteomes:UP000008237}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=R22 G/1 {ECO:0000313|EMBL:EFN89188.1,
RC   ECO:0000313|Proteomes:UP000008237};
RX   PubMed=20798317; DOI=10.1126/science.1192428;
RA   Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G.,
RA   Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D.,
RA   Wang J., Liebig J.;
RT   "Genomic comparison of the ants Camponotus floridanus and Harpegnathos
RT   saltator.";
RL   Science 329:1068-1071(2010).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GL445712; EFN89188.1; -; Genomic_DNA.
DR   AlphaFoldDB; E2B553; -.
DR   InParanoid; E2B553; -.
DR   OMA; WWKVSAS; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000008237; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EFN89188.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008237}.
FT   DOMAIN          591..639
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          675..840
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          24..537
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        24..36
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        71..83
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        134..146
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        213..231
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        258..270
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        284..300
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        301..327
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        328..343
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        403..414
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        429..444
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        505..518
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   876 AA;  90292 MW;  421E4C6F5970982B CRC64;
     MRAFPFVDRY SSYYFMLYAH KGPKGEKGDK GDKGESVRGP PGPPGPPGRD EDWLMKIPQG
     PPGQKGDPGS CTCNATALMS SFTMPKMIQG PKGEPGVSGQ EGKQGLMGLT GAAGPPGERG
     LQGPSGAKGD KGDIGIPGPE GPQGQKGEPG RDGIPGEKGA QGPPGPPGKG EFSGYDTEGI
     TMRPGLPGQK GEPGTSGSPG PKGEAGITGS KGIKGEPGHK GAKGDHGKEG PRGTQGFKGE
     PGAPGAPGLP GAPGENGRPA EKGDKGDIGP EGKSGPPGPP GPSGTSGPGG INVGDLGFGT
     KGDKGELGTR GYRGDKGTKG EKGDKGDAGP AGIPGINGIQ GPQGDKGEPG TDGVSGSPGT
     PGAKGERGER GPPGATTVAS SGDYVTIKGE KGAEGKRGRR GRPGPPGPVG PPGKPGIMGE
     IGLPGWMGRP GNPGIPGSIG PMGPKGEKGE PGAPSPYGVS VGIKGDKGND GLPGIPGQTG
     RDGQRGAPGP PGPPGPSSQG KYMPVPGPPG PPGPPGPPGV SLVGQKGEPG IGRNSYGEKN
     LYYGLRQGSR SNTDELKALR ELKQLREQLD IVATVATKLP LESTTKIVPG AVTFQNTEIM
     RKMSSVSPVG TLAYIIDEQA LLVRVNNGWQ YIALGSLLPI TTPAPPTTAP PPANPPFEAS
     NLINQIPMKA DGTGLRMAAL NEPFTGDMHG IRGADYACYR QARRAGLRGT FRAFLSSRVQ
     NVDSIVRLGD RDLPIVNIKG DVLFNSWKEM FNGNGAYFSQ NPRIYSFNGK NILTDFTWSE
     KVAWHGSHKL GDRAMDTYCD AWHSSNSDRY GLGSPLTGGR LLEQVRYSCD NKFALLCIEV
     TSESVRRRRS ADDRPEDDLE MTENDYMEYL EELMQY
//