GenomeNet

Database: UniProt
Entry: A0A834U6D8_VESGE
LinkDB: A0A834U6D8_VESGE
Original site: A0A834U6D8_VESGE 
ID   A0A834U6D8_VESGE        Unreviewed;       962 AA.
AC   A0A834U6D8;
DT   29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 1.
DT   28-JAN-2026, entry version 20.
DE   RecName: Full=Collagen alpha-1(XV) chain {ECO:0008006|Google:ProtNLM};
GN   ORFNames=HZH68_001082 {ECO:0000313|EMBL:KAF7418429.1};
OS   Vespula germanica (German yellow jacket) (Paravespula germanica).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Vespoidea;
OC   Vespidae; Vespinae; Vespula.
OX   NCBI_TaxID=30212 {ECO:0000313|EMBL:KAF7418429.1, ECO:0000313|Proteomes:UP000617340};
RN   [1] {ECO:0000313|EMBL:KAF7418429.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Linc-1 {ECO:0000313|EMBL:KAF7418429.1};
RX   PubMed=32859687; DOI=10.1534/g3.120.401579;
RA   Harrop T.W.R., Guhlin J., McLaughlin G.M., Permina E., Stockwell P.,
RA   Gilligan J., Le Lec M.F., Gruber M.A.M., Quinn O., Lovegrove M.,
RA   Duncan E.J., Remnant E.J., Van Eeckhoven J., Graham B., Knapp R.A.,
RA   Langford K.W., Kronenberg Z., Press M.O., Eacker S.M., Wilson-Rankin E.E.,
RA   Purcell J., Lester P.J., Dearden P.K.;
RT   "High-Quality Assemblies for Three Invasive Social Wasps from the
RT   <i>Vespula</i> Genus.";
RL   G3 (Bethesda) 10:3479-3488(2020).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAF7418429.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JACSDZ010000001; KAF7418429.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A834U6D8; -.
DR   Proteomes; UP000617340; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000617340}.
FT   DOMAIN          679..727
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          763..928
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          79..158
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          172..622
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        87..99
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        102..117
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        121..131
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        141..151
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        220..232
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        299..317
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        344..356
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        359..372
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        384..413
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        489..500
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        574..583
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        591..604
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   962 AA;  99433 MW;  C8BEA84777B6A33F CRC64;
     MDEDDRRKLL HQAKTVKCTV KHVSAGHLGR SIDPSNTSRM IDASFWHMPG LLQSLKLYSG
     HPQDLVKCTA DFDIDREYDE DKNEEGNPPP FITPPPPNPD YKGPKGEKGE KGDKGESVRG
     PPGPPGPPGP GYDPNDEDWP IKIPHGPIGP KGEPGECTCN ATALMTSFTM PKMIEGPKGK
     PGVPGKEGKQ GQMGLTGAAG PPGERGLQGP PGPKGDKGDI GIQGPEGPQG QKGEPGRDGI
     PGEKGAQGPP GPPGKGEFSG YDTEGITMRP GLPGQKGEAG TAGNPGPKGE AGIPGGKGVK
     GEPGHKGGKG EHGKEGPRGI QGFKGEPGAP GAPGLPGAPG ENGRPAEKGD KGDAGPEGKP
     GPPGPPGSPG PSGPGGINVG DTGLEEKGEK GESGPRGYKG DQGTKGEKGD KGESGPAGIP
     GVNGIQGPQG DKGEPGKDGV PGASGIPGLK GERGEKGPPG VTAIANSGDY ITIKGEKGAE
     GKRGRRGRPG PPGPVGPPGK PGVMGEIGLP GWMGRPGTPG IPGSPGSTGS KGEKGEPGAP
     SSYGISVGRK GEKGDDGMPG IPGQHGRDGQ RGPAGPPGPP GPPSQGNYVP VPGPPGPPGP
     PGPPGLSLIG QKGEPGIGRS HIFGERDYYG MRQGPRSSLD ELKALRELKQ LKELKEHLGS
     VTAATRGPLE STTKIVPGAV TFQNTEAMTK MSSVSPVGTL AYIIDEQALL VRVNNGWQYI
     ALGSLLPITT PAPPTTAPPP ANPPFEASNL INQIPVKADG TGLRMAALNE PFTGDMHGVR
     GADYACYRQA KRAGLRGTFR AFLSSRVQNV DSIVRLGDRD LPIVNIKGDV LFNSWKEMFN
     GNGAYFSQNP RIYSFNGKNI LTDFAWPEKV AWHGSHKLGD RAMDTYCDAW HSSSSDRYGL
     GSPLTGGRLL EQVRYSCDNK FALLCIEVTS EVTRRRRNSD AVEDVEMTES DYKRHLEALM
     KN
//
DBGET integrated database retrieval system