ID A0A834U6D8_VESGE Unreviewed; 962 AA.
AC A0A834U6D8;
DT 29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 1.
DT 28-JAN-2026, entry version 20.
DE RecName: Full=Collagen alpha-1(XV) chain {ECO:0008006|Google:ProtNLM};
GN ORFNames=HZH68_001082 {ECO:0000313|EMBL:KAF7418429.1};
OS Vespula germanica (German yellow jacket) (Paravespula germanica).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Vespoidea;
OC Vespidae; Vespinae; Vespula.
OX NCBI_TaxID=30212 {ECO:0000313|EMBL:KAF7418429.1, ECO:0000313|Proteomes:UP000617340};
RN [1] {ECO:0000313|EMBL:KAF7418429.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Linc-1 {ECO:0000313|EMBL:KAF7418429.1};
RX PubMed=32859687; DOI=10.1534/g3.120.401579;
RA Harrop T.W.R., Guhlin J., McLaughlin G.M., Permina E., Stockwell P.,
RA Gilligan J., Le Lec M.F., Gruber M.A.M., Quinn O., Lovegrove M.,
RA Duncan E.J., Remnant E.J., Van Eeckhoven J., Graham B., Knapp R.A.,
RA Langford K.W., Kronenberg Z., Press M.O., Eacker S.M., Wilson-Rankin E.E.,
RA Purcell J., Lester P.J., Dearden P.K.;
RT "High-Quality Assemblies for Three Invasive Social Wasps from the
RT Vespula Genus.";
RL G3 (Bethesda) 10:3479-3488(2020).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAF7418429.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JACSDZ010000001; KAF7418429.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A834U6D8; -.
DR Proteomes; UP000617340; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000617340}.
FT DOMAIN 679..727
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 763..928
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 79..158
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 172..622
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 87..99
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 102..117
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 121..131
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 141..151
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 220..232
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 299..317
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 344..356
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 359..372
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 384..413
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..500
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 574..583
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 591..604
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 962 AA; 99433 MW; C8BEA84777B6A33F CRC64;
MDEDDRRKLL HQAKTVKCTV KHVSAGHLGR SIDPSNTSRM IDASFWHMPG LLQSLKLYSG
HPQDLVKCTA DFDIDREYDE DKNEEGNPPP FITPPPPNPD YKGPKGEKGE KGDKGESVRG
PPGPPGPPGP GYDPNDEDWP IKIPHGPIGP KGEPGECTCN ATALMTSFTM PKMIEGPKGK
PGVPGKEGKQ GQMGLTGAAG PPGERGLQGP PGPKGDKGDI GIQGPEGPQG QKGEPGRDGI
PGEKGAQGPP GPPGKGEFSG YDTEGITMRP GLPGQKGEAG TAGNPGPKGE AGIPGGKGVK
GEPGHKGGKG EHGKEGPRGI QGFKGEPGAP GAPGLPGAPG ENGRPAEKGD KGDAGPEGKP
GPPGPPGSPG PSGPGGINVG DTGLEEKGEK GESGPRGYKG DQGTKGEKGD KGESGPAGIP
GVNGIQGPQG DKGEPGKDGV PGASGIPGLK GERGEKGPPG VTAIANSGDY ITIKGEKGAE
GKRGRRGRPG PPGPVGPPGK PGVMGEIGLP GWMGRPGTPG IPGSPGSTGS KGEKGEPGAP
SSYGISVGRK GEKGDDGMPG IPGQHGRDGQ RGPAGPPGPP GPPSQGNYVP VPGPPGPPGP
PGPPGLSLIG QKGEPGIGRS HIFGERDYYG MRQGPRSSLD ELKALRELKQ LKELKEHLGS
VTAATRGPLE STTKIVPGAV TFQNTEAMTK MSSVSPVGTL AYIIDEQALL VRVNNGWQYI
ALGSLLPITT PAPPTTAPPP ANPPFEASNL INQIPVKADG TGLRMAALNE PFTGDMHGVR
GADYACYRQA KRAGLRGTFR AFLSSRVQNV DSIVRLGDRD LPIVNIKGDV LFNSWKEMFN
GNGAYFSQNP RIYSFNGKNI LTDFAWPEKV AWHGSHKLGD RAMDTYCDAW HSSSSDRYGL
GSPLTGGRLL EQVRYSCDNK FALLCIEVTS EVTRRRRNSD AVEDVEMTES DYKRHLEALM
KN
//