ID I3ML87_ICTTR Unreviewed; 1310 AA.
AC I3ML87;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=Collagen type XV alpha 1 chain {ECO:0000313|Ensembl:ENSSTOP00000012153.3};
GN Name=COL15A1 {ECO:0000313|Ensembl:ENSSTOP00000012153.3};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000012153.3, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000012153.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01043791; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01043792; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01043793; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01043794; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01043795; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 43179.ENSSTOP00000012153; -.
DR Ensembl; ENSSTOT00000013564.3; ENSSTOP00000012153.3; ENSSTOG00000013544.3.
DR eggNOG; KOG3544; Eukaryota.
DR eggNOG; KOG3546; Eukaryota.
DR GeneTree; ENSGT00940000158302; -.
DR HOGENOM; CLU_004003_0_0_1; -.
DR InParanoid; I3ML87; -.
DR TreeFam; TF315821; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:Ensembl.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd00247; Endostatin-like; 1.
DR CDD; cd00110; LamG; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF912; COLLAGEN ALPHA-1(XV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR Pfam; PF13385; Laminin_G_3; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..1310
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012949132"
FT DOMAIN 39..227
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT DOMAIN 88..226
FT /note="Laminin G"
FT /evidence="ECO:0000259|SMART:SM00282"
FT REGION 227..250
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 263..346
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 359..444
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 459..723
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 805..838
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 913..938
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 997..1050
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 298..318
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 369..395
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..444
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 513..529
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 567..581
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 664..680
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 999..1032
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1310 AA; 133788 MW; BCDBE21946030DBB CRC64;
MAPRRNGQCW RVLLLLSISA LLSAVTQTRA STELASQSHL DLTELIGVPL PSSVSFVTGY
SGFPAYSFGP GANVGRPART LIPPTFFRDF AVSVTVKPSS PSGGVLFAIT DAFQKVIYLG
LRLSGVEDGR QRVILYYTEP GSHVSQEAAA FSVPVMTNRW SRFAVIVQGE EVTLLVDCEE
HSQVLFQRSS RPLAFEPSAG IFVGNAGATG LERFTGSIQQ LIIHSDPRTP EEVCEAQESS
ASGEGSGLQD PDTVAEILEA VTHTQAPPKE AKVDPINIPP TSPSPYEDTE LSGEPIPEET
PENTSSVVQH SNLEPGSGEI LNDTLEGLPP VDGDPIPEIG SGSGAFPVTE EEGLAATAAG
EVEVPVSTTP EAELSSSPTG RSALPVSTQD PGEGFTSGLD NEGLAMTATG ETEVPVSTAG
ETEAGSMPTG EPALSTFTQD SGEDATLAAA ASEVPLVTFE EEEASGVPTD GLATLAPTAA
PGQVVTPAPG DDHWAATSTE EPPATAGGEG PSSAPPDGPP LPVPTAAPER QVTPPVRVEA
EGSGLGWGLD GGSGSGDLVG SEELLRGPPG PPGPPGLPGI PGKPGTDVFN GPPGSPGKDG
AAGEPGPPGP EGQPGPDGAS GVPGMKGEKG ARGPNGSVGE KGDPGSRGLP GPPGKNGQVG
TPGVRGPPGP PGPPGPPGPG CTTELGFEGP KGEKGDQGPK GERGMDGAST VGPPGPRGPP
GRIEILSSSL INITHGSMNL SDIPELMGPP GPDGLPGLPG FPGPRGPKGD TGVPGFPGLK
GEQGEKGEPG AILTGDIPLE RLKGKKGEPG MHGTPGPMGP KGPPGHKGEF GLPGRPGRPG
LNGLKGAKGD RGIMMPGPPG LPGPPGPPGP PGAVVNIKGA VFPVPARPHC KTPVGTTLPG
DSELITFHGV KGEKGSWGLP GSKGEKGDQG AQGPPGPPVD PVYLRHFLNS LKGENGDRGF
KGEKGDSHDN FFVPGPPGLP GNPGLVGQKG ETVVGPQGPP GIPGLPGPPG FGRPGAPGPP
GPPGPPGPPA ILGAAVALPG PPGPPGQPGL PGSRNLVTAF SNMNDMLQKA HLVIEGTFIY
LRDSAEFFIR VRDGWKKLQL GELIPIPADS PPPPALSSNP YQLQPPLNPI LSANYENPAL
HLVALNMPFS GDIRADFQCF QQARAAGLLS TFRAFLSSHL QDLSTVVRKA ERYSLPIVNL
KGQVLFNNWD SIFSGQGGQF NAHVPIYSFD GRDVMTDPSW PQKVVWHGSN THGVRLVDKY
CEAWRTADMA VTGFASPLST GKILDQQAYS CANRLIVLCI ENSFMTDARK
//