ID A0A7K8LLY8_9AVES Unreviewed; 1258 AA.
AC A0A7K8LLY8;
DT 07-APR-2021, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 1.
DT 28-JAN-2026, entry version 17.
DE SubName: Full=COIA1 protein {ECO:0000313|EMBL:NXE30159.1};
DE Flags: Fragment;
GN Name=Col18a1 {ECO:0000313|EMBL:NXE30159.1};
GN ORFNames=ARDKOR_R02663 {ECO:0000313|EMBL:NXE30159.1};
OS Ardeotis kori.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Neoaves; Otidimorphae; Otidiformes;
OC Otididae; Ardeotis.
OX NCBI_TaxID=89386 {ECO:0000313|EMBL:NXE30159.1, ECO:0000313|Proteomes:UP000560386};
RN [1] {ECO:0000313|EMBL:NXE30159.1, ECO:0000313|Proteomes:UP000560386}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=B10K-CU-031-01 {ECO:0000313|EMBL:NXE30159.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:NXE30159.1};
RA Zhang G.;
RT "Bird 10,000 Genomes (B10K) Project - Family phase.";
RL Submitted (SEP-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:NXE30159.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; VWPR01003913; NXE30159.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A7K8LLY8; -.
DR Proteomes; UP000560386; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00110; LamG; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000560386};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 20..208
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 209..706
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 719..1007
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1068..1088
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 215..226
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 295..307
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 308..317
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..331
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 384..400
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 402..415
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..443
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 471..483
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 498..511
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 512..523
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 524..535
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 559..575
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 583..595
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 606..617
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 618..636
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 671..685
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 691..700
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 825..838
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 893..907
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 908..921
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 944..953
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 964..977
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 987..999
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:NXE30159.1"
FT NON_TER 1258
FT /evidence="ECO:0000313|EMBL:NXE30159.1"
SQ SEQUENCE 1258 AA; 127709 MW; 188652AC680FB639 CRC64;
QQAGCDPRLS LPLAENLSAE VSLLELIGDP PPEEIIKIYG PDNNPGYVFG PNANTGQVAR
YHLPSPFYRD FSLLFHIQAT TPRAGVLFAV TDSSQSFIYV GVKLSELRAG KQQIIFYYTE
PGSPSSYAAA TFTVPTLLNQ WTRFAISVED DEVILYLDCE EHERVHFERS PDEMELEDGS
GLFVAQAGGA DPDKYQGVIA DLKLLGDPRA AERQCEEEED DTDEVSGDFG SGAEGGHRPS
GKVKGVPGLV DAAPVTSPPV AAGGGPRSSG GSPQQAERTR AEEMLQVSAG GTGPKGEKGE
KGDRGLKGDS GTGGIVGTGS VKGEKGEKGE LGLKGSAGFG YPGSKGQKGE PGDPGPPGTL
SQHTDGSVVE QVAGPPGPPG KDGAPGRDGE PGDPGEDGKP GDMGPQGFPG TPGEPGQKGE
KGEPGMGPRG PPGPPGPPGP PGPSFRHDKL TFIDMEGSGF GGDLENLRGP QGPPGPPGPP
GVPGLPGEPG RFGMNRTDLP GLPGLPGRDG NPGPPGPAGP QGPPGRDGAA GQPGPKGERG
DVGDLGLPGA PGPKGSKGEP GPAGAPGEMG LAGLPGPMGP RGQPGPPGPP GPPGPGYEAG
FGDMEGSGLP FAASSPGAPG PQGPQVAPLP PLPSPPSRAL RGIPCHRGDA GVPGVDGRPG
LEGFPGPQGP KGDRGSPGEK GERGQDGVGL PGPPGPPGPP GQVITLSNED GLTQALWWSS
LKGPAGPKGD RGSSGPQGPP GLKGEKGEPG VIISPDGTVV TAKAKGEKGE PGLRGPMGPS
GPQGRAGMKG EIGFPGRPGR PGMNGLKGEK GDPADVSGAL GLRGPPGPPG PPGPPGPPGS
IAYDNNNVSG IAPLLPTGFY QFPGQKGEKG DVGAPGPPGH FPYDPSHFGA NLRGDKGEAG
PKGEKGEPGS TPLYGPSISG LPGPPGPQGY PGLPGPKGDS IVGPPGPPGP QGPPGVGYEG
RQGPPGPPGP PGPPSFPGPH RQAISIPGPP GPPGPPGPPG TSGTSLGLRA LPTYQAMLSA
AHDLPEGGFV FLTDRQELYL RVRGGFRRVL HQPSACLSLS PQDNEVYDKP PSIHYAGPQP
PLQPHGPLHP LRNHGPLPTA RPWRGDEVVA NQHRLPEQPL LHHQHELLNS YYTQRRPDPA
PVAAHVHQDF QPALHLVALN APLSGGMRGI RGADFQCFQQ ARQVGLAGTF RAFLSSRLQD
LYSIVRRADR AAVPIVNLRD EVLFSNWEAL FTGSGAPLRA GGRILSFDGR DVLRDAGW
//