GenomeNet

Database: UniProt
Entry: A0A7K8LLY8_9AVES
LinkDB: A0A7K8LLY8_9AVES
Original site: A0A7K8LLY8_9AVES 
ID   A0A7K8LLY8_9AVES        Unreviewed;      1258 AA.
AC   A0A7K8LLY8;
DT   07-APR-2021, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 1.
DT   28-JAN-2026, entry version 17.
DE   SubName: Full=COIA1 protein {ECO:0000313|EMBL:NXE30159.1};
DE   Flags: Fragment;
GN   Name=Col18a1 {ECO:0000313|EMBL:NXE30159.1};
GN   ORFNames=ARDKOR_R02663 {ECO:0000313|EMBL:NXE30159.1};
OS   Ardeotis kori.
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Otidimorphae; Otidiformes;
OC   Otididae; Ardeotis.
OX   NCBI_TaxID=89386 {ECO:0000313|EMBL:NXE30159.1, ECO:0000313|Proteomes:UP000560386};
RN   [1] {ECO:0000313|EMBL:NXE30159.1, ECO:0000313|Proteomes:UP000560386}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=B10K-CU-031-01 {ECO:0000313|EMBL:NXE30159.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:NXE30159.1};
RA   Zhang G.;
RT   "Bird 10,000 Genomes (B10K) Project - Family phase.";
RL   Submitted (SEP-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:NXE30159.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; VWPR01003913; NXE30159.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A7K8LLY8; -.
DR   Proteomes; UP000560386; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00110; LamG; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000560386};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          20..208
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          209..706
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          719..1007
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1068..1088
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        215..226
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        295..307
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        308..317
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        322..331
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        384..400
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        402..415
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        429..443
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        471..483
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        498..511
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        512..523
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        524..535
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        559..575
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        583..595
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        606..617
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        618..636
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        671..685
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        691..700
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        825..838
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        893..907
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        908..921
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        944..953
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        964..977
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        987..999
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:NXE30159.1"
FT   NON_TER         1258
FT                   /evidence="ECO:0000313|EMBL:NXE30159.1"
SQ   SEQUENCE   1258 AA;  127709 MW;  188652AC680FB639 CRC64;
     QQAGCDPRLS LPLAENLSAE VSLLELIGDP PPEEIIKIYG PDNNPGYVFG PNANTGQVAR
     YHLPSPFYRD FSLLFHIQAT TPRAGVLFAV TDSSQSFIYV GVKLSELRAG KQQIIFYYTE
     PGSPSSYAAA TFTVPTLLNQ WTRFAISVED DEVILYLDCE EHERVHFERS PDEMELEDGS
     GLFVAQAGGA DPDKYQGVIA DLKLLGDPRA AERQCEEEED DTDEVSGDFG SGAEGGHRPS
     GKVKGVPGLV DAAPVTSPPV AAGGGPRSSG GSPQQAERTR AEEMLQVSAG GTGPKGEKGE
     KGDRGLKGDS GTGGIVGTGS VKGEKGEKGE LGLKGSAGFG YPGSKGQKGE PGDPGPPGTL
     SQHTDGSVVE QVAGPPGPPG KDGAPGRDGE PGDPGEDGKP GDMGPQGFPG TPGEPGQKGE
     KGEPGMGPRG PPGPPGPPGP PGPSFRHDKL TFIDMEGSGF GGDLENLRGP QGPPGPPGPP
     GVPGLPGEPG RFGMNRTDLP GLPGLPGRDG NPGPPGPAGP QGPPGRDGAA GQPGPKGERG
     DVGDLGLPGA PGPKGSKGEP GPAGAPGEMG LAGLPGPMGP RGQPGPPGPP GPPGPGYEAG
     FGDMEGSGLP FAASSPGAPG PQGPQVAPLP PLPSPPSRAL RGIPCHRGDA GVPGVDGRPG
     LEGFPGPQGP KGDRGSPGEK GERGQDGVGL PGPPGPPGPP GQVITLSNED GLTQALWWSS
     LKGPAGPKGD RGSSGPQGPP GLKGEKGEPG VIISPDGTVV TAKAKGEKGE PGLRGPMGPS
     GPQGRAGMKG EIGFPGRPGR PGMNGLKGEK GDPADVSGAL GLRGPPGPPG PPGPPGPPGS
     IAYDNNNVSG IAPLLPTGFY QFPGQKGEKG DVGAPGPPGH FPYDPSHFGA NLRGDKGEAG
     PKGEKGEPGS TPLYGPSISG LPGPPGPQGY PGLPGPKGDS IVGPPGPPGP QGPPGVGYEG
     RQGPPGPPGP PGPPSFPGPH RQAISIPGPP GPPGPPGPPG TSGTSLGLRA LPTYQAMLSA
     AHDLPEGGFV FLTDRQELYL RVRGGFRRVL HQPSACLSLS PQDNEVYDKP PSIHYAGPQP
     PLQPHGPLHP LRNHGPLPTA RPWRGDEVVA NQHRLPEQPL LHHQHELLNS YYTQRRPDPA
     PVAAHVHQDF QPALHLVALN APLSGGMRGI RGADFQCFQQ ARQVGLAGTF RAFLSSRLQD
     LYSIVRRADR AAVPIVNLRD EVLFSNWEAL FTGSGAPLRA GGRILSFDGR DVLRDAGW
//
DBGET integrated database retrieval system