GenomeNet

Database: UniProt
Entry: A0A7K6K1G1_9PASE
LinkDB: A0A7K6K1G1_9PASE
Original site: A0A7K6K1G1_9PASE 
ID   A0A7K6K1G1_9PASE        Unreviewed;      1336 AA.
AC   A0A7K6K1G1;
DT   07-APR-2021, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 1.
DT   28-JAN-2026, entry version 18.
DE   SubName: Full=COFA1 protein {ECO:0000313|EMBL:NWW06431.1};
DE   Flags: Fragment;
GN   Name=Col15a1_1 {ECO:0000313|EMBL:NWW06431.1};
GN   ORFNames=OREARF_R13526 {ECO:0000313|EMBL:NWW06431.1};
OS   Oreocharis arfaki (tit berrypecker).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Telluraves; Australaves;
OC   Passeriformes; Passeroidea; Paramythiidae; Oreocharis.
OX   NCBI_TaxID=979223 {ECO:0000313|EMBL:NWW06431.1, ECO:0000313|Proteomes:UP000542358};
RN   [1] {ECO:0000313|EMBL:NWW06431.1, ECO:0000313|Proteomes:UP000542358}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=B10K-DU-029-42 {ECO:0000313|EMBL:NWW06431.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:NWW06431.1};
RA   Zhang G.;
RT   "Bird 10,000 Genomes (B10K) Project - Family phase.";
RL   Submitted (SEP-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:NWW06431.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; VZRR01005874; NWW06431.1; -; Genomic_DNA.
DR   Proteomes; UP000542358; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050938; Collagen_Structural_Proteins.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR37456:SF6; COLLAGEN ALPHA-1(XXIII) CHAIN-LIKE ISOFORM X2; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000542358};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          8..196
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          193..247
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          545..675
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          705..742
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          758..808
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          825..891
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          918..962
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1037..1076
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        554..569
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        600..615
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        656..675
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        706..715
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        729..738
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        765..781
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        789..798
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        846..864
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        880..890
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1042..1054
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1064..1073
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:NWW06431.1"
FT   NON_TER         1336
FT                   /evidence="ECO:0000313|EMBL:NWW06431.1"
SQ   SEQUENCE   1336 AA;  139273 MW;  01C3A0E05F3996B9 CRC64;
     TERGSKGHLD LTVLIGVPLP PSVYFVTGYG GFPAYSFGPD SNIGRLTSAV IPSPFYRDFA
     IVVTVKPNSD RGGALFAITD AFQKTIYLGL RISPVDDSTQ RIIMYYTEPG SPVSREAASF
     KVPVMTNRWN RFTVTVEGND VALFMDCEEY QRLQFQRSAR TLVFESGSGI FVGNAGATGL
     EKFTGSIQHL TIKSDPRATE DHCEDDDPYA SGDSSGNGSI QEHDVSEAQE ALGSSRLPIG
     PEDTLAEPVE APPTILSYLE ENDFSGNHRS EETTEAAKFK EQGTASGFGV FWGGWGSAVM
     ETGQGNSEST TVTQKMLREE EGSGAGVLPG VSREEVFNCY VSLTPFLSCS NSTASSPSSF
     LLSFLCVPLL HPFLSHALSS CWDVPSQLLQ ITPIPGSVHH KLLIACTRIV CGEVLSGNAH
     SNKPVAVTFC AHLSCCCVSV LCWSSTAWLL CFLSRCFVAA TLLEPEESGS GDTDRETEIL
     RVSISHNKEL YFTHAMLQHW VLLSCCSFPW EHSGGFLGQG RDCLPSCSPS RTPCTAAPAS
     CQPATSAMAK EGLPGPPGPP GLPGLPGKPA PDSGVGPPGS PGEDGTSGEP GPEVNLLSLQ
     GPQGPPGRDG VVGPPGWKGE KGDQGLPGSA GPKGDTGVTG SIGPKGEAGP VGSPGKPGPP
     GPPGPPGPPG PPGPPGLSYS LGFEVFCYSE ICYNYGYISS RVKAGPKGEK GDPGPRGEPG
     QDGNSIVGPP GPPGPPGPVI AIPELLLNDT DGIFNFTEIK GLLGPPGPNG KPGLPGFPGP
     RGPKGDAGSP GSQGPKGQQG EKGEPGAIIS ADGSLTELLG RKGEKGEAGV VGPAGPMGPI
     GPPGPKGELG FPGRPGRPGL NGLRGVKGDR GEAFNGLPGL PGPPGPPGPP GRILYIKGTV
     FPVPPRPHCK MPVNTPYPEN QDVLNDHGPK ANGDSWGLHS SVHLKGEKGD RGAPGPPGPP
     LPPSYFSHFI NSIKGEKGDN GVTGVKGEKG EPNGGFFLTG PPGPPGRPGL IGPKGDSVVG
     PRGPPGLPGL PGLPGYGKIG LPGPPGPPGP PGPPAIYGSA AAMPGPPGPP GEPGSPATRN
     LVTTFQNIEG MLEKVHYVAE GTLIYLRETS EVFIRVRNGW RKLQLGELIP IPADSLPPPA
     ISSHGFQSIP ALRPISNMNN GKPALHMVAL NFPLSGDMRA DFQCFRQAQL AGLTSTYRAF
     LSSHLQDLAT VVRKTDRNHL PVVNLQGETL FSNWESIFDG NGGQFNIHVP IYSFDGRNVM
     MDSSWPQKVI WHGSTANGIR LVSNYCEAWH TADMGAMGQA SPLNTGKLLD QKVYSCNNQF
     IVLCIENSFV SDPQGK
//
DBGET integrated database retrieval system