GenomeNet

Database: UniProt
Entry: A0A7K9D3M7_9AVES
LinkDB: A0A7K9D3M7_9AVES
Original site: A0A7K9D3M7_9AVES 
ID   A0A7K9D3M7_9AVES        Unreviewed;      1331 AA.
AC   A0A7K9D3M7;
DT   07-APR-2021, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 1.
DT   28-JAN-2026, entry version 17.
DE   SubName: Full=COFA1 protein {ECO:0000313|EMBL:NXG59099.1};
DE   Flags: Fragment;
GN   Name=Col15a1_1 {ECO:0000313|EMBL:NXG59099.1};
GN   ORFNames=HEMCOM_R09142 {ECO:0000313|EMBL:NXG59099.1};
OS   Hemiprocne comata.
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Strisores; Apodiformes; Apodidae;
OC   Hemiprocninae; Hemiprocne.
OX   NCBI_TaxID=243314 {ECO:0000313|EMBL:NXG59099.1, ECO:0000313|Proteomes:UP000518305};
RN   [1] {ECO:0000313|EMBL:NXG59099.1, ECO:0000313|Proteomes:UP000518305}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=B10K-DU-001-23 {ECO:0000313|EMBL:NXG59099.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:NXG59099.1};
RA   Zhang G.;
RT   "Bird 10,000 Genomes (B10K) Project - Family phase.";
RL   Submitted (SEP-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:NXG59099.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; VWZJ01005410; NXG59099.1; -; Genomic_DNA.
DR   OrthoDB; 10060752at2759; -.
DR   Proteomes; UP000518305; Unassembled WGS sequence.
DR   GO; GO:0005594; C:collagen type IX trimer; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF372; COLLAGEN ALPHA-1(XVI) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000518305};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          8..196
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   DOMAIN          57..195
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|SMART:SM00282"
FT   REGION          544..602
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          699..730
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          763..803
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          823..855
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          922..962
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1020..1074
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        559..575
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        576..590
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        711..721
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        787..796
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        841..855
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        922..931
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        940..949
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        952..961
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1039..1049
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1059..1068
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:NXG59099.1"
FT   NON_TER         1331
FT                   /evidence="ECO:0000313|EMBL:NXG59099.1"
SQ   SEQUENCE   1331 AA;  141364 MW;  CFF36D3F5C28AEC8 CRC64;
     TERGSEGHLD LTELIGVPLP PSVYFVTGYG GFPAYSFGPD ANIGRLTSAI IPLPFYRDFA
     IMVTVKPNSD HGGVLFAITD AFQKTIYLGM RLSPADDSTQ RIILYYTEPG SHISQEAASF
     QVPVMTNRWN KFTVTVQGND VALFMDCEEY QRSQFQRSAR ALVFDSGSGI FVGNAGAAGL
     EKFTGSIQHL IIKSNTKATE DHCEDDDPYA SGDLSGNASV QEHEGISETQ EVLAPSHLPI
     RPEDILAEPV EAPPTILSYL EEKVFSGNHS SEETSEAAKL KEQGTASFVP YPTWPLTLPG
     MEFPQLLWAA CSSVSLPSPS QNFLLISKHK SLLLQFKTIT PCPLTTSPCN KSLHSFPVGP
     LQVLKGHYEI SLEPSLLQAE QLQLSQPVFI GEVLQHSYHL HGPPLDCLQE LHRGRMTSLD
     LLATLFFMQP RIPLAFLATR AHCWLMDTFL SIRTPRSFSS ELLSSRSTPI LYWCLGFFLP
     RFPTLALIER HEVLLCPTLQ PVQVTLNSST GLWSVSHPSQ FGIISNLLRI HSVPSSRSLM
     NKLNKTRPSI DPWGTPLGLP GPPGPPGPPG LPGKPAPSSE GGSPGIPGED GPSGEPGPEV
     SRATDIIYPD LSKAFHTVSH NILVSKLEQY GSDGWTTHWI RNWLTGHTQR VVINGSVSKW
     KHVTSGVPQG SVVGPVLFTS LLATWTVESS ASSVSLLMTP NCGPKGEKGD PGPQGEPGQD
     GNSIVVLPEP PGPPGTIIAI PELLLNNTYG IFNFTGIKGL LGPPGPEGKP GLLGFPGPQG
     PKGDAGLPGS QGPKGQQGEK GEPGAIIAAD GSLTELLGRK GEKGEAGVMG PVGPMGPIGP
     TGPKGELGFP GRPGRPGLNG LRGMKGDRGK AFTGLPGLPG PPGPPGPPGR IVHIKGTVFP
     VSSRPHCKMP VSHIESVFSK MHGHDSAKAS RDSLGLHSSV DFKGEKGDRG APGPPGPPLP
     PSYFSHFISS IKGEKGDNGV TGVKGEKGEP NGGFFLAGPP GLPGRPGLVG PKGDSVVGPR
     GPPGLPGLPG LPGYGRIGPP GPPGPPGPPA IYGSAAAMPG PPGPPGEPGS PATSNLVTTF
     RNIEGMLEKV HFVAQGTLTY LTETSEIFIR VQNGWKKLQL GELIPIPADS LPPPAISSHG
     FQSLPALSPI SNMNNGKPAL HLVALNLPFS GDMQADFQCF QQAQLAGLTS TYRAFLSSHL
     QDLATVVRKT DRYHLPIVNL KGEILFNNWE SVFNGDGGQF NLDVPIYSFD GRNIMTDPSW
     PQKVIWHGST ANGIRLVSNY CKAWHTADMG AMGQASPLKT GKLLDQNVYS CSNQLIVLCI
     ENSFVSNPQG K
//
DBGET integrated database retrieval system