GenomeNet

Database: UniProt
Entry: A0A851QQV3_TYCCO
LinkDB: A0A851QQV3_TYCCO
Original site: A0A851QQV3_TYCCO 
ID   A0A851QQV3_TYCCO        Unreviewed;      1341 AA.
AC   A0A851QQV3;
DT   29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 1.
DT   28-JAN-2026, entry version 15.
DE   SubName: Full=COFA1 protein {ECO:0000313|EMBL:NXC80739.1};
DE   Flags: Fragment;
GN   Name=Col15a1_0 {ECO:0000313|EMBL:NXC80739.1};
GN   ORFNames=CERCOR_R02258 {ECO:0000313|EMBL:NXC80739.1};
OS   Tychaedon coryphoeus (Karoo scrub-robin) (Erythropygia coryphaeus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Telluraves; Australaves;
OC   Passeriformes; Muscicapidae; Cercotrichas.
OX   NCBI_TaxID=614051 {ECO:0000313|EMBL:NXC80739.1, ECO:0000313|Proteomes:UP000631545};
RN   [1] {ECO:0000313|EMBL:NXC80739.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=OUT-0024 {ECO:0000313|EMBL:NXC80739.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:NXC80739.1};
RA   Zhang G.;
RT   "Bird 10,000 Genomes (B10K) Project - Family phase.";
RL   Submitted (SEP-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:NXC80739.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; WBND01000065; NXC80739.1; -; Genomic_DNA.
DR   Proteomes; UP000631545; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000631545};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          8..196
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          193..239
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          265..366
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          560..682
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          703..744
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          767..813
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          945..970
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1028..1084
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        266..281
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        294..304
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        568..581
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        582..602
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        611..620
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        661..680
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        711..720
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        734..743
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        770..786
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        794..803
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        959..968
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1046..1059
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1069..1078
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:NXC80739.1"
FT   NON_TER         1341
FT                   /evidence="ECO:0000313|EMBL:NXC80739.1"
SQ   SEQUENCE   1341 AA;  140431 MW;  E5D2E4F4FB161795 CRC64;
     PERGSKGHLD LTELIGVPLP PSVYFVTGYG GFPAYSFGPD SNIGRLASAM VPSPFYRDFA
     VVVTAKPNRD RGGVLFAITD AFQKTIYLGL RIAPVDDSTQ RIIMYYTEPG SHISREAASF
     KVPVMTNRWN KFTVAVEGND VALFMDCEEY QRLHFQRSAR ALVFESGSGI FVGNAGATGL
     EKFTGSIQHL TIKSDPRATE DHCEDDDPYA SGDSSGNDSI QDHDVSEAQE ALAPSRLPIR
     PEDVLAEPVE APPTILSYLE ENDFSGNHRS EETSEAAKFK EQGTASDSAV METGQGNSES
     SPVTQKMLRE EDGSGAGVLP GASREKVCQE VEDGSTEPLG GSGIEDVENK DQGPPGSPGK
     PGQLGQPVSI FSLLKEILCF RKGDGDPSHM HGELELHSLA HQLHFHETGF SPAAQEDPAC
     PAQVSGTGAL WVTPTRRQRQ TAAVPTYKCS QSSNSNSLGS TAAPSQLPGL SVCKGAEPTF
     CLVFCCTAAA FKRPVRQPQI SRTRCSRPTA LPKGHFYKYS REHQVLHCLY FPVLCDSKFQ
     SLLFCSEVEE VSVHPRNASL TPRHLKWPPG PPGLPGLPGM PAPDSGVGPP GSPGEDGASG
     EPGPEGPQGP PGQDGVVGPP GWKGEKGDQG LRGSAGPKGD TGVTGSIGPK GEAGPIGSPG
     KPGPPGPPGP PGPPGPPGPP GQSYGLGFEV FCYSELCCNY GYTSSRGRTG PKGEKGDPGP
     RGEPGQDGNS VVGPPGPPGP PGPVIAIPEF LLNDTDGIFN FTESKGLLGP PGPDGKPGLP
     GFPGPRGPKG DTGLPGLQGP KGQQGEKGEP GAIISADGSL TELLGRKGEK GEAGVVGPAG
     PMGPIGPTGP KGELGFPGRP GRPGLNGLRG VKGDRGEAVY GPPGLPGPPG PPGPPGRILY
     IKGTVFPVPP RPHCKMPVNT PYPGNQEVLN DHGARANRDS WGLHSSAHLK GEKGDRGAPG
     PPGPPLPPSY FSHFINSIKG EKGDNGVTGV KGEKGEPNGG FFLTGPPGPP GRPGLIGPKG
     DSVVGPRGPP GLPGLPGLPG YGKIGPPGPP GPPGPPGPPA IYGSAAAMPG PPGPPGEPGS
     PATRNLVTTF QNIEGMLEKV HYVAEGTLIY LRETSEVFIR VRKGWRKLQL GELIPIPADS
     LPPPAISSHG FQSIPALRPI SNMNNGKPAL HLVALNFPLS GDMRADFQCF RQAQLAGLTS
     TYRAFLSSHL QDLATVVRKT DRQHLPIVNL QGETLFNNWE SIFDGNGGQF NIHVPIYSFD
     GRNVMTDSSW PQKIIWHGST ANGIRLVSNY CEAWHTADMG AMGQASPLNT GKLLDQKVYS
     CNNLFIVLCI ENSFVSDLQR K
//
DBGET integrated database retrieval system