ID A0A7L3AVY1_9AVES Unreviewed; 1379 AA.
AC A0A7L3AVY1;
DT 07-APR-2021, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 1.
DT 08-OCT-2025, entry version 17.
DE SubName: Full=COFA1 protein {ECO:0000313|EMBL:NXT23103.1};
DE Flags: Fragment;
GN Name=Col15a1_0 {ECO:0000313|EMBL:NXT23103.1};
GN ORFNames=SYRPAR_R02927 {ECO:0000313|EMBL:NXT23103.1};
OS Syrrhaptes paradoxus (Pallas's sandgrouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Neoaves; Columbimorphae; Pterocliformes;
OC Pteroclidae; Syrrhaptes.
OX NCBI_TaxID=302527 {ECO:0000313|EMBL:NXT23103.1, ECO:0000313|Proteomes:UP000536260};
RN [1] {ECO:0000313|EMBL:NXT23103.1, ECO:0000313|Proteomes:UP000536260}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=B10K-DU-003-42 {ECO:0000313|EMBL:NXT23103.1};
RC TISSUE=Mixed tissue sample {ECO:0000313|EMBL:NXT23103.1};
RA Zhang G.;
RT "Bird 10,000 Genomes (B10K) Project - Family phase.";
RL Submitted (SEP-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC {ECO:0000256|ARBA:ARBA00061275}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:NXT23103.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; VZTO01012519; NXT23103.1; -; Genomic_DNA.
DR Proteomes; UP000536260; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF912; COLLAGEN ALPHA-1(XV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000536260};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 8..196
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 196..222
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 594..712
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 736..774
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 794..851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 864..932
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 983..1008
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1020..1121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 594..605
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 615..643
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 691..710
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 748..758
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 800..816
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 886..907
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 923..932
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 997..1006
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1084..1097
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1107..1116
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:NXT23103.1"
FT NON_TER 1379
FT /evidence="ECO:0000313|EMBL:NXT23103.1"
SQ SEQUENCE 1379 AA; 144563 MW; F2B78C4B2BE54FD6 CRC64;
TERGSKGHLD LTELIGVPLP PSVYFVTGYG GFPAYSFGPD ANIGRLTNAI IPLPFYRDFA
IVVTVKPNSD RGGTLFAITD AFQKTIYLGI RLSPVDDGTQ RIIMYYTEPG SHISREAVSF
KVPVMTNRWN RFTVAVQGDE VALFMDCEEY QRVQFQRSAQ ALVFESGSGI FVGNAGATGL
EKFTGSVQHL TIKSDLRDTE EHCEDDDPYA SGDTSGNGSI QEHEGISEAQ EVLAPSHLPI
RPEDMLAEPV EAPPTVLSYL EENNFSGNHG SEEISEAAEL KEQGTVSFFW RGAGEREGVG
QGNSESTTVT QKILQKEDGS GASVLPGVSR EEVFLPQVAK KANGILACIS SSVASRTRAG
IVPLDSTLVR PHLGSCVQFW APHYKKDIEG LEQVQRRAME LGKGLEHRSD EERLRELGVF
SLEKRRLRGD LITLYSCLKG GCSEVSAGLC SKGTSDGTRG NGLKLHQGRF RLDTRENFFS
ERVMRHWNRL PRAVVKSPCL EGRSVFLDSG TISQCKFCTC LIHSIQKCFV GNMGFSGLCD
RGDQTSVIGK EYSCQVNLSL GLLMHTWPWV KMGLSKQVDV HLILLLEGLP GPPGPPGLPG
LPGKPAPNSG VRPPGSLGED GASGEAGAEV YYHLQGPQGP PGLDGVAGPP GQKGEKGDQG
LPGSVGPKGD TGDTGSIGPK GEAGAVGSPG KPGPPGPPGP PGPPGPPGPP GLSYSLGFEV
SCCHGVHCSY GSTSFRGGTG PKGEKGDPGP QGEPGQDGNS IVGPPGPPGP PGTIIAIPEL
LLNGTNGMFN FTEIKGLLGP PGPDGKPGLP GFPGPRGPKG DTGLPGSQGP KGQQAREHAW
GKGAKGEPGA IVAADGSLTE LLGRKGEKGE AGVVGPVGPM GPMGPTGPKG ELGFPGRPGR
PGLNGLRGVK GDRGEAFNGL PGLPGPPGPP GPPGRIVYIK GTVFPVSPRP HCKMPVSTPY
PGNQEALNVH GAKVNRDSWG LHSSAHLKGE KGDRGAPGPP GPPLPPSYFS HFINSIKGEK
GDNGVTGVKG EKGEPNGGFF LTGPPGPPGR PGLVGPKGDS VVGPRGPPGL PGLPGLPGYG
KTGPPGPPGP PGPPGPPAIY GSAAAMPGPP GPPGEPGSPA TRNLVMTFQN IEGMLEKVHF
VAEGTLIYLS ETSEVFIRVR NGWRKLQLGE LIPIPADSLP PPAISSHGFQ SLPALNPVSN
INSGKPALHL VALNLPFSGD MRADFQCFQQ AQLAGLTSTY RAFLSSHLQD LATVVRKTDR
YHLPIVNLKG ETLFNNWESI FNGNGGQFNI HVPIYSFDGR NVMTDPSWPQ KVIWHGSTAN
GIRLVSNYCE AWHTAGMGAM GQASPLKTGK LLDQKAYSCS NQFIVLCIEN SFVSDPQGE
//