GenomeNet

Database: UniProt
Entry: A0A7L3AVY1_9AVES
LinkDB: A0A7L3AVY1_9AVES
Original site: A0A7L3AVY1_9AVES 
ID   A0A7L3AVY1_9AVES        Unreviewed;      1379 AA.
AC   A0A7L3AVY1;
DT   07-APR-2021, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 1.
DT   08-OCT-2025, entry version 17.
DE   SubName: Full=COFA1 protein {ECO:0000313|EMBL:NXT23103.1};
DE   Flags: Fragment;
GN   Name=Col15a1_0 {ECO:0000313|EMBL:NXT23103.1};
GN   ORFNames=SYRPAR_R02927 {ECO:0000313|EMBL:NXT23103.1};
OS   Syrrhaptes paradoxus (Pallas's sandgrouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Columbimorphae; Pterocliformes;
OC   Pteroclidae; Syrrhaptes.
OX   NCBI_TaxID=302527 {ECO:0000313|EMBL:NXT23103.1, ECO:0000313|Proteomes:UP000536260};
RN   [1] {ECO:0000313|EMBL:NXT23103.1, ECO:0000313|Proteomes:UP000536260}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=B10K-DU-003-42 {ECO:0000313|EMBL:NXT23103.1};
RC   TISSUE=Mixed tissue sample {ECO:0000313|EMBL:NXT23103.1};
RA   Zhang G.;
RT   "Bird 10,000 Genomes (B10K) Project - Family phase.";
RL   Submitted (SEP-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:NXT23103.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; VZTO01012519; NXT23103.1; -; Genomic_DNA.
DR   Proteomes; UP000536260; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF912; COLLAGEN ALPHA-1(XV) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000536260};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          8..196
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          196..222
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          594..712
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          736..774
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          794..851
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          864..932
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          983..1008
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1020..1121
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        594..605
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        615..643
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        691..710
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        748..758
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        800..816
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        886..907
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        923..932
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        997..1006
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1084..1097
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1107..1116
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:NXT23103.1"
FT   NON_TER         1379
FT                   /evidence="ECO:0000313|EMBL:NXT23103.1"
SQ   SEQUENCE   1379 AA;  144563 MW;  F2B78C4B2BE54FD6 CRC64;
     TERGSKGHLD LTELIGVPLP PSVYFVTGYG GFPAYSFGPD ANIGRLTNAI IPLPFYRDFA
     IVVTVKPNSD RGGTLFAITD AFQKTIYLGI RLSPVDDGTQ RIIMYYTEPG SHISREAVSF
     KVPVMTNRWN RFTVAVQGDE VALFMDCEEY QRVQFQRSAQ ALVFESGSGI FVGNAGATGL
     EKFTGSVQHL TIKSDLRDTE EHCEDDDPYA SGDTSGNGSI QEHEGISEAQ EVLAPSHLPI
     RPEDMLAEPV EAPPTVLSYL EENNFSGNHG SEEISEAAEL KEQGTVSFFW RGAGEREGVG
     QGNSESTTVT QKILQKEDGS GASVLPGVSR EEVFLPQVAK KANGILACIS SSVASRTRAG
     IVPLDSTLVR PHLGSCVQFW APHYKKDIEG LEQVQRRAME LGKGLEHRSD EERLRELGVF
     SLEKRRLRGD LITLYSCLKG GCSEVSAGLC SKGTSDGTRG NGLKLHQGRF RLDTRENFFS
     ERVMRHWNRL PRAVVKSPCL EGRSVFLDSG TISQCKFCTC LIHSIQKCFV GNMGFSGLCD
     RGDQTSVIGK EYSCQVNLSL GLLMHTWPWV KMGLSKQVDV HLILLLEGLP GPPGPPGLPG
     LPGKPAPNSG VRPPGSLGED GASGEAGAEV YYHLQGPQGP PGLDGVAGPP GQKGEKGDQG
     LPGSVGPKGD TGDTGSIGPK GEAGAVGSPG KPGPPGPPGP PGPPGPPGPP GLSYSLGFEV
     SCCHGVHCSY GSTSFRGGTG PKGEKGDPGP QGEPGQDGNS IVGPPGPPGP PGTIIAIPEL
     LLNGTNGMFN FTEIKGLLGP PGPDGKPGLP GFPGPRGPKG DTGLPGSQGP KGQQAREHAW
     GKGAKGEPGA IVAADGSLTE LLGRKGEKGE AGVVGPVGPM GPMGPTGPKG ELGFPGRPGR
     PGLNGLRGVK GDRGEAFNGL PGLPGPPGPP GPPGRIVYIK GTVFPVSPRP HCKMPVSTPY
     PGNQEALNVH GAKVNRDSWG LHSSAHLKGE KGDRGAPGPP GPPLPPSYFS HFINSIKGEK
     GDNGVTGVKG EKGEPNGGFF LTGPPGPPGR PGLVGPKGDS VVGPRGPPGL PGLPGLPGYG
     KTGPPGPPGP PGPPGPPAIY GSAAAMPGPP GPPGEPGSPA TRNLVMTFQN IEGMLEKVHF
     VAEGTLIYLS ETSEVFIRVR NGWRKLQLGE LIPIPADSLP PPAISSHGFQ SLPALNPVSN
     INSGKPALHL VALNLPFSGD MRADFQCFQQ AQLAGLTSTY RAFLSSHLQD LATVVRKTDR
     YHLPIVNLKG ETLFNNWESI FNGNGGQFNI HVPIYSFDGR NVMTDPSWPQ KVIWHGSTAN
     GIRLVSNYCE AWHTAGMGAM GQASPLKTGK LLDQKAYSCS NQFIVLCIEN SFVSDPQGE
//
DBGET integrated database retrieval system