GenomeNet

Database: UniProt
Entry: A0A1V4JQU4_PATFA
LinkDB: A0A1V4JQU4_PATFA
Original site: A0A1V4JQU4_PATFA 
ID   A0A1V4JQU4_PATFA        Unreviewed;       828 AA.
AC   A0A1V4JQU4;
DT   07-JUN-2017, integrated into UniProtKB/TrEMBL.
DT   07-JUN-2017, sequence version 1.
DT   08-OCT-2025, entry version 28.
DE   SubName: Full=Collagen alpha-1(XVIII) chain {ECO:0000313|EMBL:OPJ74474.1};
GN   ORFNames=AV530_001329 {ECO:0000313|EMBL:OPJ74474.1};
OS   Patagioenas fasciata monilis.
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Columbimorphae; Columbiformes;
OC   Columbidae; Patagioenas.
OX   NCBI_TaxID=372326 {ECO:0000313|EMBL:OPJ74474.1, ECO:0000313|Proteomes:UP000190648};
RN   [1] {ECO:0000313|EMBL:OPJ74474.1, ECO:0000313|Proteomes:UP000190648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BTP2013 {ECO:0000313|EMBL:OPJ74474.1};
RC   TISSUE=Blood {ECO:0000313|EMBL:OPJ74474.1};
RA   Soares A.E., Novak B.J., Rice E.S., O'Connell B., Chang D., Weber S.,
RA   Shapiro B.;
RT   "Band-tailed pigeon sequencing and assembly.";
RL   Submitted (FEB-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OPJ74474.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LSYS01006700; OPJ74474.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1V4JQU4; -.
DR   STRING; 372326.A0A1V4JQU4; -.
DR   OrthoDB; 10060752at2759; -.
DR   Proteomes; UP000190648; Unassembled WGS sequence.
DR   GO; GO:0005594; C:collagen type IX trimer; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1114; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:OPJ74474.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000190648};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..28
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           29..828
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5013070542"
FT   DOMAIN          549..592
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          657..822
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          41..68
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          82..109
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          155..427
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          443..465
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          766..789
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        168..177
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        192..203
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        204..219
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        249..267
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        283..297
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        298..312
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        313..340
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        376..385
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        387..397
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        400..412
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   828 AA;  84899 MW;  7FE101792DD83BB8 CRC64;
     MGSGGPQRLL RALCILSVLA EHLPNATAQW FYPLGAEDTT PDPSISPAAP TGPALDVEED
     AGSVEPPRKV LLSKPPLAMA PRGRQSLARG AAKAPGHALP RTRGQHRPTA APEIFEGSAE
     EEEFLQIQTT AKGLPQRVLL APEMDPALQM HNRSSCVCSV RPGPPGPKGE KGDRGFPGER
     GQPGFSGEKG KTGSPGQPGH QGPRGPPGPP GPPGPPGPPG TWGGRSPPMA AALPGGSENE
     LGASSPAGNP GPPGPPGLPG QPGPPGYPGH EGPPGVPGRD GKPGPPGPPG PVGPPGFPGA
     EGAPGSPGSA GPDGPPGAPG LPGPQGPPGV PGHEGPPGPT GPASLPGKPG LRGEPGFPGL
     KGEKGEYGLP GMPGSPGRTG ETGAPGAPGP MGPPGPPGDY RCDSRHAGHR ETAGPPGPKG
     EKGDPGERGC CYGEHGCKPG HLPFPGTGVQ PGSWAPISGY QTGGKEEPEI YGAIIPHGLR
     GLPGNPGPPG PPGPPGAPGL LYFNRLYPSR AQQPCKQPAA TDTGWAADAD IPRTELPDSR
     ADLQRQTWVF RSKELMLKSG SAVPEGSLVY VREGSSAFLR TPTGWSRLLL EDSESLFAGD
     DPSASTPQYQ AAKRAQMKGD NTGTPVLAQT HSLVQKQEGQ GLPQIPPTTI APRIPSLRLA
     ALNVPLAGDM SGVRGADLQC YRQSQEAQLY GTFRAFLSAP TQDLVSIVKR TDRTLPVVNL
     KGQLLAKSWS SLFNGQAGAV PRGPIYSFNG RNVLTDPLWP QRLAWHGSTP RGGHARRRDC
     QGWRSSGPGE GLAAPLGEGR LLAGQRHNCS QALAVLCVEV AFPYRHMW
//
DBGET integrated database retrieval system