Database: UniProt
Entry: A0A3L8RZ40_CHLGU
ID   A0A3L8RZ40_CHLGU        Unreviewed;       818 AA.
AC   A0A3L8RZ40;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   28-JAN-2026, entry version 26.
DE   RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=DV515_00014156 {ECO:0000313|EMBL:RLV91351.1};
OS   Chloebia gouldiae (Gouldian finch) (Erythrura gouldiae).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Telluraves; Australaves;
OC   Passeriformes; Passeroidea; Passeridae; Chloebia.
OX   NCBI_TaxID=44316 {ECO:0000313|EMBL:RLV91351.1, ECO:0000313|Proteomes:UP000276834};
RN   [1] {ECO:0000313|EMBL:RLV91351.1, ECO:0000313|Proteomes:UP000276834}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Red01 {ECO:0000313|EMBL:RLV91351.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:RLV91351.1};
RX   PubMed=30282656;
RA   Toomey M.B., Marques C.I., Andrade P., Araujo P.M., Sabatino S.,
RA   Gazda M.A., Afonso S., Lopes R.J., Corbo J.C., Carneiro M.;
RT   "A non-coding region near Follistatin controls head colour polymorphism in
RT   the Gouldian finch.";
RL   Proc. R. Soc. B 285:0-0(2018).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:RLV91351.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; QUSF01000113; RLV91351.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A3L8RZ40; -.
DR   OrthoDB; 10060752at2759; -.
DR   Proteomes; UP000276834; Unassembled WGS sequence.
DR   GO; GO:0005594; C:collagen type IX trimer; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1114; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000276834}.
FT   DOMAIN          566..609
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          651..812
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..54
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          85..457
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          517..543
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..15
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        44..54
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        210..219
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        220..229
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        244..255
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        256..271
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        301..319
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        320..329
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        335..349
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        358..392
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        437..449
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        526..541
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   818 AA;  83756 MW;  A9D28835C875D11A CRC64;
     MRSVGAGAGR AGGVSGAADN SGRRAGAAGP DGAGEWDEPR TPILGSRPRG SRMGSSSLLQ
     LLRTLCIVSA LARCLHPATA QWFSLGSEDT TPDPGTSPMP PSLDGDEDTD NSVEPVGKVL
     LSKPPLATAP KRRDQLSRGA AKGSGRAPPR TRGQHRPTAA PETFEGSAVE EEFLQIQTTA
     KGLPRRVPSA PETDPALQVH NISGCVCPAR PGPPGPPGPK GEKGDRGFPG ERGQPGFLGE
     RGKSGSPGQP GHQGPRGPPG PPGPPGPPGP PGTWGARGPP VPAAVPERSE NELGVSSPAG
     NPGPPGPPGL PGMPGPPGYP GHDGPQGAPG REGKPGPPGP PGAVGPPGFP GAEGLPGSPG
     PAGPDGPPGA PGLPGPQGPP GMPGHEGPPG PPGSASLPGK PGLRGEPGFP GPKGEKGEYG
     LPGMPGSPGR TGEPGSPGMP GPMGPPGPPG DYRCDSRHAG HRGLAGLPGP KGCCYGEHGC
     KPGHLPFPST GSQPASWGSI NRYQTDGKEE PEIYGAIIPH GLQGPPGNPG PPGPPGPPGP
     PGLLYLNRMH PVRAQPPCKQ PHQTWVFKSK ELMVKASSAV PEGSLVYVRE GSNAFLRTPT
     GWSRLLLEDP KSFLAGDDPS ASTPQYQIQK EEEQGLPQIL PTTTAPRIPS LRLAALNVPL
     WGDMSGIRGA DLQCYRQSQE AGLYGTFRAF LSAPTQHLVS IVKRTDRTLP IGQLLAKSWS
     SLFGGQSGAV LKGPIYSFNG RNVLADPLWP HQLAWHGSTP RGSHARRWDC QGWRSSGTAE
     GMAAALGEGR LLAGHRHNCS APLAVLCVEV AFPYRHMW
//