ID A0A093GJZ5_DRYPU Unreviewed; 1655 AA.
AC A0A093GJZ5;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Collagen alpha-1(XXIV) chain {ECO:0000313|EMBL:KFV67277.1};
DE Flags: Fragment;
GN ORFNames=N307_08630 {ECO:0000313|EMBL:KFV67277.1};
OS Dryobates pubescens (Downy woodpecker) (Picoides pubescens).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Piciformes; Picidae; Dryobates.
OX NCBI_TaxID=118200 {ECO:0000313|EMBL:KFV67277.1, ECO:0000313|Proteomes:UP000053875};
RN [1] {ECO:0000313|EMBL:KFV67277.1, ECO:0000313|Proteomes:UP000053875}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N307 {ECO:0000313|EMBL:KFV67277.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL215901; KFV67277.1; -; Genomic_DNA.
DR STRING; 118200.A0A093GJZ5; -.
DR Proteomes; UP000053875; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF987; COLLAGEN ALPHA-1(XXIV) CHAIN; 1.
DR Pfam; PF01410; COLFI; 2.
DR Pfam; PF01391; Collagen; 10.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KFV67277.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000053875};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1655
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001884471"
FT DOMAIN 1456..1655
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 306..372
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 435..527
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 573..1030
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1051..1349
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1384..1420
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 307..353
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 354..368
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 445..487
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 847..865
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFV67277.1"
FT NON_TER 1655
FT /evidence="ECO:0000313|EMBL:KFV67277.1"
SQ SEQUENCE 1655 AA; 169164 MW; D4405DEEFCE53F0E CRC64;
RKLTLCLYFF FTGIDVLHQL GLVKDIQQSS ETAVPSTSSL LPRGVRLTVS GVVLTGDAHI
ESPTIQVIPS NLGRQFTILV GLYSYRVNNA FLFSIRKKSR LQFGVQLLPR KVVVYTGWKQ
SIYFDYSVHN EQWHSFAIDI RDKTVSIFAE CGKKYYSREV LSEVEIFDLD SLFTLGRMNS
HSVSFEGVIC QLDIIPSAEA AANYCKYVKQ QCRQAETYRS LLGAPSVSDG LDVFSKMMTV
EKKQSEGISA YRRKEASNIP VTTVAQIVFL PEHFESRNVS ASGFAEAKLL LLEAWPEIEF
QDKINLPPHQ QEMSEKRSIT KLPSGSYTST LDNTTKDAGR QSEPSLMSPV KQSKTETKRT
DKANQHIEES SKTQQIFNTT LYSLPADISL DNQLNPGQEG AFDADETYHM ETSYEISVDN
YSYDYEDLDK MFEMGSVRGP KGETGPPGPS GPPGLPGPPG KRGPRGIPGP HGNPGLPGLP
GPKGPKGDPG VSPGQAKRGE KGDPGPLGFP GPPGIKGQKG LKGFLGLQGP RGEQGIPGMA
GNIGMVGYPG RQGLAGPEGN PGPKGVRGFI GPPGVAGQPG PEGERGIPGT RGRKGPKGRQ
GFPGDFGNRG SPGPDGNPGI VGGVGPPGFP GLRGVIGPAG PRGPSGIPGP MGLPGSVGQP
GMKGDKGEHG LAGEPGEQGY PGDKGAVGLP GPPGIRGKPG PQGKIGDPGS LGPLGPPGPE
GFPGDIGVPG LNGPEGPKGL LGDRGPPGPQ GPKGVEGEVG PTGPIGGVGP RGKPGHKGYV
GDPGPEGLKG EQGDQGNTGK TGEAGSAGLP GETGPSGSLG EKGERGSPGP VGPPGEKGVM
GYAGPPGGPG PLGPEGLPGP SGTRGPPGPQ GLKGRMGPRG LDGPPGEPGT QGIKGERGDP
GKKGVPGLIG KAGNPGERGD QGALGLLGPP GTTGDRGPMG EPGPRGQPGD AGPIGETGFE
GPPGPEGEPG LQGEPGTKGD TGPAGKAGEP GIQGLRGEPG PPGEDGVQGK DGPKGDPGDP
GLFGEGGEKG QMGFPGTFLI HLFFSLINVQ GPEGTPGNPG QRGRLGKKGE KGQLGPPGET
GPAGDSGQPG EIGPKGARGT RGPLGRLGGM GPEGESGIPG YGGHQGPPGP SGPPGPKGEK
GYPGEDNTVL GPPGPIGEPG PSGERGDRGE PGDEGYKGHI GLPGLRGPIG QQGPPGEPGE
LGEQGQKGER GSEGPTGKKG AAGQAGKPGI PGKPGPAGEK GGVGYPGPEG ISGNPGKPGL
SGKPGPKGNK GNSGLQGLAG YPGARGPKGL PGMPGPRGEA GLKGETGILG HPGKRGKRGL
PGSQGDQGPA GDTGLKGQTG EGGDQGLMGI QGLPGLKGLA GDIGLVGILG PKGPTGQAGY
MGPVGEEGIT GPKGRPGLRG EKGTRGEMGP QGPRGQPGPR KQVDINAAVQ ALIESNAVLQ
REKYQNTEVT LLDHSTEIFK TLHYLSNLLH SIKNPLGTRD NPARICRDLL NCERKVSDGK
YWIDPNIGCP SDAIEVFCNF TAGGQTCLTP LSVTKLEFGV GKVQMNFLHL LSSEATHSIT
IHCLNTPMWR LNHAGGQKTS VSFKGWNGQI FKANTLLEPK VLMDECMIED GSWHKTQFFF
HTQDTNQLPV IQVNALPHLK PGQQHFIESG LVCFL
//