ID A0A099Z9P2_TINGU Unreviewed; 935 AA.
AC A0A099Z9P2;
DT 07-JAN-2015, integrated into UniProtKB/TrEMBL.
DT 07-JAN-2015, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Collagen alpha-1(XXVIII) chain {ECO:0000313|EMBL:KGL79224.1};
DE Flags: Fragment;
GN ORFNames=N309_10307 {ECO:0000313|EMBL:KGL79224.1};
OS Tinamus guttatus (White-throated tinamou).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Palaeognathae; Tinamiformes; Tinamidae; Tinamus.
OX NCBI_TaxID=94827 {ECO:0000313|EMBL:KGL79224.1, ECO:0000313|Proteomes:UP000053641};
RN [1] {ECO:0000313|EMBL:KGL79224.1, ECO:0000313|Proteomes:UP000053641}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N309 {ECO:0000313|EMBL:KGL79224.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (JUN-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL891832; KGL79224.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A099Z9P2; -.
DR STRING; 94827.A0A099Z9P2; -.
DR Proteomes; UP000053641; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd01472; vWA_collagen; 1.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF18; COLLAGEN ALPHA-1(VI) CHAIN; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KGL79224.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000053641}.
FT DOMAIN 4..187
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 762..935
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 200..294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 316..517
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 540..737
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 239..258
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KGL79224.1"
FT NON_TER 935
FT /evidence="ECO:0000313|EMBL:KGL79224.1"
SQ SEQUENCE 935 AA; 95738 MW; 091C35AD3D7AC446 CRC64;
CILEIAFLLD SSESAKDYNH EQQKKFVLET VDRMKSLQLS SGRSLSWRMA LLQYSSTVST
EQTFRDWKGP EAFKSHIAPI TYIGHGTYTT YAITNLTQLY MTEGSPGSVK VTLLFTDGVD
HPRNPDIFAA IADAKHQGIV FFTMGITRVA EEAGNAAKLR LLASVPASRF VFNLQDKGTV
QKVLKEMCLQ AAICNCEKGE RGLPGPAGKK GRTGDDGLPG LKGAKGEAGL NGIPGRDGKE
GKSGYKGEKG ERGECGIPGI KGDRGPEGPA GPQGTTGLQG PQGQPGETGP EGLQGSKART
AYLFLFLFFG KGERGLPGPP GLPGETGIGV PGPKGDTGLT GRPGPAGPPG VGEPGLVGPQ
GPQGIQGERG LPGEGLPGAK GERGFDGPKG PRGLPGISIK GEKGEFGLPG LPGPIGLPGI
GIQGEKGLEG PKGSPGSRGL PGQGLPGPKG EQGLPGETGV PGERGIGEPG SKGEPGLSGL
AGLPGLPGED GAPGQKGEPG LPGLRGPEGA PGTGIQGEKM CLSTQRKNSI VFALAPSLQG
EPGVPGRFGM PGPPGRAVPG PKGDIGLPGL AGPVGEPGFG LPGAKGERGL PGPPGPFGPK
GDGYPGPTGL PGLPGTPGEQ GPDGVGLPGP KGDPGSRGPI GLPGAPGEGL PGPKGTMGRP
GPPGSLGPPG EGIQGIKGEQ GIQGMPGPRG PPGEGLLGQK GDRGATGERG KKGEKGNLGD
PGLSGEPGKT GPKGEQGLTR EDIIKLIREI CCGIKCKEIP MELVFVIDSS ESVGPENFEI
IKDFVTALVD RVTVGRNATR IGLVLYSLEV RLEFDLNKYT TQQDVKQAIR KMQYIGEGTY
TATAIRKATQ EGFFGARTGV RKVAIVLTDG QADKREAVKL DVAVREAHAA NIEMYAIGIV
NTSDPTQVEF VRELNLIASD PDREHMYLID DFNTL
//