GenomeNet

Database: UniProt
Entry: A0A099Z9P2_TINGU
LinkDB: A0A099Z9P2_TINGU
Original site: A0A099Z9P2_TINGU 
ID   A0A099Z9P2_TINGU        Unreviewed;       935 AA.
AC   A0A099Z9P2;
DT   07-JAN-2015, integrated into UniProtKB/TrEMBL.
DT   07-JAN-2015, sequence version 1.
DT   27-MAR-2024, entry version 24.
DE   SubName: Full=Collagen alpha-1(XXVIII) chain {ECO:0000313|EMBL:KGL79224.1};
DE   Flags: Fragment;
GN   ORFNames=N309_10307 {ECO:0000313|EMBL:KGL79224.1};
OS   Tinamus guttatus (White-throated tinamou).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Palaeognathae; Tinamiformes; Tinamidae; Tinamus.
OX   NCBI_TaxID=94827 {ECO:0000313|EMBL:KGL79224.1, ECO:0000313|Proteomes:UP000053641};
RN   [1] {ECO:0000313|EMBL:KGL79224.1, ECO:0000313|Proteomes:UP000053641}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_N309 {ECO:0000313|EMBL:KGL79224.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (JUN-2014) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KL891832; KGL79224.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A099Z9P2; -.
DR   STRING; 94827.A0A099Z9P2; -.
DR   Proteomes; UP000053641; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   CDD; cd01472; vWA_collagen; 1.
DR   CDD; cd01450; vWFA_subfamily_ECM; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF18; COLLAGEN ALPHA-1(VI) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:KGL79224.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053641}.
FT   DOMAIN          4..187
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          762..935
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          200..294
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          316..517
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          540..737
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        239..258
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KGL79224.1"
FT   NON_TER         935
FT                   /evidence="ECO:0000313|EMBL:KGL79224.1"
SQ   SEQUENCE   935 AA;  95738 MW;  091C35AD3D7AC446 CRC64;
     CILEIAFLLD SSESAKDYNH EQQKKFVLET VDRMKSLQLS SGRSLSWRMA LLQYSSTVST
     EQTFRDWKGP EAFKSHIAPI TYIGHGTYTT YAITNLTQLY MTEGSPGSVK VTLLFTDGVD
     HPRNPDIFAA IADAKHQGIV FFTMGITRVA EEAGNAAKLR LLASVPASRF VFNLQDKGTV
     QKVLKEMCLQ AAICNCEKGE RGLPGPAGKK GRTGDDGLPG LKGAKGEAGL NGIPGRDGKE
     GKSGYKGEKG ERGECGIPGI KGDRGPEGPA GPQGTTGLQG PQGQPGETGP EGLQGSKART
     AYLFLFLFFG KGERGLPGPP GLPGETGIGV PGPKGDTGLT GRPGPAGPPG VGEPGLVGPQ
     GPQGIQGERG LPGEGLPGAK GERGFDGPKG PRGLPGISIK GEKGEFGLPG LPGPIGLPGI
     GIQGEKGLEG PKGSPGSRGL PGQGLPGPKG EQGLPGETGV PGERGIGEPG SKGEPGLSGL
     AGLPGLPGED GAPGQKGEPG LPGLRGPEGA PGTGIQGEKM CLSTQRKNSI VFALAPSLQG
     EPGVPGRFGM PGPPGRAVPG PKGDIGLPGL AGPVGEPGFG LPGAKGERGL PGPPGPFGPK
     GDGYPGPTGL PGLPGTPGEQ GPDGVGLPGP KGDPGSRGPI GLPGAPGEGL PGPKGTMGRP
     GPPGSLGPPG EGIQGIKGEQ GIQGMPGPRG PPGEGLLGQK GDRGATGERG KKGEKGNLGD
     PGLSGEPGKT GPKGEQGLTR EDIIKLIREI CCGIKCKEIP MELVFVIDSS ESVGPENFEI
     IKDFVTALVD RVTVGRNATR IGLVLYSLEV RLEFDLNKYT TQQDVKQAIR KMQYIGEGTY
     TATAIRKATQ EGFFGARTGV RKVAIVLTDG QADKREAVKL DVAVREAHAA NIEMYAIGIV
     NTSDPTQVEF VRELNLIASD PDREHMYLID DFNTL
//
DBGET integrated database retrieval system