ID A0A093NV64_PYGAD Unreviewed; 945 AA.
AC A0A093NV64;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Collagen alpha-1(XXVIII) chain {ECO:0000313|EMBL:KFW68066.1};
DE Flags: Fragment;
GN ORFNames=AS28_00618 {ECO:0000313|EMBL:KFW68066.1};
OS Pygoscelis adeliae (Adelie penguin).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Sphenisciformes; Spheniscidae; Pygoscelis.
OX NCBI_TaxID=9238 {ECO:0000313|EMBL:KFW68066.1, ECO:0000313|Proteomes:UP000054081};
RN [1] {ECO:0000313|EMBL:KFW68066.1, ECO:0000313|Proteomes:UP000054081}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_AS28 {ECO:0000313|EMBL:KFW68066.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL225146; KFW68066.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A093NV64; -.
DR STRING; 9238.A0A093NV64; -.
DR Proteomes; UP000054081; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd01472; vWA_collagen; 1.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF878; COLLAGEN ALPHA-1(XXVIII) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KFW68066.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000054081}.
FT DOMAIN 4..187
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 772..945
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 197..294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 321..746
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 241..258
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 601..624
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 648..676
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 713..728
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFW68066.1"
FT NON_TER 945
FT /evidence="ECO:0000313|EMBL:KFW68066.1"
SQ SEQUENCE 945 AA; 96494 MW; A62B98B866D864D2 CRC64;
CIVEIAFLLD SSESAKYYNH EQQKKFVLET VDRMKSLQLS SGRTLSWRMA LLQYSSTVFI
EQTFHDWKGP EAFKSHLAPI TYIGHGTYTT YAITNLTQLY ITEGTPGSVK VAVLFTDGVD
HPRNPDIFAA TADAKHQGIV FFTMGMTRVA EEVSNAAKLR LLASVPASRF VFNLQEKETV
EKVLKEMCPQ AATCNCEKGE RGLPGPAGKK GRTGDDGAPG LKGAKGEPGL NGIPGRDGIE
GKSGYKGEKG ERGECGIPGI KGDRGPEGPV GPRGTRGLQG PQGQSGDQGP EGLQGSKART
VLFLLFQLFL FSFFEKGERG LPGPPGLPGE TGIGLPGPKG DTGLTGRPGP VGPPGVGEPG
LMGPQGPQGV QGERGSPGEG LPGAKGDRGF EGPKGPRGLP GISIKGEKGE FGPPGLPGPI
GLPGIGIQGE KGVEGPKGPP GSRGLPGQGL PGPKGEQGLP GETGVPGERG IGEPGSKGEP
GPSGLAGLPG LPGEDGAPGQ KGEPGLPGLR GPEGAPGIGI QGEKGDQGQR GVRGLTGPTG
VPGPAGAKGE PGAPGRLGMP GPPGRAVPGP KGDIGLPGLA GPIGEPGFGL PGAKGDRGLP
GPPGPFGPKG DGYPGPPGLP GLPGIPGEQG PDGVGLPGPK GDPGSRGPIG LPGPPGEGLP
GPKGTVGRPG PPGSLGPPGE GIQGIKGEQG IQGMPGPRGP PGEGLLGQKG DRGATGEKGK
KGEKGDVGDP GLSGEPGKTG PKGEQGLTRE DIIKLIKEIC GCGIKCKEIP MELVFVIDSS
ESVGPENFEI IKDFVTALVD RVTVGRNATR IGLVLYSLEV QLEFGLNKYT TQQDVKQAVR
KMQYMGEGTY TATAIRKATQ EGFFGARTGV RKVAIVLTDG QADKREAVKL DIVVREAHAA
NIEMYAIGIV NTSDPTQVEF VRELNLIASD PDGEHMYLID DFNTL
//