ID A0A093BQ80_9AVES Unreviewed; 1018 AA.
AC A0A093BQ80;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Collagen alpha-1(VI) chain {ECO:0000313|EMBL:KFV05915.1};
DE Flags: Fragment;
GN ORFNames=N339_03409 {ECO:0000313|EMBL:KFV05915.1};
OS Pterocles gutturalis (yellow-throated sandgrouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Ciconiiformes; Pteroclidae; Pterocles.
OX NCBI_TaxID=240206 {ECO:0000313|EMBL:KFV05915.1, ECO:0000313|Proteomes:UP000053149};
RN [1] {ECO:0000313|EMBL:KFV05915.1, ECO:0000313|Proteomes:UP000053149}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N339 {ECO:0000313|EMBL:KFV05915.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL231305; KFV05915.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A093BQ80; -.
DR Proteomes; UP000053149; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd01480; vWA_collagen_alpha_1-VI-type; 3.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF84; COLLAGEN ALPHA-1(XXI) CHAIN-LIKE ISOFORM X1; 1.
DR Pfam; PF01391; Collagen; 6.
DR Pfam; PF00092; VWA; 3.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 3.
DR SUPFAM; SSF53300; vWA-like; 3.
DR PROSITE; PS50234; VWFA; 3.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KFV05915.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000053149};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..1018
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001881763"
FT DOMAIN 36..232
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 612..799
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 823..1011
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 247..587
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 300..328
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFV05915.1"
FT NON_TER 1018
FT /evidence="ECO:0000313|EMBL:KFV05915.1"
SQ SEQUENCE 1018 AA; 107946 MW; 8850482613297E92 CRC64;
FLFTLLLLDS FLGGGSQAQR PEITTRVANA EDCPVDLFFV LDTSESVALR VKPFGDLVAQ
VKDFTNRFID KLTNRYYRCD RNLVWNAGAL HYSDEVVLIK SLTSMPSGRN ELKNRVSAVN
YIGKGTYTDC AIKRGIEELL ISGSHHKENK YLIVVTDGHP LEGYKEPCGG LDDAANEAKH
LGIKVFSVAI SPNHLDQRLN IIATDHAYRR NFTATSLKPT RELDVEETIN TIIDMIKVNT
EQSCCSFECQ PPRGPPGPPG DPGNEGERGK PGLPGEKGEA GAPGRPGDMG PVGYQGMKGD
KGSRGEKGSR GAKGAKGEKG RRGIDGIDGM KGEAGYPGLP GCKGSPGFDV PQGPPGPKGD
PGAYGPKGGK GEPGDDGKPG RQGIPGSPGE KGAPGNQGEP GPMGETGDEG APGPDGPPGE
RGSNGERGPP GSPGDRGPRG EPGEPGPPGD QGREGPLGPP GDQGEVGPPG PKGYRGDDGP
RGNEGPKGLP GAPGLPGDPG LMGERGEDGP PGNGTIGFTG APGQPGDRGD PGINGTKGYV
GPKGDEGEAG DPGNDNLTPG PRGIKGAKGH RGPEGRPGPP GPVGPPGPDE CEILDIIMKM
CSCCECTCGP VDLLFVLDSS ESIGLQNFQI AKDFIIKVID RLSKDERVKF EPGESRVGVV
QYSHDNTQEL VAMGDANIEN IGALKQAVKN LKWIAGGTYT GEALQFTREN LLQRFTSDKR
VAIVITDGRS DTLRDPTPLN SLCDVTPVVS LGIGDIFRNP PNPDHLNDIA CLSRPTRPGL
SIQRDNYAEL LDDTFLQNIT SYVCQEKKCP DYTCPIAFAG PADITLLVDS STSVGSKNFE
TTKKFVKRLA ERFLEAGKPA DDSVRVSVVQ YSGRNQQKVE AQFQYNYTVI AKAIDNMEFI
NDATDVNAAL RYITGLYQRS SRAGAKKRVL LFSDGNSQGI TARAIERAVQ EAQQAGIEIY
VLAVGSQPNE PNIRVLVTGK SADYDVVYGE RHLFRVPDYT SLLRGVFYQT VSRKIAVD
//