ID A0A091UT92_NIPNI Unreviewed; 1022 AA.
AC A0A091UT92;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Collagen alpha-2(VI) chain {ECO:0000313|EMBL:KFQ93891.1};
DE Flags: Fragment;
GN ORFNames=Y956_10201 {ECO:0000313|EMBL:KFQ93891.1};
OS Nipponia nippon (Crested ibis) (Ibis nippon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Pelecaniformes; Threskiornithidae;
OC Nipponia.
OX NCBI_TaxID=128390 {ECO:0000313|EMBL:KFQ93891.1, ECO:0000313|Proteomes:UP000053283};
RN [1] {ECO:0000313|EMBL:KFQ93891.1, ECO:0000313|Proteomes:UP000053283}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_Y956 {ECO:0000313|EMBL:KFQ93891.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL410068; KFQ93891.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A091UT92; -.
DR STRING; 128390.A0A091UT92; -.
DR eggNOG; KOG3544; Eukaryota.
DR Proteomes; UP000053283; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00092; VWA; 3.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 3.
DR SUPFAM; SSF53300; vWA-like; 3.
DR PROSITE; PS50234; VWFA; 3.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KFQ93891.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000053283};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..1022
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001880376"
FT DOMAIN 50..238
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 619..809
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 837..1017
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 266..591
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 348..379
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFQ93891.1"
FT NON_TER 1022
FT /evidence="ECO:0000313|EMBL:KFQ93891.1"
SQ SEQUENCE 1022 AA; 108651 MW; 2B9395F3D6D38735 CRC64;
ISRGTAEMFR EALFSTLLCV ALVPLHAQFD GDPGDVSPTS CAEKRNCPIN VYFIIDTSES
VALQTVPIQS LVDQIKQFIP IFIDKLENEL YQNQVYITWQ FGGLHYSDVV EIYSPLTSSK
DIYLPRLSAI NYLGRGTFTD CAISNMTEQI QTQMATGVNF AVVITDGHVT GSPCGGMKMQ
AERARDMGIK LFAVAPSENV YEQGLREIAN LPHELYRNNY AITQKDTLDI DVNTTERIIQ
AMKHEAYGEC YKMSCLEIAG PPGPKGYRGQ KGAKGNMGEP GSPGLKGRQG DPGIEGPIGY
PGPKGVPGLK GEKGEIGSDG RRGAAGLAGR NGTDGQKGKL GRIGPPGCKG DRGDKGPDGY
PGDAGDQGER GDEGIKGDPG RPGRSGPLGP PGEKGSPGLP GNPGAQGPAG TKGRKGETGP
PGPKGEPGRR GDPGTKGSKG TPGTKGERGD PGPEGPRGLP GEVGSKGARG DQGLPGPRGP
PGAVGEPGNI GSRGDPGDLG PRGDIGPPGL KGDRGRPGFS YPGPRGPQGD KGEKGQPGPK
GGRGELGPKG VQGTKGEKGE PGDPGPGGEP GPRGPTGEAG PEGTPGPPGD PGLTDCDVMT
YVRETCGCCD CEKRCGALDI MFVIDSSESI GYTNFTLEKN FVVNVVSRLG SIAKDPKSQT
GARVGVVQYS HDGTFEAISL DDERIDSLSS FKEAVKRLEW IAGGTWTPSA LQFAYNKLIK
ESRREKAQVF AVVITDGRYD PRDDDKNLGA LCGRDVVVNT IGIGDMFDQP EQSETLVSIA
CNEPQRVQKM RLFSDLVAEE FIDKMEDVLC PDPQIICPEL PCQTELAVAQ CTQRPVDVVF
LLDGSERIGE QNFHRAHHFV EEVARQLTLA RSNSDNMNAR IALLQYGSER DQDVVFPLTY
NLTEISNALA QIKYLDSSSN IGSAIIHAIN NIVLSPGDGQ RLARRNAELS FVFITDGITG
SKNLEEAINS MKKQDVMPTV VALGSDIDMD LGLGDRAAIF REKDYDSLSQ PSFFDRFIRW
IC
//