GenomeNet

Database: UniProt
Entry: A0A091UT92_NIPNI
LinkDB: A0A091UT92_NIPNI
Original site: A0A091UT92_NIPNI 
ID   A0A091UT92_NIPNI        Unreviewed;      1022 AA.
AC   A0A091UT92;
DT   26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 1.
DT   27-MAR-2024, entry version 25.
DE   SubName: Full=Collagen alpha-2(VI) chain {ECO:0000313|EMBL:KFQ93891.1};
DE   Flags: Fragment;
GN   ORFNames=Y956_10201 {ECO:0000313|EMBL:KFQ93891.1};
OS   Nipponia nippon (Crested ibis) (Ibis nippon).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Pelecaniformes; Threskiornithidae;
OC   Nipponia.
OX   NCBI_TaxID=128390 {ECO:0000313|EMBL:KFQ93891.1, ECO:0000313|Proteomes:UP000053283};
RN   [1] {ECO:0000313|EMBL:KFQ93891.1, ECO:0000313|Proteomes:UP000053283}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_Y956 {ECO:0000313|EMBL:KFQ93891.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KL410068; KFQ93891.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A091UT92; -.
DR   STRING; 128390.A0A091UT92; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   Proteomes; UP000053283; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   CDD; cd00198; vWFA; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00092; VWA; 3.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 3.
DR   SUPFAM; SSF53300; vWA-like; 3.
DR   PROSITE; PS50234; VWFA; 3.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:KFQ93891.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053283};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..1022
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001880376"
FT   DOMAIN          50..238
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          619..809
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          837..1017
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          266..591
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        348..379
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KFQ93891.1"
FT   NON_TER         1022
FT                   /evidence="ECO:0000313|EMBL:KFQ93891.1"
SQ   SEQUENCE   1022 AA;  108651 MW;  2B9395F3D6D38735 CRC64;
     ISRGTAEMFR EALFSTLLCV ALVPLHAQFD GDPGDVSPTS CAEKRNCPIN VYFIIDTSES
     VALQTVPIQS LVDQIKQFIP IFIDKLENEL YQNQVYITWQ FGGLHYSDVV EIYSPLTSSK
     DIYLPRLSAI NYLGRGTFTD CAISNMTEQI QTQMATGVNF AVVITDGHVT GSPCGGMKMQ
     AERARDMGIK LFAVAPSENV YEQGLREIAN LPHELYRNNY AITQKDTLDI DVNTTERIIQ
     AMKHEAYGEC YKMSCLEIAG PPGPKGYRGQ KGAKGNMGEP GSPGLKGRQG DPGIEGPIGY
     PGPKGVPGLK GEKGEIGSDG RRGAAGLAGR NGTDGQKGKL GRIGPPGCKG DRGDKGPDGY
     PGDAGDQGER GDEGIKGDPG RPGRSGPLGP PGEKGSPGLP GNPGAQGPAG TKGRKGETGP
     PGPKGEPGRR GDPGTKGSKG TPGTKGERGD PGPEGPRGLP GEVGSKGARG DQGLPGPRGP
     PGAVGEPGNI GSRGDPGDLG PRGDIGPPGL KGDRGRPGFS YPGPRGPQGD KGEKGQPGPK
     GGRGELGPKG VQGTKGEKGE PGDPGPGGEP GPRGPTGEAG PEGTPGPPGD PGLTDCDVMT
     YVRETCGCCD CEKRCGALDI MFVIDSSESI GYTNFTLEKN FVVNVVSRLG SIAKDPKSQT
     GARVGVVQYS HDGTFEAISL DDERIDSLSS FKEAVKRLEW IAGGTWTPSA LQFAYNKLIK
     ESRREKAQVF AVVITDGRYD PRDDDKNLGA LCGRDVVVNT IGIGDMFDQP EQSETLVSIA
     CNEPQRVQKM RLFSDLVAEE FIDKMEDVLC PDPQIICPEL PCQTELAVAQ CTQRPVDVVF
     LLDGSERIGE QNFHRAHHFV EEVARQLTLA RSNSDNMNAR IALLQYGSER DQDVVFPLTY
     NLTEISNALA QIKYLDSSSN IGSAIIHAIN NIVLSPGDGQ RLARRNAELS FVFITDGITG
     SKNLEEAINS MKKQDVMPTV VALGSDIDMD LGLGDRAAIF REKDYDSLSQ PSFFDRFIRW
     IC
//
DBGET integrated database retrieval system