GenomeNet

Database: UniProt
Entry: A0A091LJR6_9GRUI
ID   A0A091LJR6_9GRUI        Unreviewed;      1026 AA.
AC   A0A091LJR6;
DT   26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 1.
DT   27-MAR-2024, entry version 24.
DE   SubName: Full=Collagen alpha-1(VI) chain {ECO:0000313|EMBL:KFP43091.1};
GN   ORFNames=N324_05911 {ECO:0000313|EMBL:KFP43091.1};
OS   Chlamydotis macqueenii (Macqueen's bustard).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Gruiformes; Otididae; Chlamydotis.
OX   NCBI_TaxID=187382 {ECO:0000313|EMBL:KFP43091.1, ECO:0000313|Proteomes:UP000053330};
RN   [1] {ECO:0000313|EMBL:KFP43091.1, ECO:0000313|Proteomes:UP000053330}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_N324 {ECO:0000313|EMBL:KFP43091.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KK755360; KFP43091.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A091LJR6; -.
DR   Proteomes; UP000053330; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   CDD; cd01480; vWA_collagen_alpha_1-VI-type; 3.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF84; COLLAGEN ALPHA-1(XXI) CHAIN-LIKE ISOFORM X1; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF00092; VWA; 3.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 3.
DR   SUPFAM; SSF53300; vWA-like; 3.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR   PROSITE; PS50234; VWFA; 3.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:KFP43091.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053330};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..1026
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001877733"
FT   DOMAIN          41..237
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          617..804
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          828..1019
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          249..592
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        305..333
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1026 AA;  108942 MW;  3155F8513E890330 CRC64;
     MKLQDFLLPL LLPVALLGSF SCAQRPEITT RVANAEDCPV DLFFVLDTSE SVALRVKPFG
     DLVAQVKDFT NRFIDKLTNR YYRCDRNLVW NAGALHYSDE VVLIKSLTSM PSGQNELKNR
     VSAVNYIGKG TYTDCAIKRG IEELLISGSH HKENKYLIVV TDGHPLEGYK EPCGGLDDAA
     NEAKHLGIKV FSVAISPNHL DQRLNIIATD HAYRRNFTAT SLKPTRELDV EETINTIIEM
     IKDNTEQSCC SFECQPPRGP PGPPGDPGNE GERGKPGLPG QKGDAGDPGR PGDMGPVGYQ
     GMKGDKGSRG EKGSRGAKGA KGEKGRRGID GIDGMKGEAG YPGLPGCKGS PGFDVPQGPP
     GPKGDPGAYG PKGGKGEPGD DGKPGRQGIP GSPGEKGTRG NQGEPGPAGE TGDEGAPGED
     GPPGERGSNG ERGPVGSPGD RGPRGDPGEP GPPGDQGREG PLGPPGDQGE AGPPGPKGYR
     GDDGPRGNEG PKGLPGAPGL PGDPGLMGER GEDGPPGNST IGFTGAPGQP GDRGDPGING
     TKGYLGPKGD EGEAGDPGND NPTPGPRGIK GAKGHRGPEG RPGPPGPVGP PGPDECEILD
     IIMKMCSCCE CTCGPVDLLF VLDSSESIGL QNFQIAKDFI IKVIDRLSKD ERVKFEPGES
     RVGVVQYSHD NTQELVAMGD ANIDNIGALK QAVKNLKWIA GGTYTGEALQ FTKENLLRRF
     TSDKRVAIVI TDGRSDTLRD PTPLNSLCDV TPVVSLGIGD IFRNPPNPDH LNDIACLSRP
     TRPGLSIQRD NYAELLDDTF LQNITSYVCQ EKKCPDYTCP INFAGLADIT LLVDSSTSVG
     SKNFETTKKF VKRLAERVLK KICAHLSEEG AVRISVVQYS GRNQQKVEAQ FQYNYTVIAK
     AIDNMEFIND ATDVNAALRY VTGLYQQSSR AGAKKRVLVF SDGNSQGITA RAIERAVQEA
     QQAGIEIYVL AVGSQANEPN IRVLVTGKSA DYDVVYGERH LFRVPDYTSL LRGVFYQTVS
     RKIAVD
//
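
The record above follows the UniProtKB flat-file layout: each line starts with a two-letter line code (ID, AC, DR, FT, SQ, ...) followed by three spaces and the data, and `//` terminates the entry. As a minimal sketch of how the feature table (FT lines) might be read, the Python below extracts feature keys and coordinates from a few lines copied from this entry. The names `SAMPLE`, `FEATURE_RE`, and `parse_features` are hypothetical helpers introduced for illustration, and the regular expression assumes simple `start..end` locations only (no `<`/`>` fuzzy ends or single-position features).

```python
import re

# A few lines copied from the entry above; the parser is an illustrative sketch,
# not an official UniProt tool.
SAMPLE = """\
ID   A0A091LJR6_9GRUI        Unreviewed;      1026 AA.
AC   A0A091LJR6;
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   DOMAIN          41..237
FT                   /note="VWFA"
FT   DOMAIN          617..804
FT                   /note="VWFA"
FT   DOMAIN          828..1019
FT                   /note="VWFA"
//
"""

# Assumption: a feature header is "FT", three spaces, a key, then "start..end".
# Qualifier lines (/note=, /evidence=) are indented further and do not match.
FEATURE_RE = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")


def parse_features(text):
    """Return (feature_key, start, end) tuples for each FT header line."""
    features = []
    for line in text.splitlines():
        match = FEATURE_RE.match(line)
        if match:
            key, start, end = match.group(1), int(match.group(2)), int(match.group(3))
            features.append((key, start, end))
    return features


features = parse_features(SAMPLE)
# Keep only the DOMAIN features as (start, end) pairs, using 1-based inclusive
# residue positions as in the record.
domains = [(start, end) for key, start, end in features if key == "DOMAIN"]
```

Here `domains` holds the three VWFA domain ranges listed in the record. For real work, an established parser such as Biopython's `Bio.SwissProt` module is likely a better choice than a hand-written regular expression.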