ID A0A091LJR6_9GRUI Unreviewed; 1026 AA.
AC A0A091LJR6;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Collagen alpha-1(VI) chain {ECO:0000313|EMBL:KFP43091.1};
GN ORFNames=N324_05911 {ECO:0000313|EMBL:KFP43091.1};
OS Chlamydotis macqueenii (Macqueen's bustard).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Gruiformes; Otididae; Chlamydotis.
OX NCBI_TaxID=187382 {ECO:0000313|EMBL:KFP43091.1, ECO:0000313|Proteomes:UP000053330};
RN [1] {ECO:0000313|EMBL:KFP43091.1, ECO:0000313|Proteomes:UP000053330}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N324 {ECO:0000313|EMBL:KFP43091.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK755360; KFP43091.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A091LJR6; -.
DR Proteomes; UP000053330; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd01480; vWA_collagen_alpha_1-VI-type; 3.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF84; COLLAGEN ALPHA-1(XXI) CHAIN-LIKE ISOFORM X1; 1.
DR Pfam; PF01391; Collagen; 5.
DR Pfam; PF00092; VWA; 3.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 3.
DR SUPFAM; SSF53300; vWA-like; 3.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS50234; VWFA; 3.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KFP43091.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000053330};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1026
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001877733"
FT DOMAIN 41..237
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 617..804
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 828..1019
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 249..592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 305..333
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1026 AA; 108942 MW; 3155F8513E890330 CRC64;
MKLQDFLLPL LLPVALLGSF SCAQRPEITT RVANAEDCPV DLFFVLDTSE SVALRVKPFG
DLVAQVKDFT NRFIDKLTNR YYRCDRNLVW NAGALHYSDE VVLIKSLTSM PSGQNELKNR
VSAVNYIGKG TYTDCAIKRG IEELLISGSH HKENKYLIVV TDGHPLEGYK EPCGGLDDAA
NEAKHLGIKV FSVAISPNHL DQRLNIIATD HAYRRNFTAT SLKPTRELDV EETINTIIEM
IKDNTEQSCC SFECQPPRGP PGPPGDPGNE GERGKPGLPG QKGDAGDPGR PGDMGPVGYQ
GMKGDKGSRG EKGSRGAKGA KGEKGRRGID GIDGMKGEAG YPGLPGCKGS PGFDVPQGPP
GPKGDPGAYG PKGGKGEPGD DGKPGRQGIP GSPGEKGTRG NQGEPGPAGE TGDEGAPGED
GPPGERGSNG ERGPVGSPGD RGPRGDPGEP GPPGDQGREG PLGPPGDQGE AGPPGPKGYR
GDDGPRGNEG PKGLPGAPGL PGDPGLMGER GEDGPPGNST IGFTGAPGQP GDRGDPGING
TKGYLGPKGD EGEAGDPGND NPTPGPRGIK GAKGHRGPEG RPGPPGPVGP PGPDECEILD
IIMKMCSCCE CTCGPVDLLF VLDSSESIGL QNFQIAKDFI IKVIDRLSKD ERVKFEPGES
RVGVVQYSHD NTQELVAMGD ANIDNIGALK QAVKNLKWIA GGTYTGEALQ FTKENLLRRF
TSDKRVAIVI TDGRSDTLRD PTPLNSLCDV TPVVSLGIGD IFRNPPNPDH LNDIACLSRP
TRPGLSIQRD NYAELLDDTF LQNITSYVCQ EKKCPDYTCP INFAGLADIT LLVDSSTSVG
SKNFETTKKF VKRLAERVLK KICAHLSEEG AVRISVVQYS GRNQQKVEAQ FQYNYTVIAK
AIDNMEFIND ATDVNAALRY VTGLYQQSSR AGAKKRVLVF SDGNSQGITA RAIERAVQEA
QQAGIEIYVL AVGSQANEPN IRVLVTGKSA DYDVVYGERH LFRVPDYTSL LRGVFYQTVS
RKIAVD
//