ID A0A087R4Q7_APTFO Unreviewed; 1793 AA.
AC A0A087R4Q7;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE SubName: Full=Collagen alpha-1(V) chain {ECO:0000313|EMBL:KFM08461.1};
DE Flags: Fragment;
GN ORFNames=AS27_13745 {ECO:0000313|EMBL:KFM08461.1};
OS Aptenodytes forsteri (Emperor penguin).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Sphenisciformes; Spheniscidae;
OC Aptenodytes.
OX NCBI_TaxID=9233 {ECO:0000313|EMBL:KFM08461.1, ECO:0000313|Proteomes:UP000053286};
RN [1] {ECO:0000313|EMBL:KFM08461.1, ECO:0000313|Proteomes:UP000053286}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_AS27 {ECO:0000313|EMBL:KFM08461.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL226097; KFM08461.1; -; Genomic_DNA.
DR STRING; 9233.A0A087R4Q7; -.
DR Proteomes; UP000053286; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR CDD; cd00110; LamG; 1.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF891; COLLAGEN ALPHA-1(XVII) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 10.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KFM08461.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000053286};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1564..1792
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 216..246
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 262..310
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 442..490
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 533..662
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 689..1553
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 220..234
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 443..463
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 638..653
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 756..770
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 891..927
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1110..1124
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1203..1223
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1275..1311
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1335..1350
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1408..1422
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1479..1494
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1510..1527
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFM08461.1"
FT NON_TER 1793
FT /evidence="ECO:0000313|EMBL:KFM08461.1"
SQ SEQUENCE 1793 AA; 180060 MW; 53FEF600BD1686FF CRC64;
FLFSPFLFTA EPADLLKVLD FHNLPDGITK TTGFCTSRRS SKEADVAYRV TKDAQLSAPT
KQLYPASPFP EDFSILTTVK AKKGGQSFLI SIYNEQGIQQ IGVEMGRSPV FLYEDHTGKP
GPEDYPLFRG INLADGKWHR VAISVQKKNV TLILDCKKKI TKFLDRSDHP IIDVNGIIVF
GTRILDEEVF EGDIQQLLIV ADPRAAHDYC EHYSPDCDTA IPDSPQSQDP NQDEYYTDGE
GEGDTYYYEY PYYEDVDEAV KPEAPTSKPG TPEVAAGERP ETKQDYPDPT PSPDGGDPGK
QTEGDAVVDD PLVDEYNYET INEEYFTPLP YEDVNYNEEV DPQGGLTENA VEAELPTSTV
ITYNETDAAQ GGDDLDKDFT EETIKEYDGN YYDNYYDRTV SPDIGPGMPA NQDTIYEGIG
GPRGEKGQKG EPAIIEPGML VEGPPGPEGP AGLPGPPGPT GPVGLMGDPG ERGPPGRPGL
PGADGLPGPP GTMLMLPFRF SGGGDAGSKG PLVSAQEAQA QAILQQARLA LRGPAGPMGL
TGRPGPMGPP GSGGLKGEAG DMGPQGPRGI QGPPGPAGKP GRRGRAGSDG ARGMPGQTGP
KGEPGPHGPP GAPGEDGERG DDGEVGPRGL PGEPGPRGLL GPKGPPGPPG PPGVAGMDGQ
TGPKGNVVSF LLQISQTVQI LYRGLPGPQG AIGPPGEKGP LGKPGLPGMP GADGPPGHPG
KEGPPGEKGS QGPPGPQGPI GYPGPRGVKG ADGVRGLKGT KGEKGEDGFP GFKGDMGIKG
DRGEIGPPGP RGEDGPEGPK GRSGPNGDPG PLGPAGEKGK LGVPGLPGYP GRQGPKGSIG
FPGFPGANGE KGTRGTPGKP GPRGQRGPTG PRGERGPRGS TGKPGPKGNS GGDGPPGPPG
ERGPPGPQGP TGFPGPKGPP GPPGKDGLPG HPGQRGETGF QGKTGPPGPP GVVGPQGPTG
ETGPMGERGH PGPPGPPGEQ GLPGLTGKEG TKGDPGPAGL PGKDGPPGLR GFPGERGLPG
PIGAPGLKGN EGPPGPPGPA GSPGERGPAG SAGPIGLPGR PGPQGPPGPA GEKGAPGEKG
PQGPAGRDGI QGPVGLPGPA GPVGPPGEDG DKGEIGEPGQ KGSKGDKGEQ GPPGPTGPQG
PIGQPGPAGA DGEPGPRGQQ GLFGQKGDEG PRGFPGPPGP VGLQGLPGPP GEKGETGDVG
QMGPPGPPGP RGPSGPPGAD GPQGPPGGIG NPGAVGEKGE PGESGEPGLP GEVGLPGPKG
ERGEKGEAGP SGAAGPPGPK GPPGDDGPKG SPGPVGFPGD PGPPGEPGPA GQDGPPGDKG
DDGEPGQTGS PGPTGEPGPS GPPGKRGPPG PAGPEGRQGE KGAKGEAGLE GPPGKTGPIG
PQGAPGKPGP DGLRGIPGPV GEQGLPGSPG PDGPPGPMGP PGLPGLKGDS GPKGEKGHPG
LIGLIGPPGE QGEKGDRGLP GPQGSAGPKG EQGITGPSGP IGPPGPPGLP GPPGPKGAKG
SSGPTGPKGE SGLPGPPGPP GPPGEVIQPL PIQSSKRTRR NIDASQLVDD GNADNYMDYA
DGMEEIFGSL NSLKLEIEQM KHPLGTQHNP ARTCKDLQLC HPDFPDGEYW VDPNQGCSRD
SFKVYCNFTA GGETCIFPDK KSEGARITSW PKENPGSWFS EFKRGKLLSY VDSDGNPIGV
VQMTFLRLLS ASAHQNITYN CYQSVAWHDA TTNSYDKAIR FLGSNDEEMS YDNNPYIRAA
LDGCAAKKGY QKTILEINTP KVEQVPIVDI MFNDFGEASQ KFGFEVGPAC FMG
//