GenomeNet

Database: UniProt
Entry: A0A087R4Q7_APTFO
ID   A0A087R4Q7_APTFO        Unreviewed;      1793 AA.
AC   A0A087R4Q7;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   29-OCT-2014, sequence version 1.
DT   24-JAN-2024, entry version 28.
DE   SubName: Full=Collagen alpha-1(V) chain {ECO:0000313|EMBL:KFM08461.1};
DE   Flags: Fragment;
GN   ORFNames=AS27_13745 {ECO:0000313|EMBL:KFM08461.1};
OS   Aptenodytes forsteri (Emperor penguin).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Sphenisciformes; Spheniscidae;
OC   Aptenodytes.
OX   NCBI_TaxID=9233 {ECO:0000313|EMBL:KFM08461.1, ECO:0000313|Proteomes:UP000053286};
RN   [1] {ECO:0000313|EMBL:KFM08461.1, ECO:0000313|Proteomes:UP000053286}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_AS27 {ECO:0000313|EMBL:KFM08461.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KL226097; KFM08461.1; -; Genomic_DNA.
DR   STRING; 9233.A0A087R4Q7; -.
DR   Proteomes; UP000053286; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   CDD; cd00110; LamG; 1.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF891; COLLAGEN ALPHA-1(XVII) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 10.
DR   Pfam; PF02210; Laminin_G_2; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:KFM08461.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053286};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          1564..1792
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          216..246
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          262..310
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          442..490
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          533..662
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          689..1553
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        220..234
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        443..463
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        638..653
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        756..770
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        891..927
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1110..1124
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1203..1223
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1275..1311
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1335..1350
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1408..1422
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1479..1494
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1510..1527
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KFM08461.1"
FT   NON_TER         1793
FT                   /evidence="ECO:0000313|EMBL:KFM08461.1"
SQ   SEQUENCE   1793 AA;  180060 MW;  53FEF600BD1686FF CRC64;
     FLFSPFLFTA EPADLLKVLD FHNLPDGITK TTGFCTSRRS SKEADVAYRV TKDAQLSAPT
     KQLYPASPFP EDFSILTTVK AKKGGQSFLI SIYNEQGIQQ IGVEMGRSPV FLYEDHTGKP
     GPEDYPLFRG INLADGKWHR VAISVQKKNV TLILDCKKKI TKFLDRSDHP IIDVNGIIVF
     GTRILDEEVF EGDIQQLLIV ADPRAAHDYC EHYSPDCDTA IPDSPQSQDP NQDEYYTDGE
     GEGDTYYYEY PYYEDVDEAV KPEAPTSKPG TPEVAAGERP ETKQDYPDPT PSPDGGDPGK
     QTEGDAVVDD PLVDEYNYET INEEYFTPLP YEDVNYNEEV DPQGGLTENA VEAELPTSTV
     ITYNETDAAQ GGDDLDKDFT EETIKEYDGN YYDNYYDRTV SPDIGPGMPA NQDTIYEGIG
     GPRGEKGQKG EPAIIEPGML VEGPPGPEGP AGLPGPPGPT GPVGLMGDPG ERGPPGRPGL
     PGADGLPGPP GTMLMLPFRF SGGGDAGSKG PLVSAQEAQA QAILQQARLA LRGPAGPMGL
     TGRPGPMGPP GSGGLKGEAG DMGPQGPRGI QGPPGPAGKP GRRGRAGSDG ARGMPGQTGP
     KGEPGPHGPP GAPGEDGERG DDGEVGPRGL PGEPGPRGLL GPKGPPGPPG PPGVAGMDGQ
     TGPKGNVVSF LLQISQTVQI LYRGLPGPQG AIGPPGEKGP LGKPGLPGMP GADGPPGHPG
     KEGPPGEKGS QGPPGPQGPI GYPGPRGVKG ADGVRGLKGT KGEKGEDGFP GFKGDMGIKG
     DRGEIGPPGP RGEDGPEGPK GRSGPNGDPG PLGPAGEKGK LGVPGLPGYP GRQGPKGSIG
     FPGFPGANGE KGTRGTPGKP GPRGQRGPTG PRGERGPRGS TGKPGPKGNS GGDGPPGPPG
     ERGPPGPQGP TGFPGPKGPP GPPGKDGLPG HPGQRGETGF QGKTGPPGPP GVVGPQGPTG
     ETGPMGERGH PGPPGPPGEQ GLPGLTGKEG TKGDPGPAGL PGKDGPPGLR GFPGERGLPG
     PIGAPGLKGN EGPPGPPGPA GSPGERGPAG SAGPIGLPGR PGPQGPPGPA GEKGAPGEKG
     PQGPAGRDGI QGPVGLPGPA GPVGPPGEDG DKGEIGEPGQ KGSKGDKGEQ GPPGPTGPQG
     PIGQPGPAGA DGEPGPRGQQ GLFGQKGDEG PRGFPGPPGP VGLQGLPGPP GEKGETGDVG
     QMGPPGPPGP RGPSGPPGAD GPQGPPGGIG NPGAVGEKGE PGESGEPGLP GEVGLPGPKG
     ERGEKGEAGP SGAAGPPGPK GPPGDDGPKG SPGPVGFPGD PGPPGEPGPA GQDGPPGDKG
     DDGEPGQTGS PGPTGEPGPS GPPGKRGPPG PAGPEGRQGE KGAKGEAGLE GPPGKTGPIG
     PQGAPGKPGP DGLRGIPGPV GEQGLPGSPG PDGPPGPMGP PGLPGLKGDS GPKGEKGHPG
     LIGLIGPPGE QGEKGDRGLP GPQGSAGPKG EQGITGPSGP IGPPGPPGLP GPPGPKGAKG
     SSGPTGPKGE SGLPGPPGPP GPPGEVIQPL PIQSSKRTRR NIDASQLVDD GNADNYMDYA
     DGMEEIFGSL NSLKLEIEQM KHPLGTQHNP ARTCKDLQLC HPDFPDGEYW VDPNQGCSRD
     SFKVYCNFTA GGETCIFPDK KSEGARITSW PKENPGSWFS EFKRGKLLSY VDSDGNPIGV
     VQMTFLRLLS ASAHQNITYN CYQSVAWHDA TTNSYDKAIR FLGSNDEEMS YDNNPYIRAA
     LDGCAAKKGY QKTILEINTP KVEQVPIVDI MFNDFGEASQ KFGFEVGPAC FMG
//