ID   A0A091R2S5_9GRUI        Unreviewed;      1337 AA.
AC   A0A091R2S5;
DT   26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 1.
DT   27-MAR-2024, entry version 29.
DE   RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
DE   Flags: Fragment;
GN   ORFNames=N332_02412 {ECO:0000313|EMBL:KFQ34090.1};
OS   Mesitornis unicolor (brown roatelo).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Gruiformes; Mesitornithidae; Mesitornis.
OX   NCBI_TaxID=54374 {ECO:0000313|EMBL:KFQ34090.1, ECO:0000313|Proteomes:UP000053369};
RN   [1] {ECO:0000313|EMBL:KFQ34090.1, ECO:0000313|Proteomes:UP000053369}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_N332 {ECO:0000313|EMBL:KFQ34090.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KK808863; KFQ34090.1; -; Genomic_DNA.
DR   Proteomes; UP000053369; Unassembled WGS sequence.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1108; ENDOSTATIN DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 7.
DR   SMART; SM00038; COLFI; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053369};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          1102..1337
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          1..1084
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        18..44
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        68..90
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        100..114
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        773..787
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1061..1077
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KFQ34090.1"
FT   NON_TER         1337
FT                   /evidence="ECO:0000313|EMBL:KFQ34090.1"
SQ   SEQUENCE   1337 AA;  126129 MW;  610529FA7E8E9164 CRC64;
     SFIFFQQGPR GDKGPQGERG PPGPPGRDGE DGPPGPPGPP GPPGLGGNFA AQYDPSKAAD
     FGPGPMGLMG PRGPPGASGP PGPPGFQGVP GEPGEPGQTG PQGPRGPPGP PGKAGEDGHP
     GKPGRPGERG VAGPQGARGF PGTPGLPGFK GIRGHNGLDG QKGQPGTPGT KGEPGAPGEN
     GTPGQPGARG LPGERGRVGA PGPAGARGSD GSAGPTGPAG PIGAAGPPGF PGAPGAKGEI
     GPAGNVGPSG PAGPRGEIGL PGSSGPVGPP GNPGANGLPG AKGAAGLPGV AGAPGLPGPR
     GIPGPPGPAG PSGARGLVGE PGPAGAKGES GNKGEPGAAG PAGPPGPSGE EGKRGSTGEP
     GSAGPPGPAG LRGVPGSRGL PGADGRAGVM GPAGNRGASG PVGAKGPSGD GGRPGEPGLM
     GPRGLPGQPG SPGPAGKEGP VGFPGADGRV GPIGPAGNRG EPGNIGFPGP KGPTGEPGKP
     GEKGNVGVAG PRGAPGPEGN NGAQGPPGVT GNQGGKGETG PAGPPGFQGL PGPSGPAGEA
     GKPGERGLHG EFGVPGPAGP RGERGLPGES GAVGPAGPIG SRGPSGPPGP DGNKGEPGNV
     GAAGGPGPAG PGGIPGERGV AGVPGGKGEK GAPGLRGDTG ATGRDGARGL PGAIGAPGPA
     GGTGDRGEGG PAGPAGPAGA RGIPGERGEP GPVGPSGFAG PPGAAGQPGA KGERGPKGPK
     GESGPTGAIG PIGASGPPGP AGAAGPAGPR GDAGPPGMTG FPGAAGRVGP PGPAGISGPP
     GPPGPAGKDG PRGLRGDVGP VGRTGEQGIA GPPGFAGEKG PSGEAGAAGP PGTPGPQGIL
     GAPGILGLPG SRGERGLPGI SGATGEPGPL GVSGPPGARG PSGPVGSPGP NGAPGEAGRD
     GNPGNDGPPG RDGAPGFKGE RGAPGNPGPS GALGAPGPHG QVGPSGKPGN RGDPGPAGVV
     GPAGAFGPRG LAGPQGPRGE KGEPGDKGPR GLPGLKGHNG LQGLPGLAGQ HGDQGPPGNT
     GPAGPRGQPG PSGPPGKDGR NGLPGPIGPA GVRGSHGSQG PAGPPGPPGP PGPPGPNGGG
     YEVGFDAEYF RADQPSLRPK DYEVDATLKT LNNQIETLLT PEGSRKNPAR TCRDLRLSHP
     EWSSGFYWID PNQGCSADAI RAYCNFATGE TCIHANPDHI PAKNWYVNKN PKDKKHVWFG
     ETINGGSQFE YNSEGVTTKD MATQLAFMRL LANHASQNIT YHCKNSIAYM DEETGNLKKA
     VILQGSNDVE LRAEGNSRFT YSVLVDGCSR KNNEWGKTII EYRTNKPSRL PILDIAPLDI
     GSPDQEVSLD IGPVCYK
//
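
The record above follows the UniProtKB flat-file layout: a two-letter line code, three spaces, then the payload, with the sequence block running from the SQ line to the `//` terminator. As a minimal sketch of how such a record could be read programmatically, the Python below extracts the entry name, the accessions, the FT features and the sequence. The function name `parse_uniprot_flat` and the record `TOY_RECORD` are hypothetical, introduced only for illustration; `TOY_RECORD` is made-up data, not a real entry. The sketch assumes plain numeric feature locations such as `1102..1337` or `1` and single-line qualifiers, so fuzzy locations and wrapped qualifier values are not handled.

```python
import re

# Hypothetical toy record, used only to exercise the parser below.
TOY_RECORD = """\
ID   TOY_EXAMPLE             Unreviewed;        15 AA.
AC   X0X0X0;
FT   DOMAIN          3..12
FT                   /note="Toy domain"
FT   NON_TER         1
SQ   SEQUENCE   15 AA;  0 MW;  0000000000000000 CRC64;
     ACDEFGHIKL MNPQR
//
"""


def parse_uniprot_flat(text):
    """Parse one flat-file record into a dict (illustrative sketch only)."""
    record = {
        "id": None,
        "accessions": [],
        "features": [],
        "declared_length": None,
        "sequence": "",
    }
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):
            break  # end-of-record terminator
        if in_sequence:
            # Sequence lines are indented and split into blocks of 10 residues.
            record["sequence"] += "".join(line.split())
            continue
        code, rest = line[:2], line[5:]
        if code == "ID" and rest.split():
            record["id"] = rest.split()[0]
        elif code == "AC":
            record["accessions"] += [a.strip() for a in rest.split(";") if a.strip()]
        elif code == "FT":
            if rest.startswith(" "):
                # Continuation line: a qualifier such as /note="..."
                m = re.match(r'/(\w+)="(.*)"$', rest.strip())
                if m and record["features"]:
                    record["features"][-1]["qualifiers"][m.group(1)] = m.group(2)
            else:
                parts = rest.split()
                if len(parts) < 2:
                    continue
                # Only plain "start..end" or single-position locations are parsed.
                loc = re.fullmatch(r"(\d+)(?:\.\.(\d+))?", parts[1])
                start = int(loc.group(1)) if loc else None
                end = int(loc.group(2) or loc.group(1)) if loc else None
                record["features"].append(
                    {"key": parts[0], "start": start, "end": end, "qualifiers": {}}
                )
        elif code == "SQ":
            m = re.match(r"SEQUENCE\s+(\d+) AA;", rest)
            if m:
                record["declared_length"] = int(m.group(1))
            in_sequence = True
    return record


if __name__ == "__main__":
    rec = parse_uniprot_flat(TOY_RECORD)
    print(rec["id"], rec["declared_length"], len(rec["sequence"]))
    for feat in rec["features"]:
        print(feat["key"], feat["start"], feat["end"], feat["qualifiers"])
```

Applied to the entry above, a useful sanity check would be that the length declared on the SQ line (1337 AA) equals the number of residues actually parsed, and that no feature ends beyond that length. For production use, an established parser such as the one in Biopython's `Bio.SeqIO` (format name `"swiss"`) is likely a better choice than this hand-rolled sketch.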