GenomeNet

Database: UniProt
Entry: A0A091I302_CALAN
LinkDB: A0A091I302_CALAN
Original site: A0A091I302_CALAN 
ID   A0A091I302_CALAN        Unreviewed;       410 AA.
AC   A0A091I302;
DT   26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 1.
DT   27-MAR-2024, entry version 28.
DE   SubName: Full=Collagen alpha-1(XXVI) chain {ECO:0000313|EMBL:KFP01828.1};
DE   Flags: Fragment;
GN   ORFNames=N300_03465 {ECO:0000313|EMBL:KFP01828.1};
OS   Calypte anna (Anna's hummingbird) (Archilochus anna).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Caprimulgimorphae; Apodiformes;
OC   Trochilidae; Calypte.
OX   NCBI_TaxID=9244 {ECO:0000313|EMBL:KFP01828.1, ECO:0000313|Proteomes:UP000054308};
RN   [1] {ECO:0000313|EMBL:KFP01828.1, ECO:0000313|Proteomes:UP000054308}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_N300 {ECO:0000313|EMBL:KFP01828.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KL218048; KFP01828.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A091I302; -.
DR   STRING; 9244.A0A091I302; -.
DR   Proteomes; UP000054308; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR011489; EMI_domain.
DR   PANTHER; PTHR15427:SF19; COLLAGEN ALPHA-1(XXVI) CHAIN; 1.
DR   PANTHER; PTHR15427; EMILIN ELASTIN MICROFIBRIL INTERFACE-LOCATED PROTEIN ELASTIN MICROFIBRIL INTERFACER; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF07546; EMI; 1.
DR   PROSITE; PS51041; EMI; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:KFP01828.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000054308};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..410
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001874912"
FT   DOMAIN          50..126
FT                   /note="EMI"
FT                   /evidence="ECO:0000259|PROSITE:PS51041"
FT   REGION          195..268
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          298..355
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        202..220
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        248..265
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        330..355
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         410
FT                   /evidence="ECO:0000313|EMBL:KFP01828.1"
SQ   SEQUENCE   410 AA;  43263 MW;  30230AA0383D4E4B CRC64;
     MRFALFLSWC CCLHGCALGT GFLYQFPAST LQHNYPEQSS GSPGRGFVTK RHWCHYTVTR
     TVSCQVQNGS ETVIQRVYQS CRWPGPCANL VSYRTLIRPT YKMSYRTVTT LEWRCCPGFT
     GSNCEEECMN CTRLADMTER LNTLEAKVLL LEAAERSPAL ENNLPITGTT ATWYDELLPD
     AFPLLNPGTV LRKAVGSPGQ VGPPGPVGPT GLPGPPGPKG EKGQPGERGP AGPPGLLGPQ
     GPRGLPGETG IPGPPGPPGP PATPSLPLTF QQGVLYSLQP TAEKESKFPS LLMVFSQHSA
     SGPKGPPGPI GAPGSQGLPG SPGQPGPKGS KGDRGERGEP GKKGEEGDKG ADGEGVQQLR
     EALKILAERV LILEHMIGIH DSLSSIEPGS GQDVIPGSPL RTSIKIKRGG
//
DBGET integrated database retrieval system