ID A0A091HPQ1_CALAN Unreviewed; 1484 AA.
AC A0A091HPQ1;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE SubName: Full=Collagen alpha-1(XVII) chain {ECO:0000313|EMBL:KFO98248.1};
DE Flags: Fragment;
GN ORFNames=N300_12459 {ECO:0000313|EMBL:KFO98248.1};
OS Calypte anna (Anna's hummingbird) (Archilochus anna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Caprimulgimorphae; Apodiformes;
OC Trochilidae; Calypte.
OX NCBI_TaxID=9244 {ECO:0000313|EMBL:KFO98248.1, ECO:0000313|Proteomes:UP000054308};
RN [1] {ECO:0000313|EMBL:KFO98248.1, ECO:0000313|Proteomes:UP000054308}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N300 {ECO:0000313|EMBL:KFO98248.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL217736; KFO98248.1; -; Genomic_DNA.
DR STRING; 9244.A0A091HPQ1; -.
DR Proteomes; UP000054308; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 2.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1105; MULTIPLEXIN, ISOFORM R; 1.
DR Pfam; PF01391; Collagen; 4.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:KFO98248.1}; Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000054308};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 459..483
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT REGION 1..155
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 556..999
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1015..1036
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1159..1184
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1197..1223
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1247..1272
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1289..1324
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1341..1363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1429..1484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..137
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 812..835
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 853..889
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 947..961
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 979..993
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1302..1324
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1484
FT /evidence="ECO:0000313|EMBL:KFO98248.1"
SQ SEQUENCE 1484 AA; 150422 MW; 6145711D417AB14A CRC64;
MDSVTKKTRQ DGSEVTERLI TETVTTRLTS LPPKGGSSSG LKTSSHTGGT GVEKRSYTHS
SSYVTSSGSG RLNSSSSSSS YRQAQSPSST LTKSPGSTFE RKTYVNRHAT YEGSSSANSS
PEFPRKESAS ASTRGRSQTR ESEIRVRLQS ASPSGRWTEL DDVKRLLKGS RSASCSPTRS
PSSTLPIPKK AVVETKMITE SSQSVSGTYD TTILNTALPS YMWSSTLPAG SSLGGHHNSS
SLINSMSHST GSVFGVPNNL APTNHALNTG LSTSSTVFGV QNNLSPSCAT LTQNSTAAST
AYGMKNTSQS NSMASTGVSA SAGGTVVTSQ GDDILRKDCK FLLLDKDNAP AKKETELLIM
TKDSGKVFNA STTGLNGGSF VEDTLKKEKQ GFSSAYATDT GLKSDGNGGL KSMQARDKAA
YAGKLSFEGS PKHKIRNGDA GGAVGSAPSW CPCGSCCSWW KWLLGLLLAW LLLLGLLFGL
IALAEEVRKL KSRVENLEKI NGGLLTINGG DSKIAKDVSR VDYLQGISPS STFPYENEES
VWLMVKNRLN KERERGYFRG EKGEPGEKGD IGMQGPRGDR GLPGSPGPIG HQGPEGPKGQ
KGSMGDPGME GPMGQRGREG PPGPRGEPGP AGAGGKGDRG GVGEPGPPGP PGPPGSPGLK
GLMGSPGPQG LPGPPGLQGF RGEAGLPGAK GEKGATGPPG PKGDHGEKGS RGITGEQGSR
GMPGPPGEPG AKGPAGQAGR DGQPGEKGEP GLMGMPGARG PPGPSGDAGQ PGLTGPQGPP
GLPGDPGRPG AKGEPGSPGK VVSAEGLSTV ALPGPPGPPG PVGPTGPPGV PGPVGPAGLP
GQQVLADLQG RAGPPGPPGP PGESVQGPPG PRGPPGEGLP GPPGPPGRPG SSMSTSETLF
TGPPGPPGPP GPKGAQGERG PRGFTGEPGE PGLPAFSSHG DRITMQGPPG PPGPPGPKGD
AGVPGIPGAS RDGSRQIQGP PGPPGPPGPP GPGGSSSQEI QQHINDYLKS DNVRHYLTGA
QGPPGPPGPP GIITTTDGIN LDYAELATRV MSYMTSSSDH YESFASSVST TSVLYRELLD
MLQREEIRQY LIGPQGPPGP PGPGVDGMSL SLDYDELTRR FISYLTSSGM SIGLPGPPGP
PGPPGISYSD LTAYLRNSEF GGLVGPPGPA GPPGPPGSPG TSLEDVSAYL QSIGYSSFSG
VQGPPGPPGP PGPPGFSGTG LLSYADITNS DEFRSELIQY LKSDEIRSYI SGPPGPPGPR
GPPGPKGDSS LVAGAVSSSY RGLRTSEELH GGSLGAEGSH RGSLGTGSSY GSSMSSVASY
SASVGGDGTY DSSMGSDGTF DGLLTEGESH RRSSSSRSYS NSFTGSLDYN ELALRVSESL
QSQGILQDLM SYTARGPAGP PGPPGPPGIS RVFAAYGNVT EDLMDFFRTY GTIQGPPGQK
GEKGYPGPKG DPGPMGPPGR QGHRGPKGEK GEKGMNINHP PGEQ
//