ID V8NRE5_OPHHA Unreviewed; 668 AA.
AC V8NRE5;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE SubName: Full=Collagen alpha-1(VIII) chain {ECO:0000313|EMBL:ETE64263.1};
DE Flags: Fragment;
GN Name=Col8a1 {ECO:0000313|EMBL:ETE64263.1};
GN ORFNames=L345_09965 {ECO:0000313|EMBL:ETE64263.1};
OS Ophiophagus hannah (King cobra) (Naja hannah).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Serpentes; Colubroidea; Elapidae; Elapinae; Ophiophagus.
OX NCBI_TaxID=8665 {ECO:0000313|EMBL:ETE64263.1, ECO:0000313|Proteomes:UP000018936};
RN [1] {ECO:0000313|EMBL:ETE64263.1, ECO:0000313|Proteomes:UP000018936}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Blood {ECO:0000313|EMBL:ETE64263.1};
RX PubMed=24297900; DOI=10.1073/pnas.1314702110;
RA Vonk F.J., Casewell N.R., Henkel C.V., Heimberg A.M., Jansen H.J.,
RA McCleary R.J., Kerkkamp H.M., Vos R.A., Guerreiro I., Calvete J.J.,
RA Wuster W., Woods A.E., Logan J.M., Harrison R.A., Castoe T.A.,
RA de Koning A.P., Pollock D.D., Yandell M., Calderon D., Renjifo C.,
RA Currier R.B., Salgado D., Pla D., Sanz L., Hyder A.S., Ribeiro J.M.,
RA Arntzen J.W., van den Thillart G.E., Boetzer M., Pirovano W., Dirks R.P.,
RA Spaink H.P., Duboule D., McGlinn E., Kini R.M., Richardson M.K.;
RT "The king cobra genome reveals dynamic gene evolution and adaptation in the
RT snake venom system.";
RL Proc. Natl. Acad. Sci. U.S.A. 110:20651-20656(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETE64263.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZIM01002324; ETE64263.1; -; Genomic_DNA.
DR AlphaFoldDB; V8NRE5; -.
DR Proteomes; UP000018936; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0090729; F:toxin activity; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.40; -; 1.
DR InterPro; IPR001073; C1q_dom.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR PANTHER; PTHR15427:SF50; C1Q DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR15427; EMILIN ELASTIN MICROFIBRIL INTERFACE-LOCATED PROTEIN ELASTIN MICROFIBRIL INTERFACER; 1.
DR Pfam; PF00386; C1q; 1.
DR Pfam; PF01391; Collagen; 2.
DR PRINTS; PR00007; COMPLEMNTC1Q.
DR SMART; SM00110; C1Q; 1.
DR SUPFAM; SSF49842; TNF-like; 1.
DR PROSITE; PS50871; C1Q; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:ETE64263.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000018936};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Toxin {ECO:0000256|ARBA:ARBA00022656}.
FT DOMAIN 535..668
FT /note="C1q"
FT /evidence="ECO:0000259|PROSITE:PS50871"
FT REGION 33..501
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..55
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 170..186
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 299..313
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 474..492
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:ETE64263.1"
SQ SEQUENCE 668 AA; 65427 MW; 9FB3F21C742FB94A CRC64;
MLVTSRAVKM KVNIPFFFCL SLVKKAELPL ASLRGEQGPP GEPGPRGPPG SPGLPGHGVP
GAKGKPGPQG YPGIGKPGMP GMPGKPGAMG SPGSRGEMGP KGESGSMGIP GPQGPPGPQG
LPGIGKPGDR GLPGQPGAKA EPGMKGQPGI QGPQGPKGDK GVGIPGLPGL KGPPGLPGPP
GPVGLPGIGK PGMIGFPGPQ GAVGKPGIPG EPGPRGLAGA HGIQGPPGLP GIGKPGQDGI
PGQPGMPGGK GEQGLPGLPG PPGAPGIGKP GFPGLKGDRG MGGALGPLGP KGEKGHVGPP
GMPGPPGEPG QPGQPGLMGP SGVMGFPGPK GEVGAVGPPG PIGPKGELGL QGFPGKPGFP
GEVGPSGLRG LPGPIGPKGE TGHKGLPGLP GAHGLMGPKG EPGKPGTQGH QGPTGIPGIA
GPSGPIGPPG LPGTKGERGF PGPPGFPGMG KPGVSGMPGP PGKPGSLGAP GQPGLQGPPG
PPGPPGPTVI MQPTPPAMGQ YLPEVGPGLD GIKSPYGYKG KKSKNGAAAA AAAAAAYEMP
AFTAELTTPF PRVRVPVVFD KLLYNGRQNY NPQTGVFTCE IPGIYYFAYH IHCKGGNVWV
ALFKNNEPLM YTYDEYKKGF LDQASGSAVV QLLHGDRVYI QMPSEQAAGL YAGQYVHSSF
SGYLLYPM
//