GenomeNet

Database: UniProt
Entry: V8P8B6_OPHHA
LinkDB: V8P8B6_OPHHA
Original site: V8P8B6_OPHHA 
ID   V8P8B6_OPHHA            Unreviewed;      1580 AA.
AC   V8P8B6;
DT   19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT   19-FEB-2014, sequence version 1.
DT   27-MAR-2024, entry version 39.
DE   SubName: Full=Collagen alpha-1(XI) chain {ECO:0000313|EMBL:ETE70570.1};
DE   Flags: Fragment;
GN   Name=COL11A1 {ECO:0000313|EMBL:ETE70570.1};
GN   ORFNames=L345_03619 {ECO:0000313|EMBL:ETE70570.1};
OS   Ophiophagus hannah (King cobra) (Naja hannah).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC   Serpentes; Colubroidea; Elapidae; Elapinae; Ophiophagus.
OX   NCBI_TaxID=8665 {ECO:0000313|EMBL:ETE70570.1, ECO:0000313|Proteomes:UP000018936};
RN   [1] {ECO:0000313|EMBL:ETE70570.1, ECO:0000313|Proteomes:UP000018936}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Blood {ECO:0000313|EMBL:ETE70570.1};
RX   PubMed=24297900; DOI=10.1073/pnas.1314702110;
RA   Vonk F.J., Casewell N.R., Henkel C.V., Heimberg A.M., Jansen H.J.,
RA   McCleary R.J., Kerkkamp H.M., Vos R.A., Guerreiro I., Calvete J.J.,
RA   Wuster W., Woods A.E., Logan J.M., Harrison R.A., Castoe T.A.,
RA   de Koning A.P., Pollock D.D., Yandell M., Calderon D., Renjifo C.,
RA   Currier R.B., Salgado D., Pla D., Sanz L., Hyder A.S., Ribeiro J.M.,
RA   Arntzen J.W., van den Thillart G.E., Boetzer M., Pirovano W., Dirks R.P.,
RA   Spaink H.P., Duboule D., McGlinn E., Kini R.M., Richardson M.K.;
RT   "The king cobra genome reveals dynamic gene evolution and adaptation in the
RT   snake venom system.";
RL   Proc. Natl. Acad. Sci. U.S.A. 110:20651-20656(2013).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ETE70570.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AZIM01000526; ETE70570.1; -; Genomic_DNA.
DR   Proteomes; UP000018936; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0090729; F:toxin activity; IEA:UniProtKB-KW.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF42; COLLAGEN ALPHA-1(XI) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF02210; Laminin_G_2; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:ETE70570.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000018936};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530};
KW   Signal {ECO:0000256|ARBA:ARBA00022729};
KW   Toxin {ECO:0000256|ARBA:ARBA00022656}.
FT   DOMAIN          1351..1579
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          215..255
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          357..417
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          434..1149
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1174..1336
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        236..250
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        784..802
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        967..981
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1022..1036
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1096..1129
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1222..1236
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1273..1291
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1307..1321
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:ETE70570.1"
FT   NON_TER         1580
FT                   /evidence="ECO:0000313|EMBL:ETE70570.1"
SQ   SEQUENCE   1580 AA;  159889 MW;  9C28A771853D51B1 CRC64;
     GKFPEDFSVL FTIKPKKGVQ SFLLSIYNEH GIQQVGIEVG RSPIFLFEDQ NGKPAPEDYP
     IFGTVNMADG KWHRVAISVE RKSVTMIVDC KKKTTKPLER SSPAVIDTNG ITVFGTRILD
     EEVFEGDIQQ MLIVGDPRAA YDYCEHYSPD CDSPVPNAPQ AQEPQVEEYA TEDLIEYDYD
     YGDPDYKDYK DYKDLETVTE GPPLYDETVA YTEKKNKAPK KKKKTGIASS KNNPQKLKLK
     KSETLPSKKK KSSRAAAKGK IGANVFDEYH EYNVAEVDYG TEAYQTAIPT QTTGRNEQLP
     VEEVFTEEYV TGEDYNSKTK NNEETDYGNR GVDHSEAEVV VDGDLGEYDF YEYKEYEEKP
     TSSTNEEFGP GVPAETDITE TTGLSGPSGM QGPIGPPGDP GDRFRFSGGG EKGPAISPQE
     AQAQAILQQA RIAMRGPPGP MGLTGRPGPV GSPGSSGIKG DGGDPGPQGP RGVQGPPGLP
     GKSGKRGDRG FDGLPGLPGE KGHRGDRGPQ GPPGAPGEDG MRGPRGLLGP RGTPGLPGQP
     GIPGVDGPPG PKGNLGPQGE PGPHGQQGNP GPQGLPGPQG PIGVPGEKGP QGKPGLAGLP
     GTDGPPGHPG KEGQSGEKGD TGPAGHQGPV GYPGPRGVKG EDGFPGFKGD MGLKGDRGEV
     GSLGPRGEDG PEGPKGRAGP TGDSGGPGQA GEKGKLGVPG LPGYPGRQGP KGSTGFPGFP
     GSNGEKGGRG LHGKPGPRGQ RGPTGPRGSR GARGPTGKPG PKGTAGGDGP PGPSGERGPQ
     GPQGPVGFPG PKGPPGPPGK DGLPGHPGQR GETGFQGKTG PPGPGGVVGP QGPTGETGPI
     GERGHPGPPG PPGEQGLPGA GGKEGAKGDP GPQGISGKDG PAGLRGFPGE RGLPGTQGSS
     GERGPAGTAG PIGLPGRPGP QGPPGPAGEK GGPGEKGPQG PAGRDGIQGP VGLPGPAGPA
     GSPGEDGDKG EIGEPGQKGS KGDKGENGPS GPPGLQGPIG GPGIAGLPGP PGEKGENGDV
     GPMGPPGPPG PRGPQGPSGS DGPQGPSGSA GSVGGVGQKG EPGEAGNPGP PGEGGPTGPK
     GERGEKGESG PPGAAGPPGL KGPPGDDGPK GNPGPVGFPG DPGPPGEPGP AGTDGTGGEK
     GDDGDPGQPV LLVKQVPLAL LEREGLLEQL VQKEDKGEAG SEGAPGKTGP VGPQGPAGKS
     GPEGLRGIPG PVGEQGLPGV PGQDGPPGPM GPPGLPGLKG DPGSKGEKGH PGLIGLIGPP
     GEQGEKGDRG LPGTQGSTGS KGDSGPQGPK GSKGSSGPGG QKGDGGLPGP PGPPGPPGEV
     IQPLPMQASK KSKRSVDAKL SVAEDYTDGM EEIFGSLSSL KQDIEHMKYP IGTQNNPART
     CKDLQLCHPD FPDGEYWIDP NQGCTGDSFK VYCNFTADGE TCIYPDKKSE GFRISSWPKE
     NPGTWFSEFK RGKLLSYVDA EGNSISMVQM TFLKLLSASA RQNFTYNCHQ SVAWHDVSSD
     SYDKALRFLG SNDEEMSYDN NPYIKVLHDG CASRKGYAKT VMEISTPKID QLPIFDVMIN
     DFGDQNQKFG FEVGPVCFLG
//
DBGET integrated database retrieval system