ID V8P8B6_OPHHA Unreviewed; 1580 AA.
AC V8P8B6;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE SubName: Full=Collagen alpha-1(XI) chain {ECO:0000313|EMBL:ETE70570.1};
DE Flags: Fragment;
GN Name=COL11A1 {ECO:0000313|EMBL:ETE70570.1};
GN ORFNames=L345_03619 {ECO:0000313|EMBL:ETE70570.1};
OS Ophiophagus hannah (King cobra) (Naja hannah).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Serpentes; Colubroidea; Elapidae; Elapinae; Ophiophagus.
OX NCBI_TaxID=8665 {ECO:0000313|EMBL:ETE70570.1, ECO:0000313|Proteomes:UP000018936};
RN [1] {ECO:0000313|EMBL:ETE70570.1, ECO:0000313|Proteomes:UP000018936}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Blood {ECO:0000313|EMBL:ETE70570.1};
RX PubMed=24297900; DOI=10.1073/pnas.1314702110;
RA Vonk F.J., Casewell N.R., Henkel C.V., Heimberg A.M., Jansen H.J.,
RA McCleary R.J., Kerkkamp H.M., Vos R.A., Guerreiro I., Calvete J.J.,
RA Wuster W., Woods A.E., Logan J.M., Harrison R.A., Castoe T.A.,
RA de Koning A.P., Pollock D.D., Yandell M., Calderon D., Renjifo C.,
RA Currier R.B., Salgado D., Pla D., Sanz L., Hyder A.S., Ribeiro J.M.,
RA Arntzen J.W., van den Thillart G.E., Boetzer M., Pirovano W., Dirks R.P.,
RA Spaink H.P., Duboule D., McGlinn E., Kini R.M., Richardson M.K.;
RT "The king cobra genome reveals dynamic gene evolution and adaptation in the
RT snake venom system.";
RL Proc. Natl. Acad. Sci. U.S.A. 110:20651-20656(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETE70570.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZIM01000526; ETE70570.1; -; Genomic_DNA.
DR Proteomes; UP000018936; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0090729; F:toxin activity; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF42; COLLAGEN ALPHA-1(XI) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 5.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:ETE70570.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000018936};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Toxin {ECO:0000256|ARBA:ARBA00022656}.
FT DOMAIN 1351..1579
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 215..255
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 357..417
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 434..1149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1174..1336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..250
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 784..802
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 967..981
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1022..1036
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1096..1129
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1222..1236
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1273..1291
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1307..1321
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:ETE70570.1"
FT NON_TER 1580
FT /evidence="ECO:0000313|EMBL:ETE70570.1"
SQ SEQUENCE 1580 AA; 159889 MW; 9C28A771853D51B1 CRC64;
GKFPEDFSVL FTIKPKKGVQ SFLLSIYNEH GIQQVGIEVG RSPIFLFEDQ NGKPAPEDYP
IFGTVNMADG KWHRVAISVE RKSVTMIVDC KKKTTKPLER SSPAVIDTNG ITVFGTRILD
EEVFEGDIQQ MLIVGDPRAA YDYCEHYSPD CDSPVPNAPQ AQEPQVEEYA TEDLIEYDYD
YGDPDYKDYK DYKDLETVTE GPPLYDETVA YTEKKNKAPK KKKKTGIASS KNNPQKLKLK
KSETLPSKKK KSSRAAAKGK IGANVFDEYH EYNVAEVDYG TEAYQTAIPT QTTGRNEQLP
VEEVFTEEYV TGEDYNSKTK NNEETDYGNR GVDHSEAEVV VDGDLGEYDF YEYKEYEEKP
TSSTNEEFGP GVPAETDITE TTGLSGPSGM QGPIGPPGDP GDRFRFSGGG EKGPAISPQE
AQAQAILQQA RIAMRGPPGP MGLTGRPGPV GSPGSSGIKG DGGDPGPQGP RGVQGPPGLP
GKSGKRGDRG FDGLPGLPGE KGHRGDRGPQ GPPGAPGEDG MRGPRGLLGP RGTPGLPGQP
GIPGVDGPPG PKGNLGPQGE PGPHGQQGNP GPQGLPGPQG PIGVPGEKGP QGKPGLAGLP
GTDGPPGHPG KEGQSGEKGD TGPAGHQGPV GYPGPRGVKG EDGFPGFKGD MGLKGDRGEV
GSLGPRGEDG PEGPKGRAGP TGDSGGPGQA GEKGKLGVPG LPGYPGRQGP KGSTGFPGFP
GSNGEKGGRG LHGKPGPRGQ RGPTGPRGSR GARGPTGKPG PKGTAGGDGP PGPSGERGPQ
GPQGPVGFPG PKGPPGPPGK DGLPGHPGQR GETGFQGKTG PPGPGGVVGP QGPTGETGPI
GERGHPGPPG PPGEQGLPGA GGKEGAKGDP GPQGISGKDG PAGLRGFPGE RGLPGTQGSS
GERGPAGTAG PIGLPGRPGP QGPPGPAGEK GGPGEKGPQG PAGRDGIQGP VGLPGPAGPA
GSPGEDGDKG EIGEPGQKGS KGDKGENGPS GPPGLQGPIG GPGIAGLPGP PGEKGENGDV
GPMGPPGPPG PRGPQGPSGS DGPQGPSGSA GSVGGVGQKG EPGEAGNPGP PGEGGPTGPK
GERGEKGESG PPGAAGPPGL KGPPGDDGPK GNPGPVGFPG DPGPPGEPGP AGTDGTGGEK
GDDGDPGQPV LLVKQVPLAL LEREGLLEQL VQKEDKGEAG SEGAPGKTGP VGPQGPAGKS
GPEGLRGIPG PVGEQGLPGV PGQDGPPGPM GPPGLPGLKG DPGSKGEKGH PGLIGLIGPP
GEQGEKGDRG LPGTQGSTGS KGDSGPQGPK GSKGSSGPGG QKGDGGLPGP PGPPGPPGEV
IQPLPMQASK KSKRSVDAKL SVAEDYTDGM EEIFGSLSSL KQDIEHMKYP IGTQNNPART
CKDLQLCHPD FPDGEYWIDP NQGCTGDSFK VYCNFTADGE TCIYPDKKSE GFRISSWPKE
NPGTWFSEFK RGKLLSYVDA EGNSISMVQM TFLKLLSASA RQNFTYNCHQ SVAWHDVSSD
SYDKALRFLG SNDEEMSYDN NPYIKVLHDG CASRKGYAKT VMEISTPKID QLPIFDVMIN
DFGDQNQKFG FEVGPVCFLG
//