ID H2LBK6_ORYLA Unreviewed; 1630 AA.
AC H2LBK6;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 67.
DE RecName: Full=Zmp:0000000846 {ECO:0008006|Google:ProtNLM};
GN Name=tnca {ECO:0000313|Ensembl:ENSORLP00000003280.2};
OS Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC Oryzias.
OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000003280.2, ECO:0000313|Proteomes:UP000001038};
RN [1] {ECO:0000313|Ensembl:ENSORLP00000003280.2, ECO:0000313|Proteomes:UP000001038}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000003280.2,
RC ECO:0000313|Proteomes:UP000001038};
RX PubMed=17554307; DOI=10.1038/nature05846;
RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT "The medaka draft genome and insights into vertebrate genome evolution.";
RL Nature 447:714-719(2007).
RN [2] {ECO:0000313|Ensembl:ENSORLP00000003280.2}
RP IDENTIFICATION.
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000003280.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the tenascin family.
CC {ECO:0000256|ARBA:ARBA00008673}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSORLT00000003281.2; ENSORLP00000003280.2; ENSORLG00000002627.2.
DR GeneTree; ENSGT00940000155188; -.
DR HOGENOM; CLU_026380_0_0_1; -.
DR Proteomes; UP000001038; Chromosome 12.
DR Bgee; ENSORLG00000002627; Expressed in muscle tissue and 13 other cell types or tissues.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00063; FN3; 9.
DR CDD; cd00087; FReD; 1.
DR Gene3D; 2.20.25.10; -; 1.
DR Gene3D; 3.90.215.10; Gamma Fibrinogen, chain A, domain 1; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 9.
DR Gene3D; 2.10.25.10; Laminin; 13.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR013111; EGF_extracell.
DR InterPro; IPR041161; EGF_Tenascin.
DR InterPro; IPR036056; Fibrinogen-like_C.
DR InterPro; IPR014716; Fibrinogen_a/b/g_C_1.
DR InterPro; IPR002181; Fibrinogen_a/b/g_C_dom.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR NCBIfam; NF040941; GGGWT_bact; 1.
DR PANTHER; PTHR46708; TENASCIN; 1.
DR PANTHER; PTHR46708:SF1; TENASCIN; 1.
DR Pfam; PF07974; EGF_2; 4.
DR Pfam; PF18720; EGF_Tenascin; 7.
DR Pfam; PF00147; Fibrinogen_C; 1.
DR Pfam; PF00041; fn3; 9.
DR SMART; SM00181; EGF; 13.
DR SMART; SM00186; FBG; 1.
DR SMART; SM00060; FN3; 9.
DR SUPFAM; SSF56496; Fibrinogen C-terminal domain-like; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR PROSITE; PS00022; EGF_1; 6.
DR PROSITE; PS01186; EGF_2; 6.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS51406; FIBRINOGEN_C_2; 1.
DR PROSITE; PS50853; FN3; 7.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000001038};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..1630
FT /note="Zmp:0000000846"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017240613"
FT DOMAIN 315..346
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 558..594
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 598..688
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 777..866
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 867..960
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 961..1048
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1138..1230
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1231..1317
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1318..1406
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1404..1619
FT /note="Fibrinogen C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51406"
FT DISULFID 319..329
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 336..345
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 584..593
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1630 AA; 176174 MW; BAF225E7FDEEA631 CRC64;
MGAQSLLGCF LLFALFTLPD AGLIKKILRH RRQTLSPHKE HNATLPGAGH PLVFSHVYNI
NVPASALCSV DLDAPEGQGL QPNDASDGHQ VTEHTVDGES QIVFTHRISI PQQACGCGDR
PPGLKELMSR LEMLEGEVSA LREQCGGEAA CCGAPVTGEL RTKPFCSGRG NYSSETCGCV
CEPGWRGPNC SEPECPGGCQ DRGRCVDGRC ECWRGFAGDD CSLQVCPVSC GTHGRCVGTV
CVCEDGFHGD DCSQSRCLND CLGRGHCDDG DCVCDEPWTG YDCSELICPK DCYDRGRCVN
GTCFCDEGFS GEDCGQHSCP NNCRGNGVCV DGKCICTAGY SGEDCSQPTC LSDCSGRGTC
IKGMCMCDPG YQGDDCSQVA CLKNCRGRGQ CINGRCSCDA GFQGDDCAEL SCPNSCRQRG
QCVNGQCVCD QGFAGEDCSI HTCPSDCYGR GTCVHGRCVC HAGFSGNDCS ELSCPNDCKG
RGLCVDGQCI CDEGFSGEDC SRRACPNDCL GRGDCLEGRC VCREGFSGDD CSAVSCPENC
SGRGSCVDGR CSCESGYEGD SCAERSCSNS CHQRGSCVNA QCVCDEGYIG EDCSEVSPPK
DLTVGEVTED TVDLSWDNEM LVTEYLVTYA PTIPGGLLME FTVAGDQSAA TVTELEPGME
YLISVYAVLS NKRSIPVSAR VVTDLPQPQG LRFKSVRETS VEVVWDQLDI SFDGWEIYFR
NTKEENGKVK SILPPSQNQL VQSGLGPGQE YEVSVSVIKN NTRGPQTSKR VTTKIDGPQQ
VEVKDVTESS ALVSWYLPVA PVDRVGVFYA PGSDPSAQTA VDVLPSDKQL SMDGLRPDTQ
YTVLLVSRSG NATSDPVTTT FTTALDAPTG LQAVSQTDNS VTLEWTNSQA DVGSYRVKYS
PISGAAHGEE VVPRGSGLTT QATITGLNPG TEYGMGVTAV RNERESLPAT TNAATDLDPP
REFERVESTE TSLTVRWQKP DAKVSRYRLT HTSRDGQFGE EEVPASESTH VLRSLSPGMT
YTLTLTAERG HRRSRPVSLS ASTEAEPEVD HLFVSDITAD GFRLSWAADE DIFDRFVIKL
RDSKRLAHPQ EYSARGDERT KVITGLMSGT EYEIELYGVT LDQRSQPVTV VAQTGLSSPR
GLRFSDVTDS TAVVHWSTLG SLVDNYRITY TPFGGGQFGS PLIVTSDGST SQSLLVNMIP
GKTYVVTVSA VKGLEESEPS RDTVTTALDS PQSLAAVNVT DVSALLLWQP SVATVDGYII
TLSAESVPPV VEHVSGNTAE FQMRSLLPGT TYSVGVYGVK GAQKSASAVT EFTTGVDPPR
DLTATNVQIE SATLTWKPPQ AAITGYMLSF SSADAVIREV MLSPTASSYS MSQLTGSTEY
NVRLQAIAGP QRSRSVTAVF STFGQLYRRP RDCAQILLNG ETTSGLYTIY IGGEESLPIQ
VYCDMSTDGG GWMVFLRRQN GRLDFFRNWK NYTAGFGNMN EEFWLGLSSL HKITALGHYE
LRVDLKDKGE SAYAQYDKFT LSEPRSRYKI NIGAYSGTAG DSMTYHQGRP FSTYDNDNDI
AVTNCALSYK GAFWYKNCHR VNLMGKYGDS SHSKGINWFH WKGHEHSIEF AEMKIRPANF
RNLESRKKRS
//