ID H2S966_TAKRU Unreviewed; 1774 AA.
AC H2S966;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 3.
DT 27-MAR-2024, entry version 72.
DE SubName: Full=Tenascin C {ECO:0000313|Ensembl:ENSTRUP00000008942.3};
OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Takifugu.
OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000008942.3, ECO:0000313|Proteomes:UP000005226};
RN [1] {ECO:0000313|Ensembl:ENSTRUP00000008942.3, ECO:0000313|Proteomes:UP000005226}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21551351;
RA Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., Hosoya S.,
RA Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.;
RT "Integration of the genetic map and genome assembly of fugu facilitates
RT insights into distinct features of genome evolution in teleosts and
RT mammals.";
RL Genome Biol. Evol. 3:424-442(2011).
RN [2] {ECO:0000313|Ensembl:ENSTRUP00000008942.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the tenascin family.
CC {ECO:0000256|ARBA:ARBA00008673}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 31033.ENSTRUP00000085510; -.
DR Ensembl; ENSTRUT00000008995.3; ENSTRUP00000008942.3; ENSTRUG00000003805.3.
DR GeneTree; ENSGT00940000155188; -.
DR Proteomes; UP000005226; Chromosome 21.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00063; FN3; 10.
DR CDD; cd00087; FReD; 1.
DR Gene3D; 2.20.25.10; -; 1.
DR Gene3D; 3.90.215.10; Gamma Fibrinogen, chain A, domain 1; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 10.
DR Gene3D; 2.10.25.10; Laminin; 12.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR041161; EGF_Tenascin.
DR InterPro; IPR036056; Fibrinogen-like_C.
DR InterPro; IPR014716; Fibrinogen_a/b/g_C_1.
DR InterPro; IPR002181; Fibrinogen_a/b/g_C_dom.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR NCBIfam; NF040941; GGGWT_bact; 1.
DR PANTHER; PTHR46708; TENASCIN; 1.
DR PANTHER; PTHR46708:SF1; TENASCIN; 1.
DR Pfam; PF18720; EGF_Tenascin; 8.
DR Pfam; PF00147; Fibrinogen_C; 1.
DR Pfam; PF00041; fn3; 10.
DR SMART; SM00181; EGF; 12.
DR SMART; SM00186; FBG; 1.
DR SMART; SM00060; FN3; 10.
DR SUPFAM; SSF56496; Fibrinogen C-terminal domain-like; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 7.
DR PROSITE; PS00022; EGF_1; 5.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS51406; FIBRINOGEN_C_2; 1.
DR PROSITE; PS50853; FN3; 8.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000005226};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 580..611
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 618..708
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 798..887
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 888..980
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 981..1068
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1070..1161
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1288..1377
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1378..1461
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1462..1550
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1548..1763
FT /note="Fibrinogen C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51406"
FT DISULFID 584..594
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 601..610
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1774 AA; 195079 MW; 458B22528C09D1D5 CRC64;
MRGQTIIYLS VCTVFNVGHP PQVDICIDGS ALGSSFYSNC SANMGMKIIL LACISMSLLF
ELSTPGLVRK IIRHRREALM PKKSQENITL PLPDQPVVFN HVYNINVPST SLCSVDLDLP
GGPEVKHETP LKEIQNMEHI EHTEDGDNQI VFTHRITIPK QACSCRNQLL DLKTILNRLE
MLELELSSLR EQCSSGAGCC GAQVTGEIST KPYCNGHGNW STDTCSCICE PGWKGHNCSD
PECPGDCQDQ GRCLNGRCEC FEGFGGEDCS NELCLLDCGD YGHCVNGVCL CEEGFSGEDC
SQTSCLNNCF GRGSCHEDEC VCDEPWTGYD CSEIICPNDC YDHGRCINGT CECDEGYTGE
DCGDLSCPSH CNNHGMCLNG QCVCQTGYSG EDCSKRSCPK NCNEKGHCFN GKCICDPGHE
GEDCSILSCP DNCNSRGECI NGECVCDAGY QGEDCSVLAC PNNCLDRGNC VNGQCMCDKG
YSGEDCNIKT CPKNCMGRGD CVDGKCMCFT GFKGKDCGEM TCPRDCMNQG HCENGKCACH
NGYTGEDCSQ KTCPKNCHNR GYCIDGDCVC YEGFTGTDCS IIACPSDCLN QGHCKNGVCV
CEEGFTGEDC SAGRQISPPK DLTVVEVSPE AVDLSWENEM RVSEYLIKYA PTVPGGLELD
MQVPGDQKKA TLLELEPGVE YLISVYALLN NKKSVPVNAR VALDLPKPDG LKFKSVRDTS
VQVEWDPIDF PFDGWNLIFR NMVNKEEDGE ILNFLSHPET MFEQSGLGPG QEYEVKLEVV
KNNKRGPPAS KNVITSEFTL SLYIRDVTDT TALVTWMPPV AEVEEVSISY GPSSNPADRN
MVELSSTETQ YHLGGLHPDT QYEVSLTAHK GEWSSNPVHE SFLTELDAPK HLKTAEITDE
SITLEWENSR AQVDNYRIKY GPLSGGEHRE LLFTPGAKDY THAKITGLRA GTEYGMGVTA
VKDERESLPT TTNAVTALDS PKDLIVTKVT ETTMLLEWRH PQAKLDSYRL VYVSADGHRS
EEVLPGDLKS YSLMELTPGM LYTISINTER GSRTSAPITI SAFTEEEKPV VTHFTISDVS
WDSFHLSWST KDGAFQAFLI KVTDAETSSD VQNHTLPAAA QSLAISDLSA TTWYRVNLYG
LYRGALLAPV YADTITGINI TSYSHFGCLP SSFAERSKFF FESLTVSNQY TLIKFSKKKK
SSEAEPEIQA LLVSEVTPES FWLTWMAEED ALDTFVIMVS PADDPGHPKE LVLGSEKRSV
AIANLTEDTE YRIEMFGLSF GRSTKSVGNT GIRFSDVTDT STTVHWGAPR VRVDSYQITY
VPAHGGNAKT LTVDGSKSQT MLPNLTPGVT YEVTIVAVKG PRESLPASDS ITTALDKPRG
LVSINITDTG ALLRWQPAIA TIDGYVITYS ADGVMERVSG NVMEFEMSSL VPATRYTVKV
FAARDLAKST ATTTEFTTDV DTPSHLAASN VQTESAMLTW KAPRAGITGY ILSFESVDGA
VREVVLSPTA VSYNMAQLSA STDYSVKLQA IAGPKRSRVV TAVFKTTGVQ YRHPRDCSQV
ILNGDGSSGL YTIFLSGDEN QPLQVYCDMN TDGGGWMVFL RRQSGKLDFF RNWKNYTAGF
GDINDEFWLG LSNLNKITAA AQYELRVDLR DKGETAFAQY DRFSVSESRS RYKVHIGGYS
GTAGDSMTYH HGRPFSTYDN DNDIAVTNCA LSYKGAFWYK NCHRVNLMGR YGDNSHSKGV
NWFHWKGHEH SIEFAEMKLR PSNFRNLEGR RKRS
//