ID F6ZWJ9_HORSE Unreviewed; 1296 AA.
AC F6ZWJ9;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 4.
DT 27-MAR-2024, entry version 82.
DE SubName: Full=Tenascin N {ECO:0000313|Ensembl:ENSECAP00000009647.4};
GN Name=TNN {ECO:0000313|Ensembl:ENSECAP00000009647.4,
GN ECO:0000313|VGNC:VGNC:24387};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000009647.4, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000009647.4, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000009647.4,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000009647.4}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000009647.4};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the tenascin family.
CC {ECO:0000256|ARBA:ARBA00008673}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSECAT00000012232.4; ENSECAP00000009647.4; ENSECAG00000011669.4.
DR VGNC; VGNC:24387; TNN.
DR GeneTree; ENSGT00940000160553; -.
DR HOGENOM; CLU_001162_0_1_1; -.
DR TreeFam; TF329915; -.
DR Proteomes; UP000002281; Chromosome 5.
DR Bgee; ENSECAG00000011669; Expressed in articular cartilage of joint and 14 other cell types or tissues.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProt.
DR CDD; cd00055; EGF_Lam; 1.
DR CDD; cd00063; FN3; 9.
DR CDD; cd00087; FReD; 1.
DR Gene3D; 3.90.215.10; Gamma Fibrinogen, chain A, domain 1; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 9.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR013111; EGF_extracell.
DR InterPro; IPR041161; EGF_Tenascin.
DR InterPro; IPR036056; Fibrinogen-like_C.
DR InterPro; IPR014716; Fibrinogen_a/b/g_C_1.
DR InterPro; IPR002181; Fibrinogen_a/b/g_C_dom.
DR InterPro; IPR020837; Fibrinogen_CS.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR002049; LE_dom.
DR NCBIfam; NF040941; GGGWT_bact; 1.
DR PANTHER; PTHR46708; TENASCIN; 1.
DR PANTHER; PTHR46708:SF5; TENASCIN N; 1.
DR Pfam; PF07974; EGF_2; 1.
DR Pfam; PF18720; EGF_Tenascin; 2.
DR Pfam; PF00147; Fibrinogen_C; 1.
DR Pfam; PF00041; fn3; 9.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00186; FBG; 1.
DR SMART; SM00060; FN3; 9.
DR SUPFAM; SSF56496; Fibrinogen C-terminal domain-like; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS00514; FIBRINOGEN_C_1; 1.
DR PROSITE; PS51406; FIBRINOGEN_C_2; 1.
DR PROSITE; PS50853; FN3; 9.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1296
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5040508048"
FT DOMAIN 263..352
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 353..443
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 444..533
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 534..615
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 620..709
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 710..791
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 796..885
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 886..969
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 972..1060
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1058..1275
FT /note="Fibrinogen C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51406"
FT REGION 867..887
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1296 AA; 142911 MW; 78A01C868E1175D0 CRC64;
MGLHGIFSFP LGLLFGSVLL VASAPAALEP HGCSDKEQQV TFSHTYKIDV PKSALVQVEA
NPQPLSDDGA SLLVPGEAEE HNIIFRHNIR LQTPQKDCEL AGSVQDLLAR VKKLEEEMAE
VKEQCSARCC CQGAAGVSGH CSSHGTFSPE TCSCLCEQGW EGAACDRPAC PGACSGHGRC
VDGRCVCDEP YVGADCAYPA CPENCSGHGV CVRGVCQCHE DFTSEDCSER RCPGDCSGHG
FCDTGECYCE EGFTGLDCSQ VVAPQGLQLL KSTEDSLLVN WEPSSEVDHY LLSYYQLGKE
LSGKQIQVPK EQHTYEITGL QPGTKYIVSL RNVKKEISSS PQHLLATTDL AVLGTAWVTD
ETENSLDVEW ENPPTEVDYY KLRYGPLTGQ EVAEVTVPKS SDPKSRYDIT GLQPGTEYKI
TVVPMRGDLE GKSILLNGRT EIDSPTNVVT HQVTEDTAMV SWDPVQAVID KYVVRHISAD
GESKDMAVPR EQSSTILTGL KPGEAYKVYV WAEKGSQESK KADTEALTEI DSPTNLVTDL
VTENMATVSW DPVQADIDRY MVRYTSADGD TKDVPVRKEQ NSTILTGLRP GVEYKVHVWA
EKGDRESKKA DTKAPTDIDS PKNLVTDLVT ENTATVSWDP VQAVIDRYMV RYTSADGDTK
DVPVGKEQNS TILTGLRPGV EYKVYVWAEK GDQESKKADT KAPTDIDSPQ NLVTNHVTEN
TAAVSWDPVQ AVIDRYMVRY TSADGDTREV PVGKEQSSTV LTGLRPGVEY TVHVWAEKGD
RESKKADTKA PTEIDSPQNL VTNQVTENTA TVSWDPVQAD IDRYMVRYTS ADGDTKDVPV
GKEQNSTILT GLRPGVEYTV HVWAKKGDQE SKKADTKAPT DIDSPKNLVT DRVTENTATI
SWDPVQADID RYVVRYTSAD GDAREVPVGK EQSSTILTGL RPGVEYKVHV WAEKGDWESK
KADTKALPDI DPPKNLRSSA VTQSGGVLTW TPPTAQIDGY ILTYQFPNGA VKEVQLGQGD
QRFELQGLEQ GVAYPVSLVA FKGDRRSRNV STILSTVGAR FPHPSDCSQV QQNSNVASGV
YTIYLHGDAS RPLQVYCDMD TDGGGWIVFQ RRNTGQLDFF KRWRAYVEGF GDPMKEFWLG
LDKLHNLTTG TPTRYEVRVD LQTANESAYA IYDFFQVASS KERFKLTVGK YRGTAGDALS
YHNGWKFTTF DRDNDIALSN CALTHHGGWW YKNCHLANPN GRYGETKHSE GVNWEPWKGH
EFSIPYVELK IRPHGYSGEH VLNRKKRTLG GKTRTV
//