GenomeNet

Database: UniProt
Entry: A0A1U7S8Q5_ALLSI
LinkDB: A0A1U7S8Q5_ALLSI
Original site: A0A1U7S8Q5_ALLSI 
ID   A0A1U7S8Q5_ALLSI        Unreviewed;      2437 AA.
AC   A0A1U7S8Q5;
DT   10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT   10-MAY-2017, sequence version 1.
DT   27-MAR-2024, entry version 35.
DE   SubName: Full=Tenascin isoform X1 {ECO:0000313|RefSeq:XP_006031179.1};
GN   Name=TNC {ECO:0000313|RefSeq:XP_006031179.1};
OS   Alligator sinensis (Chinese alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=38654 {ECO:0000313|Proteomes:UP000189705, ECO:0000313|RefSeq:XP_006031179.1};
RN   [1] {ECO:0000313|RefSeq:XP_006031179.1}
RP   IDENTIFICATION.
RG   RefSeq;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the tenascin family.
CC       {ECO:0000256|ARBA:ARBA00008673}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_006031179.1; XM_006031117.3.
DR   STRING; 38654.A0A1U7S8Q5; -.
DR   GeneID; 102379949; -.
DR   KEGG; asn:102379949; -.
DR   CTD; 3371; -.
DR   eggNOG; KOG1225; Eukaryota.
DR   eggNOG; KOG2579; Eukaryota.
DR   eggNOG; KOG3544; Eukaryota.
DR   InParanoid; A0A1U7S8Q5; -.
DR   OrthoDB; 5489847at2759; -.
DR   Proteomes; UP000189705; Unplaced.
DR   CDD; cd00054; EGF_CA; 4.
DR   CDD; cd00063; FN3; 14.
DR   CDD; cd00087; FReD; 1.
DR   Gene3D; 2.20.25.10; -; 1.
DR   Gene3D; 3.90.215.10; Gamma Fibrinogen, chain A, domain 1; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 16.
DR   Gene3D; 2.10.25.10; Laminin; 16.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR013111; EGF_extracell.
DR   InterPro; IPR041161; EGF_Tenascin.
DR   InterPro; IPR036056; Fibrinogen-like_C.
DR   InterPro; IPR014716; Fibrinogen_a/b/g_C_1.
DR   InterPro; IPR002181; Fibrinogen_a/b/g_C_dom.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   NCBIfam; NF040941; GGGWT_bact; 1.
DR   PANTHER; PTHR46708; TENASCIN; 1.
DR   PANTHER; PTHR46708:SF1; TENASCIN; 1.
DR   Pfam; PF07974; EGF_2; 6.
DR   Pfam; PF18720; EGF_Tenascin; 8.
DR   Pfam; PF00147; Fibrinogen_C; 1.
DR   Pfam; PF00041; fn3; 16.
DR   SMART; SM00181; EGF; 17.
DR   SMART; SM00186; FBG; 1.
DR   SMART; SM00060; FN3; 16.
DR   SUPFAM; SSF56496; Fibrinogen C-terminal domain-like; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 12.
DR   PROSITE; PS00022; EGF_1; 6.
DR   PROSITE; PS01186; EGF_2; 7.
DR   PROSITE; PS50026; EGF_3; 4.
DR   PROSITE; PS51406; FIBRINOGEN_C_2; 1.
DR   PROSITE; PS50853; FN3; 14.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000189705};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..2437
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5010522236"
FT   DOMAIN          188..219
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          436..467
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          560..591
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          622..653
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          688..778
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          868..958
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          959..1049
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1050..1138
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1170..1265
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1311..1401
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1402..1492
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1493..1583
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1584..1674
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1675..1767
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1858..1947
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1948..2037
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2038..2124
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2125..2213
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2211..2426
FT                   /note="Fibrinogen C-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51406"
FT   REGION          1042..1062
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        192..202
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        209..218
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        440..450
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        457..466
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        564..574
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        581..590
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        626..636
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        643..652
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   2437 AA;  267049 MW;  CD193AF06DA439FE CRC64;
     MGLPTQVLAC AIIVLLHQHV SGGLIKRIIR QKREAGLNVT LPDDNQPVVF NHVYNIKLPV
     GSLCSVDLDS ASGDADLKAE IEPSKHYEEH TVNEDNQIVF THRINIPRRA CGCASAPDIK
     DLLNRLEELE GLVSSLREQC TIGAGCCPSA QTAEGRVDTT PFCSGRGNYS TEICGCICDP
     GWKGPNCSEP ECLNNCYNRG RCVNGKCICD EGFTGKDCSE LTCPGDCNDQ GKCIDGVCVC
     FEGYTGEDCV EELCLPPCSE HGKCVNGRCV CVEGFTGEDC GEPLCLNNCN NRGRCVENEC
     VCDEGYTGED CGELICPNDC FDRGRCINGT CYCEEGYTGE DCGELTCPNN CNGNGRCENG
     QCLCDEGFVG DDCSERRCPK DCNKRGRCID GQCVCNEGFE GIDCGQVKCP KDCNNRGQCV
     NGQCVCNEGF MGEDCGVLRC PKDCSNHGRC INGQCECDEG FMGEDCSELK CPNDCHNRGR
     CVKGHCVCDE GFIGEDCGEL RCPNDCYNRG RCVKGICVCD EGFVGNDCSE LRCPNDCSKH
     GRCVNGQCVC DEGYTGEDCS ELRCPNDCHN RGRCVDGECL CEKGFTGVDC GELACLDNCN
     NRGRCENGQC VCNEGFTGID CSQLRCANDC NNQGRCIEGQ CVCDEGFTGE DCSQRSCLNN
     CNNLGRCVDG RCICDNGYIG DDCSDVSPPS DLTVTNVTDK TVNLEWKNEN LVNQYLITYV
     PTSIGGLDMQ FTVPGNQTSA TIRELEPGVE YFIRVFAILK NKKSIPISAR VATYLPAPEG
     LKFKSVRETS VQVEWDPLNI SFDGWELIFR NMKKEDNGDI TRSLTRPETS YMQPGLAPGQ
     QYNVSLHIVK NNTRGPGLSK VITTKLDAPS QIEAKDVTDT TALITWFKPL AEIDGIELTY
     GPKDVPGDKT TIDLSEDENQ YSIGDLKPFT EYEVVLISRR GDMESDPMSE VFLTDLDAPK
     NLKRVSQTDN SITLEWKKSQ ADIDSYRVKF APISGGDHAE ITVPKSNQAT TKATLTGLRP
     GTEYGIGVTA VKEDTESAPA TINAGTDLDS PKDLGVNNPT ETSLSLSWRR PLAKFDRYRL
     TYVTPSGRKN EVEIPADSTS YILQGLDAGT EYTTSLIAEK GRHKSKPTTV KSSTAVYSLE
     LMQGATPGPD EHSGFWSSPA PEASGMWEAT LWDLLVSNVT DHGFGLSWKA DAGAYTSFVV
     EYEEVTLAAG PPAEVFMPGE SLGAVIDGLQ ANATYKVKVY GMAGGQRSHP LEAVATTALL
     VTAPGESNNE TSFNSTYPAP TELSYLTIPS TDSHTVLSAP TAFFISDVTA QLHDLKVTGR
     TFSSFTLTWA AQDEVFDQFF ITLKDLSSLN RTLEIFLPGS QRETEFTNLT AGTQYQINLH
     GSARGQLSQS LEAITNTAEE PELGNLTVSE VSWDGFQPTW TAADGAFETF VLQVQESDNP
     EEVQNHTVPG GLRFVNITAL KDYTRYNITL YGVIQGYRTK PLSAETTTAM RAEVGELNVS
     GITPEGFNLS WTATERAFEI FTIEIINSNR FLEPMEYNIS GNLRTAHISG LSPSTDYIVY
     LYGITPGFRT QAISVAVTTV DEPLLSKLTV SNATSNSVSL TWEAQDTAFD HFVLEVRNSD
     LPLDSLVHTV SGASRSFVIT NLRAATNYTV QLHGLVDGQS GQTLTAVATT EAEPQLGTLT
     LTNVTPDSFN LSWTTRNGPF AKFVINVRDS YSAHEPQELT VSGGARSAHI SGLVDYTGYD
     INIKGTTSAG VHTEPLTAFV MTEAMPPLEN LTVSDINPYG FTVSWMASEN AFDNFLVIVV
     DSGKLLDPQE FLLTGTQRHL KLKGLITGIG YEVMLYGFAK GRQTKPLSIV AITEAEPEVD
     NILVSDITPD GFRLSWTADD GVFDSFFIKI RDTKKQSDPW ERIVPGHERT QDITGLKEGT
     EYEIELYGII SGRRSQPINA IAITAMGSPK GINFTDITEN SATVTWMLPR TRVESFRILY
     APKTGGTPNI VTVDGTKTRT KLVKLVPGVE YVVNITSVKG FEESEPVSGP LKTALDSPSG
     LVVVNITDSE ALATWQPAIA AVDNYVISYI SETEPEVKQM VSGNTVEYDL NGLRPATEYV
     LRLHAVEDGQ QSATISTKFT TGMDAPRDLR AIDIQSETAV LTWRPPRASL TGYLLIYESA
     DGKVKEVILV PETTSYDLTE LSPSTQYTVK LRALHRSLKS NIIQTVFTTS GLLYPFPKDC
     SQALLNGETA SGLYTIYLNG EKSQPLEVYC DMGSDGGGWI VFLRRRNGKE DFYRNWRTYS
     AGFGDPKDEF WIGLENLHKI TSQGQYELRV DLRDKGETAF AIYDKFSVGD SKTRYRLKVD
     GYTGTAGDSM TYHNGRSFST YDKDNDFAIT NCALSYKGAF WYKNCHRVNL MGRYGDNSHS
     QGINWFHWKG HEYSIEFAEM KLRPSSFRNL EGRRKRA
//
DBGET integrated database retrieval system