ID Q4S758_TETNG Unreviewed; 1193 AA.
AC Q4S758;
DT 19-JUL-2005, integrated into UniProtKB/TrEMBL.
DT 19-JUL-2005, sequence version 1.
DT 27-MAR-2024, entry version 110.
DE SubName: Full=(spotted green pufferfish) hypothetical protein {ECO:0000313|EMBL:CAG03524.1};
GN ORFNames=GSTENG00022976001 {ECO:0000313|EMBL:CAG03524.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|EMBL:CAG03524.1};
RN [1] {ECO:0000313|EMBL:CAG03524.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|EMBL:CAG03524.1}
RP NUCLEOTIDE SEQUENCE.
RG Genoscope;
RG Whitehead Institute Centre for Genome Research;
RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAG03524.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAAE01014723; CAG03524.1; -; Genomic_DNA.
DR AlphaFoldDB; Q4S758; -.
DR KEGG; tng:GSTEN00022976G001; -.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008201; F:heparin binding; IEA:UniProtKB-KW.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 3.
DR Gene3D; 4.10.1080.10; TSP type-3 repeat; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR003367; Thrombospondin_3-like_rpt.
DR InterPro; IPR017897; Thrombospondin_3_rpt.
DR InterPro; IPR008859; Thrombospondin_C.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR028974; TSP_type-3_rpt.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR10199; THROMBOSPONDIN; 1.
DR PANTHER; PTHR10199:SF78; THROMBOSPONDIN-1; 1.
DR Pfam; PF00090; TSP_1; 3.
DR Pfam; PF02412; TSP_3; 7.
DR Pfam; PF05735; TSP_C; 1.
DR Pfam; PF00093; VWC; 1.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00209; TSP1; 3.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF103647; TSP type-3 repeat; 3.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 3.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS51234; TSP3; 4.
DR PROSITE; PS51236; TSP_CTER; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00634}; Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Heparin-binding {ECO:0000256|ARBA:ARBA00022674};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1193
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004243392"
FT DOMAIN 294..351
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 625..669
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 706..741
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT REPEAT 765..800
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT REPEAT 862..897
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT REPEAT 898..933
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT DOMAIN 937..1151
FT /note="TSP C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51236"
FT REGION 703..733
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 755..774
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 819..911
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 261..288
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 711..727
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 757..771
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 820..847
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 862..876
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:CAG03524.1"
SQ SEQUENCE 1193 AA; 133256 MW; 6E8781648FCEC7F2 CRC64;
GIFLLLILWT CESTRVAESR DDNSVYDLFE LVQVPRKNHG VTLVKGDDPY SPAYKILDPD
LIPAVPDQAF SDLIDSIRAE RGFLLLLNFK QFKRTRGSLL TVEKKDGSGP VFEIISNGKA
NTLDIVFSTE NKQQVVSIEE EDQAQLYVGC EDVNTAELDA PIQSILTQET PAGARLRIGK
GAVNDRFMGV LQNVRFVFGT TLDAILRNKG CQNSISSETM ILENLNGSSA IRTEYTGHKT
KDLQMVCGFS CEDLFSMFKE LKSLGVVVKE LSNELRQLTD ENKLIKNHIG IHNGVCIHNG
IMHKNKEEWK VDDCTECTCQ NSATVCRKIS CPVISCANAT IPDRECCPHC GTPRDSAEDS
WSPWSEWTHC SVSCGRGIQQ RGRSCDRINN NCEGTSVQTR DCYIQECDKR FKQDGSWSHW
SPWSSCSVTC GAGVITRIRL CNSPTPQFGG KSCTGEGRQT EKCQKSSCPI NGNWGPWSPW
DTCTLTCGGG VQIRKRLCND PEPKYGGKDC VGDAKDIQIC NNKSCPIDGC LSNPCFAGVK
CTSFPDGSWK CGKCPVGYSG NGIKCKDIDE CKEVPDACFE FNGVHRCENT DPGYNCLPCP
PRYSGPQPYG KGVEQAVAKK QICTPRNPCL DGSHECNKNA RCNYLGHFAD PMYRCECKPG
YAGNGHICGE DTDLDGWPNL DLICVENATY HCKKDNCPNL PNSGQEDYDN DGIGDACDND
DDNDGIPDDR DNCPFVYNPR QYDYDRDDVG DRCDNCPYNS NPDQTDTDNN GEGDACAVDI
DGDGILNEKD NCPYVYNVDQ RDTDLDGVGD MCDNCPLEHN PDQVDTDDDR VGDKCDSNQD
IDEDGHQNNL DNCPYIPNAN QADHDKDGKG DACDHDDDND GIPDDKDNCR LAFNPDQLDS
DGDGRGDACK DDFDQDNVPD IYDVCPENFD ISETDFRKFQ MVPLDPKGTS QIDPNWVVRH
QGKELVQTVN CDPGIAVGYH EFNSVDFSGT FFINTERDDD YAGFVFGYQS SSRFYVVMWK
QITQTYWSNK PTKAQGYSGL SIKVVNSTTG PGEHLRNALW HTGNTAGQVR TLWHDPKNVG
WKDFTAYRWH LIHRPRTGLI RVVMYEGKKI MADSGSIYDK TYAGGRLGLF VFSQEMVYFS
DLKYECREKY LRHYVRRDTR MRPFSERPGR LYCTPLEKYS ALREMGSLDN GPV
//