ID Q4SP98_TETNG Unreviewed; 981 AA.
AC Q4SP98;
DT 19-JUL-2005, integrated into UniProtKB/TrEMBL.
DT 19-JUL-2005, sequence version 1.
DT 27-MAR-2024, entry version 101.
DE SubName: Full=(spotted green pufferfish) hypothetical protein {ECO:0000313|EMBL:CAF97534.1};
GN ORFNames=GSTENG00014946001 {ECO:0000313|EMBL:CAF97534.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|EMBL:CAF97534.1};
RN [1] {ECO:0000313|EMBL:CAF97534.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|EMBL:CAF97534.1}
RP NUCLEOTIDE SEQUENCE.
RG Genoscope;
RG Whitehead Institute Centre for Genome Research;
RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAF97534.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAAE01014542; CAF97534.1; -; Genomic_DNA.
DR AlphaFoldDB; Q4SP98; -.
DR KEGG; tng:GSTEN00014946G001; -.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00054; EGF_CA; 5.
DR CDD; cd00110; LamG; 3.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR24049; CRUMBS FAMILY MEMBER; 1.
DR PANTHER; PTHR24049:SF22; DROSOPHILA CRUMBS HOMOLOG; 1.
DR Pfam; PF00008; EGF; 3.
DR Pfam; PF12661; hEGF; 2.
DR Pfam; PF02210; Laminin_G_2; 3.
DR SMART; SM00181; EGF; 7.
DR SMART; SM00179; EGF_CA; 5.
DR SMART; SM00282; LamG; 3.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 5.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00022; EGF_1; 6.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 7.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}.
FT DOMAIN 7..49
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 78..261
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 284..456
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 458..494
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 522..708
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 710..746
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 748..784
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 786..822
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 864..901
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 903..939
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 484..493
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 736..745
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 752..762
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 774..783
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 812..821
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 891..900
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 929..938
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:CAF97534.1"
SQ SEQUENCE 981 AA; 105725 MW; DA82EF5B24C2F5CB CRC64;
GTFCRGITSG CEQQPCQNGG VCESHGDGFR CLCSQQSQNG RLYGGGTCAT PLSGCDGDQC
ENGGICTPSL IGPNCETSTV FSFESRGYVH VKTQHGGPDA PLNVTFSFRS ERAVGTLVQQ
RVDDLVLSIE LFEGRLCLRS LRGQGSSTLV QELPEVLSDS GWHRVEASLG GVVSFIRLIC
SGRTCAGPAA ARALLQEQPG ALPSPGEGGL YIGGGARGWD GAARSAPFLG CFRDVFVDSR
LVVPGRGPGG AEAQANVTAG CSDRDKCEDN PCRNRGRCVK YVTARFGNNG EESYAVFSLD
DDPGPTATVS MFIRTRRPSG LLLVLANSTS QYLRLWLEDG RVKVQVNTFE TFWGRGRVDD
GHVHLLSLRL EATECFLFLS AQSQGSLRIR PIRAQTGDQV LVGGLPDARA SAMFGGYFKG
CVQDLRIDSK RLQFFPLAPP VESYRLEKLA GVAQGCSGDD ACAAEPCLNG GVCYSMWDDF
ICNCPPHTAG QRCQEVKWCE LSPCPDATTC QPRSQGFECV SNATFRFESG VFRYRSSGRI
RRRLASVSLS FRSRQPNATV LHAHKDAHHL TLSLLDSHLV MELRAGTQGA TLRSRAPLSD
GRWHRVELRL EEPALPASGW VMAADGEEES RSASAASAGE LEFVTEGADV VLGGLGWEAG
ASFSGCLGPV EIGGLLLPFH LHSELKLPRP QEEAFTLASG GSAPRRGCWG ARVCAPDPCQ
NDGACEDLFD LHRCRCPPQW TGPVCREPAD PCVSGPCVHG NCTNLPAGGF GCACQAGHSG
ERCEVEVDAC QDSKCVNGAT CLKGVQSYSC LCPQNLTGQY CRWVPPPPAQ APVLALLTPK
LVSYSEKVPD IPWYIDIDPL PLLPAARCRG ARWNYSCFNG GNCSGLDGCD CLPGFTGHWC
EKDVDECASA PCMNGGFCLN YVNSFECVCD LNFSGVHCQM DVSDFYLYVF LSLWQNLFQL
MSYLVIRLDD EPEIDLGFQL D
//