ID Q4RU98_TETNG Unreviewed; 3019 AA.
AC Q4RU98;
DT 19-JUL-2005, integrated into UniProtKB/TrEMBL.
DT 19-JUL-2005, sequence version 1.
DT 27-MAR-2024, entry version 115.
DE SubName: Full=(spotted green pufferfish) hypothetical protein {ECO:0000313|EMBL:CAG08034.1};
DE Flags: Fragment;
GN ORFNames=GSTENG00028894001 {ECO:0000313|EMBL:CAG08034.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|EMBL:CAG08034.1};
RN [1] {ECO:0000313|EMBL:CAG08034.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|EMBL:CAG08034.1}
RP NUCLEOTIDE SEQUENCE.
RG Genoscope;
RG Whitehead Institute Centre for Genome Research;
RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the fibrillin family.
CC {ECO:0000256|ARBA:ARBA00008972}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAG08034.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAAE01014995; CAG08034.1; -; Genomic_DNA.
DR KEGG; tng:GSTEN00028894G001; -.
DR GO; GO:0001527; C:microfibril; IEA:UniProt.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0048513; P:animal organ development; IEA:UniProt.
DR CDD; cd00054; EGF_CA; 26.
DR Gene3D; 2.10.25.10; Laminin; 36.
DR Gene3D; 3.90.290.10; TGF-beta binding (TB) domain; 9.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR040872; Fibrillin_U_N.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR017878; TB_dom.
DR InterPro; IPR036773; TB_dom_sf.
DR PANTHER; PTHR24034; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24034:SF202; FIBRILLIN 3; 1.
DR Pfam; PF12662; cEGF; 3.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF07645; EGF_CA; 27.
DR Pfam; PF18193; Fibrillin_U_N; 1.
DR Pfam; PF12661; hEGF; 2.
DR Pfam; PF00683; TB; 9.
DR PIRSF; PIRSF036312; Fibrillin; 9.
DR SMART; SM00181; EGF; 37.
DR SMART; SM00179; EGF_CA; 35.
DR SUPFAM; SSF57196; EGF/Laminin; 12.
DR SUPFAM; SSF57184; Growth factor receptor domain; 9.
DR SUPFAM; SSF57581; TB module/8-cys domain; 9.
DR PROSITE; PS00010; ASX_HYDROXYL; 32.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 19.
DR PROSITE; PS50026; EGF_3; 32.
DR PROSITE; PS01187; EGF_CA; 15.
DR PROSITE; PS51364; TB; 9.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 147..179
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 185..228
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 247..288
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 289..330
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 362..415
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 515..554
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 555..596
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 626..666
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 667..707
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 713..758
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 830..867
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 872..913
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 914..949
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 958..997
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 1017..1058
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1063..1103
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 1261..1302
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1303..1340
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1345..1385
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1386..1424
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1427..1468
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1469..1505
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1510..1550
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1555..1604
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 1694..1735
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1736..1773
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1811..1863
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 1881..1922
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1923..1964
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1965..2002
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2007..2042
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2046..2088
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2089..2128
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2155..2196
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2206..2248
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 2253..2294
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2295..2334
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2400..2443
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2444..2485
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2490..2543
FT /note="TB"
FT /evidence="ECO:0000259|PROSITE:PS51364"
FT DOMAIN 2555..2592
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 1664..1693
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2650..2731
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1664..1686
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2661..2712
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 151..161
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 169..178
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 519..529
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 918..928
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2011..2021
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:CAG08034.1"
FT NON_TER 3019
FT /evidence="ECO:0000313|EMBL:CAG08034.1"
SQ SEQUENCE 3019 AA; 326228 MW; F2E77F9C483FA7CD CRC64;
PNVCGSRFHS YCCPGWKTLP GGNQCIVRKS SAAPASSQHN NLQEFVRRWL LLQTQHVHVL
QWTALPQLWI RSSCLQAGGP TEAWERPEAF DSLRLPPRPA VGAQKASGTA CWRSSAVQSC
NIRCMNGGSC AEDSCTCPKG YTGSHCGQPV CENGCQNGGR CIGPNRCACV YGFTGPQCER
DYRTGPCFTQ VSHQMCQGQL SGIICTKTLC CATIGRAWGH PCEQCPAQPH PCRRGFIPNI
RTGACQDVDE CQAVPGLCAG GNCINTVGSY ECKCPAGHRQ SDTSHKCEDI DECSTIPGVC
DGGECTNTAG SYVCSCPRGY ISSTDGSRCV GEQVTCGGAI GSRSQTRRKR RVNVEIRDQR
VGTCFSALAN GRCAAELNGQ YTKMQCCCDT GRCWALGQIP EMCPVRGSEE FRRLCIVGVP
QGHGIPNVYP NGNFPTYGFK LPSVPNGNGH GGNGGGGGNG GNGGGFGSAS VNESIDVCKH
FTNLCLNGRC IPIHASYRCE CNMGYKQDVR GECIDVDECV SNPCINGDCV NTPGSYHCKC
HEGYQGTPTK QACIDIDECI VNGVMCRNGR CVNTEGSFQC ICNAGFELTP DGKNCIGEPQ
NRVALHGNPD FYKLDLNFSS ASPAQDHDEC ATTNMCLNGM CINEDGSFKC ICKPGFALAP
NGRYCTDIDE CQTPGICMNG RCINSEGSFR CECPPGLAIG MDGRVCVDTH MRTTCYGAIK
MGTCSRPFPG AVTKSECCCA DPEHGFGEPC YPCPSRNSGS HTPLVPDIPI WKHRRKRSFP
SLTDFSIPQL SSRPCATVVS ASRQTAEVSV AALFPCACVH GHQTSCSLLD INECALDPDI
CLNGICENLR GSYRCICNIG YESDTSGKSC VDINECLVNR LLCDNGLCRN TPGSYTCSCP
KGFIFKPDSE TCEDINECLS SPCVNGVCRN VAGSFNCECS LGSKLDSTNT ICVDSMKSTC
WLTIQDSRCE VNINGATLKS ECCSTLGAAW GSPCEPCEID TSCSRGFARM KGLVCEDINE
CEVFPGVCTN GHCVNTQGSF RCECAEGLTL DSTGRTCVDL RSEQCYLKWH EDECGEPLPG
RYRVDMCCCT VGAAWGVDCE ECPKTGIPGI PSNLPPRERF RQQGGHSDRE ALLQRHRRVS
HLSGPVRPRH LRQHSRQLRV RVFRGLRERL HDDEELHGRE RVPAERKPVQ ERSVRQHGRN
LPVFLRHRLP GHARPPELRR YRRMHHHERR LRNPLHQLRG QLRVQLQREY ALMPDLRTCA
DIDECEETPD ICDGGQCTNI PGEYRCLCYD GFMASMDMRT CIDVNECDLN PNICLHGDCE
NTKGSFICHC QLGYFVNKGS TGCTDVDECE IGAHNCAMHA ACINVPGSFK CRCRDGWVGD
GIECLDQDEC AGEDHNCNLN ADCLNTPGSY RCACKDGFVG DGFSCSDMDE CADNVNLCEN
GQCLNAPGGY RCECEMGFTP TEDSRACQDI DECNFQNICV FGSCQNLPGM FRCVCDDGYE
LDRSGGNCTD INECADPMNC INGLCVNTPG GYMCNCPEDF ELNPTRVGCV DTRVGNCFLD
TNIRGDGGIS CSLEIGVGVT RASCCCSLGG AWGNPCELCP PSNSCNANEQ IHLTSEVRLG
PPEMEIRANA PVLSAAEYKT LCPGGEGFRP NPITVILEGK CLSGRSPRQS QTVPDSPRQS
QTVPDRHGVN PPPDIDECQE LPGLCQGGIC INTFGSFQCE CPAGYYLNEE TRVCDDIDEC
VSSIGICGPG TCYNTQGNYT CVCPPEYMQV NGGNNCMGER RGTWRPSLSF SQKEMLSPWL
LPLLGADMRK SVCYRNFNDT CENELSFNMT KMVCCCSYNV GKAWNKPCEA CPAPASSDYR
VLCGNQAPGF IIDIHTGKPI DIDECRDIPG LCAHGVCINQ IGSFRCECPM GFSYNNILLI
CEDIDECSSG DNLCQRNANC INIPGSYRCQ CSPGFKLSPN GACVDRNECQ EIPNVCSHGS
CIDTQGSYRC ACHNGFKATA DQSMCMDIDE CDRQPCGNGT CKNTVGSYNC LCFPGFELTH
NNDCMDIDEC SALQGQVCRN GQCINGLGFF QCFCHEGYEN TPDEKNCVDV NECVRLPGTC
SPGTCQNLDG SFRCICPPGY EVQNDQCIGE THIRALDVQS RGAEDSTPTR FPPPDINECE
VESNTCQFGT CTNTPGSFQC TCQPGFVLSD NKRRCYGINL THTRESFCFT KFDAGKCSVP
KALNTTKAKC CCSKMPGEGW GLPCELCPRS NEDTNECLDN PGVCQNGICI NTDGSFRCEC
TFGYNLDYTG VKCVDTDECS IGNPCGNGTC TNVVGGFECL CQEGFEPGPM MTCEGRSADT
PTSHGGSRVL TISPSFPSDV NECSQNPLLC AFRCVNTFGS YECMCPDGYV LRDDQRMCRD
QDECSEGLDD CDSKGMTCKN LIGTFMCICP PGMQRRPDGE GCTDLNECRA KPGICSNGRC
VNTVGSYRCE CNDGFEPSAT GTECIDNRQG YCYTEVLQTM CQQSSTSRIS VTKSECCCNV
GRGWGSQCEL CPLPGTVHFK KMCPLGPGYT TDGRDINECQ VLPDLCRNGQ CVNTIGSFRC
HCNVGYKADF TSTSCVGTAA LGVLLLVLLV LPADSLPLFS CTFQTWTSAP CLRSRVTSCV
RTQRAATSAP APEATACSRT GRPAETWTNA PPSSTTASST ASTPSEASPA SARLDSRSTR
RPVSTTTSAP GPTAAAAPAP PASTRPAASA ASAPRASLWT AQAWSVKMWT SAATTIAVSM
AVRTCWEATA AAVHRATCST TSGTSVWTKM SVRADPWCRS ASCYNTLGSF KCVCPSGFDF
DQTAGGCQDV NECSAGGNPC IYGCSNTDGG YLCGCPGGFY RAGQGHCMSG SGFTGQFTEG
DEEDSLSPEA CYECKINGGG KNGRHKRSAG ENGLKQVPLA AAPGSDSWSV MHSAVSMASV
DTQDAIPMNV SLSSLLNKEP LLELLPALQP LENHVRYVIT HGNANEHFRL LERRDGKSVL
RLGKRPPLPG SYRLEIASL
//