ID Q4SXM5_TETNG Unreviewed; 1739 AA.
AC Q4SXM5;
DT 19-JUL-2005, integrated into UniProtKB/TrEMBL.
DT 19-JUL-2005, sequence version 1.
DT 24-JAN-2024, entry version 89.
DE SubName: Full=(spotted green pufferfish) hypothetical protein {ECO:0000313|EMBL:CAF94607.1};
GN ORFNames=GSTENG00010762001 {ECO:0000313|EMBL:CAF94607.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|EMBL:CAF94607.1};
RN [1] {ECO:0000313|EMBL:CAF94607.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|EMBL:CAF94607.1}
RP NUCLEOTIDE SEQUENCE.
RG Genoscope;
RG Whitehead Institute Centre for Genome Research;
RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAF94607.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAAE01012357; CAF94607.1; -; Genomic_DNA.
DR KEGG; tng:GSTEN00010762G001; -.
DR GO; GO:0005615; C:extracellular space; IEA:InterPro.
DR GO; GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
DR CDD; cd02896; complement_C3_C4_C5; 1.
DR CDD; cd03582; NTR_complement_C5; 1.
DR Gene3D; 1.50.10.20; -; 1.
DR Gene3D; 2.20.130.20; -; 1.
DR Gene3D; 2.40.50.120; -; 1.
DR Gene3D; 2.60.120.1540; -; 1.
DR Gene3D; 2.60.40.1930; -; 3.
DR Gene3D; 2.60.40.1940; -; 1.
DR Gene3D; 6.20.50.160; -; 1.
DR Gene3D; 2.60.40.690; Alpha-macroglobulin, receptor-binding domain; 1.
DR Gene3D; 1.20.91.20; Anaphylotoxins (complement system); 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR InterPro; IPR009048; A-macroglobulin_rcpt-bd.
DR InterPro; IPR036595; A-macroglobulin_rcpt-bd_sf.
DR InterPro; IPR011625; A2M_N_BRD.
DR InterPro; IPR011626; Alpha-macroglobulin_TED.
DR InterPro; IPR000020; Anaphylatoxin/fibulin.
DR InterPro; IPR018081; Anaphylatoxin_comp_syst.
DR InterPro; IPR041425; C3/4/5_MG1.
DR InterPro; IPR048843; C5_CUB.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001599; Macroglobln_a2.
DR InterPro; IPR002890; MG2.
DR InterPro; IPR041555; MG3.
DR InterPro; IPR040839; MG4.
DR InterPro; IPR001134; Netrin_domain.
DR InterPro; IPR018933; Netrin_module_non-TIMP.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR InterPro; IPR008993; TIMP-like_OB-fold.
DR PANTHER; PTHR11412:SF83; COMPLEMENT C5; 1.
DR PANTHER; PTHR11412; MACROGLOBULIN / COMPLEMENT; 1.
DR Pfam; PF00207; A2M; 1.
DR Pfam; PF07703; A2M_BRD; 1.
DR Pfam; PF07677; A2M_recep; 1.
DR Pfam; PF01821; ANATO; 1.
DR Pfam; PF21309; C5_CUB; 1.
DR Pfam; PF17790; MG1; 1.
DR Pfam; PF01835; MG2; 1.
DR Pfam; PF17791; MG3; 1.
DR Pfam; PF17789; MG4; 1.
DR Pfam; PF01759; NTR; 1.
DR Pfam; PF07678; TED_complement; 1.
DR SMART; SM01359; A2M_N_2; 1.
DR SMART; SM01361; A2M_recep; 1.
DR SMART; SM00643; C345C; 1.
DR SUPFAM; SSF49410; Alpha-macroglobulin receptor domain; 1.
DR SUPFAM; SSF47686; Anaphylotoxins (complement system); 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 1.
DR SUPFAM; SSF50242; TIMP-like; 1.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 1.
DR PROSITE; PS50189; NTR; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1739
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004244140"
FT DOMAIN 679..715
FT /note="Anaphylatoxin-like"
FT /evidence="ECO:0000259|PROSITE:PS01178"
FT DOMAIN 1588..1735
FT /note="NTR"
FT /evidence="ECO:0000259|PROSITE:PS50189"
FT REGION 822..867
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1739 AA; 190286 MW; 009C0B348717A999 CRC64;
MKVCVLLVCA FCLCWRTEAE SRSYLITAPL SLRLDAVETV LLQLFGFTSE VKVYVFLKTS
MAPDHVVLAQ EVVTLNTQNR HQAAAKVRLY PGQLQRTASH VILHVQSAEI NQHLSIPVSR
TNGFLFIQTD KPLYTPHQSV KVRAFSLNQE LRPANRSVFL TFKDPDRITV DVAEMIDVNN
GIPSMQNPFR IPIKPKLGIW SIEAAYSGDF TTAARTDFEV KEYVLPSFSI LVKPEQNYIS
FGNFRSFSFQ VSVRYLHGAP VADGVVFLRY GYVSGKAPAV LIPTSVSRER LSSTGEVSVT
VNMEKVLSKH DGPRDLGALV GKHLYIAVLV QEDTGGISEE AEFSAVKFVK SPYRLSLVST
PPFIKPGLPY HISVAVKDHL DKPVKGVVVA LVRRQLFRRG EQEEMPCTPR STSSSSGIAS
SSATLKQADP VLPAASQAQL DLLPLSYHSP NQRYLYIDPP LPGQGLEVGS FANINVYSAT
PTYIPVRALS FLVLSRGKVV DFGSVAFVSS TDHRQTLNFE VTPAMVPSVR LLVYYILFGE
GTSELVADSV WLDVRDKCVN GLQEHKPKDN LQMKIVANQD GLVALSARDS ALFALRPNYK
DPVSTVLRHL EQSDRGCGGG GGRDSADVFR LAGLTFLTNA NAQAATSSKP AEQPVSPQPP
LRPPADVCVR SSFGPLRSCC LQGMRFIPKT LSCLQLSRQR FHKHPRCGDV FRTCCEFVQQ
QLDQDQSLIL GRHELGADFD QEPSLIRSYF PESWMWEVQR VSPGQTSLTR TLPDSLTTWD
VQAVGMFQNG DCQPAEGGGV ASERAGLKLF LVLLRDLRGG SGSGVGQSAA QPGRPGSVPG
GQRRAAGADG VRLQPAGRRR DGTAQLHPRA RLRLPALSRQ CSPLSCQYCV TLLAEAAVCL
TDSQPAPGRA GLRSTGCTWT PLFAGGVGKV AFTVLGLEPG EHRLTFVLKT RRGHKDILEK
KLRVVPEGVR REQLSGGRLD PQGLYGSAAG ARRGGATPTS RWLCSVVAGS EKISVELRNT
LPASRVPNTA VERMLTINGE VLDQVVSVVH SADGLRQLAS LPAGSAEAEL GVLLLQVQLY
RYLETSRHWS VLGADIGKSS GQLKEKIRQG LVSVSSFRRG DSSFSMWVNK GPSTWLTALV
VKILAQVDPV VPVDRQALSE SVAWLIRKSQ QPDGSFEEPS SYRPNRIVAA GAVDRSVFIT
SFVLIALQRA TSINEPILQL RFHADSMRAA AAYISQHAVG VASVYVRAVA TFALTLYDSN
SMAASTLLSS LENLARERGH PAVIRYWQDG ERSSEWLKPD QSSGVTVETT AYVLLSFLLK
GRIHYANPVL TWLTQDHQYG GAFYSGQDTA LTLEALTEYS RVVPRAALSL DISVRYSRKG
PLGRVQLSQS RPVATPIQVT KDDSIHVSTG YGTGVSSVKL KTVYYETTAS TQRCNFDVTV
EVVDPDTANN PRITSPTLVA CAKFKPPPNE LLTESGLTVM KIQLPTGVEP HLEDLKQFQD
GDEPLVSHYE LQGDTVVIQM DAVPSEVFVC VEFRVRSRFV VRGSSVSVLS IYEPQDKGTM
CTKEFSYQEQ KLQRLCVAEQ CQCMAAACAT YRGGVDVTLT AARRTSETCR PQIRYAYKVM
VKSSAAEGDF MTYAATVVEV LKNSDKELEA VSSGTPVELV KRATCTSVEI QENQQYLLMG
SGGSEIRLER SFRYRLPLDS EALLELWPTD CGSPECLDFV SQLDDFALDL QLMSCADAS
//