GenomeNet

Database: UniProt
Entry: Q4S0I2_TETNG
LinkDB: Q4S0I2_TETNG
Original site: Q4S0I2_TETNG 
ID   Q4S0I2_TETNG            Unreviewed;      1438 AA.
AC   Q4S0I2;
DT   19-JUL-2005, integrated into UniProtKB/TrEMBL.
DT   19-JUL-2005, sequence version 1.
DT   27-MAR-2024, entry version 81.
DE   SubName: Full=(spotted green pufferfish) hypothetical protein {ECO:0000313|EMBL:CAG05850.1};
GN   ORFNames=GSTENG00026005001 {ECO:0000313|EMBL:CAG05850.1};
OS   Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS   nigroviridis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX   NCBI_TaxID=99883 {ECO:0000313|EMBL:CAG05850.1};
RN   [1] {ECO:0000313|EMBL:CAG05850.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=15496914; DOI=10.1038/nature03025;
RA   Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA   Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA   Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA   Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA   Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA   Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA   Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA   McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA   Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA   Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA   Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT   "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT   early vertebrate proto-karyotype.";
RL   Nature 431:946-957(2004).
RN   [2] {ECO:0000313|EMBL:CAG05850.1}
RP   NUCLEOTIDE SEQUENCE.
RG   Genoscope;
RG   Whitehead Institute Centre for Genome Research;
RL   Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Type IV collagen is the major structural component of
CC       glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC       together with laminins, proteoglycans and entactin/nidogen.
CC       {ECO:0000256|ARBA:ARBA00003696}.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CAG05850.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAAE01014781; CAG05850.1; -; Genomic_DNA.
DR   KEGG; tng:GSTEN00026005G001; -.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 18.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          1240..1438
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          72..98
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          117..624
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          647..775
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          815..1141
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1187..1231
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        152..167
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        179..193
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1210..1229
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:CAG05850.1"
SQ   SEQUENCE   1438 AA;  141781 MW;  452E83875C703694 CRC64;
     GCGPACGKCD CSGVKGAKGE RGYPGLQGNM GFPGMQGPEG PPGPMGTKGD LGEAGAPGLK
     GVRGPPGLPG FPGNPGLPGI NGNDGPAGPP GHPRSCWSEG GKWASRTYRA NRTSWLSRTQ
     RGEGRPRTTR FCFPGPQGEA GLPGLNIEGP RGEKGLKGDR GEKGDMGVEG ESLFGPPGQP
     GIPGLPGPPG EPINPNECDI ERGAPGPPGP PGLQGELGQK GDKGDTCVQC ESSGPPGLPG
     PQGSKGEHGP PGSVGVKGEK GDPGAAGQPG RPGSPGIPGL MGAPGAVGEP GDIYLAPGLK
     GERGLPGVPG SPGRPGQDGE PGRAGIPGIP GPKGEPAKEG IKGERGPSGD PGFLGPPGER
     GPPGVPGFGR PGEPGEKGSQ GQPGFPGTPG PPGAKGEPGS GVGSPGPQGV PGRQGERGIP
     GLQGERGLPG DEGFPGFPGQ KGELGPPGIG LPGPTGSKGI SGVPGSAGSP GEPGKQGKDG
     LPGPPGLHGQ KGEPGQGLPG PKGSQGIPGI TGYPGEKGNI GLPGIPGQDG KMGPLGPQGV
     KGEVGPPGPP GLTGLQGSPG KGTPGLPGPQ GPPGEPGPFG EDGVKGEKGY PGPPGLDMPG
     PKGEKGTPGF PGSPGSKGQQ GVVGLQGRDG LIGAQGQKGE MGIMGTPGIP GFPGPPGQPG
     SPGHRGDPGV TGPRGAIGEN GTKGDRGDAG LPGPPGNISS FEMEHMKGQK GDIGIKGNPG
     STGQKGAVGI PGEPGLRGRD GEPGLPGQPG EKGDSGFPGE PGVMGPPGQK GSLGEMGLPG
     MITQLMHTLC FGKCFIKMLI LVYLCTGGMG PKGTKGDFGT PGHPGSKGTD GPKGEKGIAG
     QPGIGIPGPP GERGEKGQPG FQGLPGEKGV RGFEGMPGKP GLEGIKGDKG SIGRTGQPGR
     PGEKGVSGLP GPEGKRGFDG RPGEGGMQGP PGTPGQKGEA GVDGIPGSSG DRGDPGVPGR
     GLPGQPGVNG TKGEKGSPGF PGTPGHPGVP GTKGDKGIPG TAGPQGETGE RGLPGISLEG
     PKGERGQPGQ PGEPGSQGLP GPPGVQGRHG PKGETGEQGI PGFPGESGQK GETGVPGFPG
     PSGLPGIDGQ KGEQGQQGVP GFPGNDGPPG PLGPHTFIKG EAGSPGNPGL PGPQGPAGFP
     GPKGLQGDLV FFFFLIEKQF YFTLPGSHLQ SHIWYFFPGV TGVSGPKGEE GFPGIDGQPG
     GKGEPGLPGP QGPRGHPGPP GPDGVPGQVG PPGPSSMYHG FLVTRHSQTV DVPQCPEGTS
     LIYDGYSLLY VQGNERSHGQ DLGTAGSCLR KFSPMPFLFC DINNVCNFAS RNDYSYWLTS
     PEPMPMSMAP ITGQSIKPFI SRCAVCEAPA MVIAVHSQTI VIPQCPYGWD SLWIGYSFVM
     HTSAGAEGSG QALASPGSCL EEFRSAPFIE CHGRGTCNYY ANSYSFWLAT IEDSDMFT
//
DBGET integrated database retrieval system