ID Q4S0I2_TETNG Unreviewed; 1438 AA.
AC Q4S0I2;
DT 19-JUL-2005, integrated into UniProtKB/TrEMBL.
DT 19-JUL-2005, sequence version 1.
DT 27-MAR-2024, entry version 81.
DE SubName: Full=(spotted green pufferfish) hypothetical protein {ECO:0000313|EMBL:CAG05850.1};
GN ORFNames=GSTENG00026005001 {ECO:0000313|EMBL:CAG05850.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|EMBL:CAG05850.1};
RN [1] {ECO:0000313|EMBL:CAG05850.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|EMBL:CAG05850.1}
RP NUCLEOTIDE SEQUENCE.
RG Genoscope;
RG Whitehead Institute Centre for Genome Research;
RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAG05850.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAAE01014781; CAG05850.1; -; Genomic_DNA.
DR KEGG; tng:GSTEN00026005G001; -.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 18.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1240..1438
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 72..98
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 117..624
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 647..775
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 815..1141
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1187..1231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 152..167
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 179..193
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1210..1229
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:CAG05850.1"
SQ SEQUENCE 1438 AA; 141781 MW; 452E83875C703694 CRC64;
GCGPACGKCD CSGVKGAKGE RGYPGLQGNM GFPGMQGPEG PPGPMGTKGD LGEAGAPGLK
GVRGPPGLPG FPGNPGLPGI NGNDGPAGPP GHPRSCWSEG GKWASRTYRA NRTSWLSRTQ
RGEGRPRTTR FCFPGPQGEA GLPGLNIEGP RGEKGLKGDR GEKGDMGVEG ESLFGPPGQP
GIPGLPGPPG EPINPNECDI ERGAPGPPGP PGLQGELGQK GDKGDTCVQC ESSGPPGLPG
PQGSKGEHGP PGSVGVKGEK GDPGAAGQPG RPGSPGIPGL MGAPGAVGEP GDIYLAPGLK
GERGLPGVPG SPGRPGQDGE PGRAGIPGIP GPKGEPAKEG IKGERGPSGD PGFLGPPGER
GPPGVPGFGR PGEPGEKGSQ GQPGFPGTPG PPGAKGEPGS GVGSPGPQGV PGRQGERGIP
GLQGERGLPG DEGFPGFPGQ KGELGPPGIG LPGPTGSKGI SGVPGSAGSP GEPGKQGKDG
LPGPPGLHGQ KGEPGQGLPG PKGSQGIPGI TGYPGEKGNI GLPGIPGQDG KMGPLGPQGV
KGEVGPPGPP GLTGLQGSPG KGTPGLPGPQ GPPGEPGPFG EDGVKGEKGY PGPPGLDMPG
PKGEKGTPGF PGSPGSKGQQ GVVGLQGRDG LIGAQGQKGE MGIMGTPGIP GFPGPPGQPG
SPGHRGDPGV TGPRGAIGEN GTKGDRGDAG LPGPPGNISS FEMEHMKGQK GDIGIKGNPG
STGQKGAVGI PGEPGLRGRD GEPGLPGQPG EKGDSGFPGE PGVMGPPGQK GSLGEMGLPG
MITQLMHTLC FGKCFIKMLI LVYLCTGGMG PKGTKGDFGT PGHPGSKGTD GPKGEKGIAG
QPGIGIPGPP GERGEKGQPG FQGLPGEKGV RGFEGMPGKP GLEGIKGDKG SIGRTGQPGR
PGEKGVSGLP GPEGKRGFDG RPGEGGMQGP PGTPGQKGEA GVDGIPGSSG DRGDPGVPGR
GLPGQPGVNG TKGEKGSPGF PGTPGHPGVP GTKGDKGIPG TAGPQGETGE RGLPGISLEG
PKGERGQPGQ PGEPGSQGLP GPPGVQGRHG PKGETGEQGI PGFPGESGQK GETGVPGFPG
PSGLPGIDGQ KGEQGQQGVP GFPGNDGPPG PLGPHTFIKG EAGSPGNPGL PGPQGPAGFP
GPKGLQGDLV FFFFLIEKQF YFTLPGSHLQ SHIWYFFPGV TGVSGPKGEE GFPGIDGQPG
GKGEPGLPGP QGPRGHPGPP GPDGVPGQVG PPGPSSMYHG FLVTRHSQTV DVPQCPEGTS
LIYDGYSLLY VQGNERSHGQ DLGTAGSCLR KFSPMPFLFC DINNVCNFAS RNDYSYWLTS
PEPMPMSMAP ITGQSIKPFI SRCAVCEAPA MVIAVHSQTI VIPQCPYGWD SLWIGYSFVM
HTSAGAEGSG QALASPGSCL EEFRSAPFIE CHGRGTCNYY ANSYSFWLAT IEDSDMFT
//