ID Q4RJ71_TETNG Unreviewed; 971 AA.
AC Q4RJ71;
DT 19-JUL-2005, integrated into UniProtKB/TrEMBL.
DT 19-JUL-2005, sequence version 1.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=(spotted green pufferfish) hypothetical protein {ECO:0000313|EMBL:CAG11561.1};
GN ORFNames=GSTENG00033558001 {ECO:0000313|EMBL:CAG11561.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|EMBL:CAG11561.1};
RN [1] {ECO:0000313|EMBL:CAG11561.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|EMBL:CAG11561.1}
RP NUCLEOTIDE SEQUENCE.
RG Genoscope;
RG Whitehead Institute Centre for Genome Research;
RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAG11561.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAAE01015039; CAG11561.1; -; Genomic_DNA.
DR AlphaFoldDB; Q4RJ71; -.
DR KEGG; tng:GSTEN00033558G001; -.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 2.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 8.
DR SMART; SM00038; COLFI; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 653..857
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 1..257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 295..321
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 384..428
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 584..620
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 20..34
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 598..614
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:CAG11561.1"
SQ SEQUENCE 971 AA; 101392 MW; F8C28A4F9829C122 CRC64;
GEPGEEGQPG VIGEPGLKVS NTHISDQFKS STFSAPYILG DIGPPGHEGE QGQEGIRGPP
GPPGEDGPQG KDGPKGDPGE QGPPGEPGDK GIEGDPGPLG PPGEPGKQGF RGPEGKPGPP
GNRGRHGKKG ERGLSGALGE IGERGDAGQP GEPGPKGARG TRGAPGQPGV MGMEGQPGLP
GYTGHPGQPG PIGPPGAKGE KGYPGEDNKT PGPPGPLGEP GPPGERGDRG EPGDEGYQGH
SGLTGSRGAA GPQGPTGMAL KQLQDSLNAS NVLTVAACII VSFLCIFQGP PGLPGEPGPK
GETGTPGSAG KKGGRGQVGA PGVEVSITQY DSKKIEEESQ IRLLCDVPVI STFQGPAGLP
GLKGMKGYPG LEGPPGLTGL PGLPGKPGRK GQRGVIGKEG FVGRPGPRAD RGLPGLPGER
GPPGQKVGNT KTDCTDCIWK YFCIHVELLS LSQGDPGLPG DRGTPGLKGM VGATGDQGRK
GERGAKGQLS VARKRSCLEE VDGWLQKQHL PRWREDGAQT RSADIKPPNC ASRTCCLDKG
EVGNVGMTGL SGFPGPKGPN GDVGFRGLLG PKGPMGTLGF TGPVGPEGLM GPMGKPGPQG
PQGNPGPRGP PGAPGPSRQI YDDSAVYTLS DSSVAFRTES ILGAEISLPD QNTEILKTLR
HLSTVIERIK KPLGTEENPA RVCKDLLDCR HKPDDGWFWI DPNLGCTSDA FKVFCNFTAG
GQTCLHPVAT DKMVFGVGKV QMKFLHLLST EASQSILLHC LNDLPGRPPD SVGSTGSASH
ENSTLRFRGW NKQMFEKDTL LEPHVLQDDC KTTLNPRPNL KERAGPLDYR FLEDTGRFAD
GATRDSLVQT PRTTLSKLSA CARRMNITLR FLPVSFTRSR ENSTFFLTRV SAGCHLRQTT
DQSGKKQFHQ HCKHCHHSQP HAAFWLCVTF QTIEEAETAT PSSARGGEFR ASLWIAISRV
INVGVDRNNI R
//