ID A0A060XSV0_ONCMY Unreviewed; 930 AA.
AC A0A060XSV0;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=FRAS1-related extracellular matrix protein N-terminal domain-containing protein {ECO:0000259|Pfam:PF19309};
GN ORFNames=GSONMT00047603001 {ECO:0000313|EMBL:CDQ82372.1};
OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC Salmonidae; Salmoninae; Oncorhynchus.
OX NCBI_TaxID=8022 {ECO:0000313|EMBL:CDQ82372.1, ECO:0000313|Proteomes:UP000193380};
RN [1] {ECO:0000313|EMBL:CDQ82372.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24755649; DOI=10.1038/ncomms4657;
RA Berthelot C., Brunet F., Chalopin D., Juanchich A., Bernard M., Noel B.,
RA Bento P., Da Silva C., Labadie K., Alberti A., Aury J.M., Louis A.,
RA Dehais P., Bardou P., Montfort J., Klopp C., Cabau C., Gaspin C.,
RA Thorgaard G.H., Boussaha M., Quillet E., Guyomard R., Galiana D., Bobe J.,
RA Volff J.N., Genet C., Wincker P., Jaillon O., Roest Crollius H.,
RA Guiguen Y.;
RT "The rainbow trout genome provides novel insights into evolution after
RT whole-genome duplication in vertebrates.";
RL Nat. Commun. 5:3657-3657(2014).
RN [2] {ECO:0000313|EMBL:CDQ82372.1}
RP NUCLEOTIDE SEQUENCE.
RA Genoscope - CEA;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FR905964; CDQ82372.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A060XSV0; -.
DR STRING; 8022.A0A060XSV0; -.
DR PaxDb; 8022-A0A060XSV0; -.
DR Proteomes; UP000193380; Unassembled WGS sequence.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR045658; FRAS1-rel_N.
DR PANTHER; PTHR45739:SF3; FRAS-RELATED EXTRACELLULAR MATRIX PROTEIN 1B PRECURSOR; 1.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF16184; Cadherin_3; 4.
DR Pfam; PF19309; Frem_N; 1.
DR PROSITE; PS51854; CSPG; 4.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000193380};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..930
FT /note="FRAS1-related extracellular matrix protein N-
FT terminal domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001591316"
FT DOMAIN 28..251
FT /note="FRAS1-related extracellular matrix protein N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF19309"
FT REPEAT 282..376
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 401..488
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 509..621
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 804..895
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
SQ SEQUENCE 930 AA; 104328 MW; 544E20EC168993A4 CRC64;
MAVARVHSWM LLTLMAVLPH ACAQQSLVQV NSGVQVGRGR SVFVTDKELQ FNVDQTSDCK
VEVVLNEPVT QRVGKLTPQV FDCHFLPDEV KYIHNGSPLL EEDTVMLRVY RFTDSDTVVE
TVVLRMIVVE PESSLVELGS TPLVVPRFYG LSNAIDGGIL SIRSPPDVAC TVRLLTPETS
VPSLGQLVTE DDTGLRKGIK HCHLLFLSLP CLHGTKEVRF LKATCHEFLS LALKYQHLSP
PSPEIDYIPI RVELRDQASR ALLETEAVWL PVLIHGAMQN QPPQAAFMAS FILEADQFIL
TPLTTAALDA KDGETPQERL VFNVTKPPAQ GYIAHLDDHT KTCHSFIWQD LHEMKIAYQP
PNSSHTGRRN YEVEFQAIDG SYVASPPIMV HFSVRAAETN APRVSWNMGL DLLEGQSRPI
TWEDLQIVDS DNIDAVYLVA VDGPLHGRLS VRGGKGFMFR IRDLQEGVVV YHHSDSDTTR
DHIVFRITDG RHSIRHKFPI NILPKDDTAP FLINNVALEV QEGGEVLVEE YMLLASDLDS
SDDYILYQLL TFPRAGEVVK KAHPQQPGNR ISSQILHCYN CLITDSVPVK SFLQRDLFMG
MIYYRHSGEE CFEDTFDFTL SDSHQPPNLS HRHTVVIHVF PVKDQLPVEV SGSVRSVTVR
ETDVVYLTQS HIHFRDTEHP DTDLSFFITT PCFSPTRPGL PDAGRLFYTD STHSMKKDPM
VPVLKSFTQV RSPIYCANTL SCISNMCCVR QHAVNHMKVG YMPPIEDIGP EPLFVQFLFS
VSDHHGGTTT DLLFNVTVTP VDNQAPEVFT NLLRVEEGGG AFFTEEHLLV RDGDSLEDKL
RVGVQTTPIH GKLELQGREL RQGDTFILQD LRGLRVRYIH DDSETVEDTV GLTVTDGVNS
AHVFLPVQVS RLYTLPFKRL GSLRNVLVFE
//