ID A0A060WYY5_ONCMY Unreviewed; 789 AA.
AC A0A060WYY5;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 31.
DE RecName: Full=CUB domain-containing protein {ECO:0000259|PROSITE:PS01180};
GN ORFNames=GSONMT00017914001 {ECO:0000313|EMBL:CDQ72583.1};
OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC Salmonidae; Salmoninae; Oncorhynchus.
OX NCBI_TaxID=8022 {ECO:0000313|EMBL:CDQ72583.1, ECO:0000313|Proteomes:UP000193380};
RN [1] {ECO:0000313|EMBL:CDQ72583.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24755649; DOI=10.1038/ncomms4657;
RA Berthelot C., Brunet F., Chalopin D., Juanchich A., Bernard M., Noel B.,
RA Bento P., Da Silva C., Labadie K., Alberti A., Aury J.M., Louis A.,
RA Dehais P., Bardou P., Montfort J., Klopp C., Cabau C., Gaspin C.,
RA Thorgaard G.H., Boussaha M., Quillet E., Guyomard R., Galiana D., Bobe J.,
RA Volff J.N., Genet C., Wincker P., Jaillon O., Roest Crollius H.,
RA Guiguen Y.;
RT "The rainbow trout genome provides novel insights into evolution after
RT whole-genome duplication in vertebrates.";
RL Nat. Commun. 5:3657-3657(2014).
RN [2] {ECO:0000313|EMBL:CDQ72583.1}
RP NUCLEOTIDE SEQUENCE.
RA Genoscope - CEA;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00059}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FR904842; CDQ72583.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A060WYY5; -.
DR STRING; 8022.A0A060WYY5; -.
DR PaxDb; 8022-A0A060WYY5; -.
DR Proteomes; UP000193380; Unassembled WGS sequence.
DR CDD; cd00041; CUB; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR Pfam; PF00431; CUB; 1.
DR SMART; SM00042; CUB; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR PROSITE; PS01180; CUB; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000193380};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..789
FT /note="CUB domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001590411"
FT DOMAIN 32..161
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT REGION 142..166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 358..388
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 437..473
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 534..632
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 699..729
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 756..789
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 142..157
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 362..385
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 540..619
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 789 AA; 84761 MW; 8D8C3A2C3D55DA2C CRC64;
MTCNSSFWAH VFMTAYLLWV TGCVAQKVYF DCGAKVDVVD VQGLILSPGF PYNYSSGTHC
VWQFFVPVGH QLILEIFDFD VFESHDPSAQ YTAISNLKVE DAEADDGATF PSGAKAPQSS
FQHDKVKQVV VQEQSTKMEI AKVSNSAKRS TSDIASPHPP SHLLPRDQAA VNSITSPLLR
GDPDLNPTAN PRAAAPDLAS VSSETQVVDA CPHDVLYISD LLTFSSRFCG SNRPSSSQLV
FGSSQEMVEV IMELITTTHW GRGFALLFHY HNRTEPDGDD PRHAYAPAGA SKMGSLLAAM
SGAAFFAMVL TSTLCIIFRP KLCPKRASSC SSNNSEVQEG VQNSGADVRE LQLVAPNQPS
LEVPRTAEND NNHSLSLTHT GSPVSVGGDM SEHAEVDLSS NGLTELDLGT DEVFIISSAP
STSSRLLFSA HTQRERFLRH SDTGPGPVSD WPSPDLAASP TETRAAQDGC ARPRPRAWSV
RTFQDFLPPL PQLHKKWCSW NSTSPFTKLV DSAPSGFVSE CRGDDSRKVF SDPQLEAQDD
SKASDSSMSN VSYPLTLPAQ RQRRLNSTSN MRRSRFTGPC FGLLSGSNPS DSTKAPGGSL
LQASPSQPSN STSGQGQPEE VQGGKRRDFP AESDHVSVPV FAICEEEDRQ PLILTEHLGH
SSMLNGLSRG VYEAKGPAGG SLNPVPQSAA NGPLLQRGRS EWRPWGSQAS GGASPYPLPH
PSGSHTATNT NESAFVHSTA NQIQMPSLSQ LTVPCSVTGN VTSDSPASSP CDSRADRQSM
QSWVGPAKQ
//