ID O61765_CAEEL Unreviewed; 301 AA.
AC O61765;
DT 01-AUG-1998, integrated into UniProtKB/TrEMBL.
DT 01-AUG-1998, sequence version 1.
DT 27-MAR-2024, entry version 142.
DE RecName: Full=Major sperm protein {ECO:0000256|RuleBase:RU003425};
GN ORFNames=C35E7.9 {ECO:0000313|EMBL:CCD66815.1,
GN ECO:0000313|WormBase:C35E7.9}, CELE_C35E7.9
GN {ECO:0000313|EMBL:CCD66815.1};
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239 {ECO:0000313|EMBL:CCD66815.1, ECO:0000313|Proteomes:UP000001940};
RN [1] {ECO:0000313|EMBL:CCD66815.1, ECO:0000313|Proteomes:UP000001940}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2 {ECO:0000313|EMBL:CCD66815.1,
RC ECO:0000313|Proteomes:UP000001940};
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RA Sulson J.E., Waterston R.;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC -!- FUNCTION: Central component in molecular interactions underlying sperm
CC crawling. Forms an extensive filament system that extends from sperm
CC villipoda, along the leading edge of the pseudopod.
CC {ECO:0000256|RuleBase:RU003425}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX284601; CCD66815.1; -; Genomic_DNA.
DR PIR; T33068; T33068.
DR RefSeq; NP_492824.1; NM_060423.3.
DR AlphaFoldDB; O61765; -.
DR SMR; O61765; -.
DR STRING; 6239.C35E7.9.1; -.
DR PaxDb; 6239-C35E7-9; -.
DR EnsemblMetazoa; C35E7.9.1; C35E7.9.1; WBGene00016461.
DR GeneID; 172987; -.
DR KEGG; cel:CELE_C35E7.9; -.
DR UCSC; C35E7.9; c. elegans.
DR AGR; WB:WBGene00016461; -.
DR WormBase; C35E7.9; CE17522; WBGene00016461; -.
DR eggNOG; ENOG502SEH0; Eukaryota.
DR GeneTree; ENSGT00970000195941; -.
DR HOGENOM; CLU_059627_0_0_1; -.
DR InParanoid; O61765; -.
DR OMA; IIASIHI; -.
DR OrthoDB; 2879630at2759; -.
DR Proteomes; UP000001940; Chromosome I.
DR Bgee; WBGene00016461; Expressed in adult organism and 2 other cell types or tissues.
DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000535; MSP_dom.
DR InterPro; IPR008962; PapD-like_sf.
DR PANTHER; PTHR21515:SF4; CYLICIN HOMOLOGUE-RELATED; 1.
DR PANTHER; PTHR21515; MAJOR SPERM PROTEIN; 1.
DR Pfam; PF00635; Motile_Sperm; 1.
DR SUPFAM; SSF49354; PapD-like; 1.
DR PROSITE; PS50202; MSP; 1.
PE 1: Evidence at protein level;
KW Cytoplasm {ECO:0000256|RuleBase:RU003425};
KW Cytoskeleton {ECO:0000256|RuleBase:RU003425};
KW Proteomics identification {ECO:0007829|EPD:O61765};
KW Reference proteome {ECO:0000313|Proteomes:UP000001940}.
FT DOMAIN 192..301
FT /note="MSP"
FT /evidence="ECO:0000259|PROSITE:PS50202"
FT REGION 21..201
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 46..194
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 301 AA; 34339 MW; 1B74ABDB3B35311E CRC64;
MTIEISVLIG ITIAGWILAG CGGKKKKDGK SSTASAAAPK ADSKMKPPVE NVKSKKSEKK
EEPKKEEEPK KEEEKKEKSK KSEKKDDKKE EAKKEDDKKD EKKDEKKEDK KDDKKDDKKE
EKKEEKKEDE KEKGDDKKED EKDDKKSGSK DAEKKEEKKE EEKKEEKKEE KKEEKKEDEK
KEDEKKEEPK PHITIDPPGD LMFKADQQEQ KKLKLTNTHD KKIMFKIKTS DNQVYLMNPV
YGTVEPGKSA NLTLTRNKAP AKEAKLVIVN SVFSGDDKDL AKSFKTGKPT GGQVTIMLNG
K
//