ID E3MG47_CAERE Unreviewed; 315 AA.
AC E3MG47;
DT 11-JAN-2011, integrated into UniProtKB/TrEMBL.
DT 11-JAN-2011, sequence version 1.
DT 27-MAR-2024, entry version 50.
DE RecName: Full=Major sperm protein {ECO:0000256|RuleBase:RU003425};
GN ORFNames=CRE_24030 {ECO:0000313|EMBL:EFP01396.1};
OS Caenorhabditis remanei (Caenorhabditis vulgaris).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281};
RN [1] {ECO:0000313|Proteomes:UP000008281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281};
RG Caenorhabditis remanei Sequencing Consortium;
RA Wilson R.K.;
RT "PCAP assembly of the Caenorhabditis remanei genome.";
RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Central component in molecular interactions underlying sperm
CC crawling. Forms an extensive filament system that extends from sperm
CC villipoda, along the leading edge of the pseudopod.
CC {ECO:0000256|RuleBase:RU003425}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS268443; EFP01396.1; -; Genomic_DNA.
DR RefSeq; XP_003104745.1; XM_003104697.1.
DR AlphaFoldDB; E3MG47; -.
DR STRING; 31234.E3MG47; -.
DR EnsemblMetazoa; CRE24030.1; CRE24030.1; WBGene00083076.
DR eggNOG; ENOG502SEH0; Eukaryota.
DR HOGENOM; CLU_059627_0_0_1; -.
DR InParanoid; E3MG47; -.
DR OMA; AWEEKKS; -.
DR Proteomes; UP000008281; Unassembled WGS sequence.
DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000535; MSP_dom.
DR InterPro; IPR008962; PapD-like_sf.
DR PANTHER; PTHR21515:SF4; CYLICIN HOMOLOGUE-RELATED; 1.
DR PANTHER; PTHR21515; MAJOR SPERM PROTEIN; 1.
DR Pfam; PF00635; Motile_Sperm; 1.
DR SUPFAM; SSF49354; PapD-like; 1.
DR PROSITE; PS50202; MSP; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|RuleBase:RU003425};
KW Cytoskeleton {ECO:0000256|RuleBase:RU003425};
KW Reference proteome {ECO:0000313|Proteomes:UP000008281};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..315
FT /note="Major sperm protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003176830"
FT DOMAIN 204..315
FT /note="MSP"
FT /evidence="ECO:0000259|PROSITE:PS50202"
FT REGION 21..209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 35..50
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 52..202
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 315 AA; 35996 MW; 2950BC173A9ED2ED CRC64;
MGAGISVFVG ITTSFWLIVG CGGKKKKGGA AKPIQSTPPP APPPPPPNVE SKMKALEEKE
KKSEKKEEAK KEEEKKEKSK KSEKKEEKKE DKKEEKKEEK KEDDKKEKSK KSEIKEEEKK
EDKEKKDDKK EDEKEKDDEK KEEKEEEKKE GIKEEKKEEK DEDKKEEEKK EEEKKDEKKE
EEKKDDEKKE EKKEDPKAGE LKPHITVDPI GDLEFQADKQ EQKKITISNS HDKKIMFKLK
TSDNNVYLVN PVFGTIEPGK TAEVLITRNK APAKEAKLVI VNSLVSFSGD DKDLAKSFKT
AKPTGGQVTV KLCAK
//