GenomeNet

Database: UniProt
Entry: E3MG47_CAERE
LinkDB: E3MG47_CAERE
Original site: E3MG47_CAERE 
ID   E3MG47_CAERE            Unreviewed;       315 AA.
AC   E3MG47;
DT   11-JAN-2011, integrated into UniProtKB/TrEMBL.
DT   11-JAN-2011, sequence version 1.
DT   27-MAR-2024, entry version 50.
DE   RecName: Full=Major sperm protein {ECO:0000256|RuleBase:RU003425};
GN   ORFNames=CRE_24030 {ECO:0000313|EMBL:EFP01396.1};
OS   Caenorhabditis remanei (Caenorhabditis vulgaris).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC   Caenorhabditis.
OX   NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281};
RN   [1] {ECO:0000313|Proteomes:UP000008281}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281};
RG   Caenorhabditis remanei Sequencing Consortium;
RA   Wilson R.K.;
RT   "PCAP assembly of the Caenorhabditis remanei genome.";
RL   Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Central component in molecular interactions underlying sperm
CC       crawling. Forms an extensive filament system that extends from sperm
CC       villipoda, along the leading edge of the pseudopod.
CC       {ECO:0000256|RuleBase:RU003425}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DS268443; EFP01396.1; -; Genomic_DNA.
DR   RefSeq; XP_003104745.1; XM_003104697.1.
DR   AlphaFoldDB; E3MG47; -.
DR   STRING; 31234.E3MG47; -.
DR   EnsemblMetazoa; CRE24030.1; CRE24030.1; WBGene00083076.
DR   eggNOG; ENOG502SEH0; Eukaryota.
DR   HOGENOM; CLU_059627_0_0_1; -.
DR   InParanoid; E3MG47; -.
DR   OMA; AWEEKKS; -.
DR   Proteomes; UP000008281; Unassembled WGS sequence.
DR   GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000535; MSP_dom.
DR   InterPro; IPR008962; PapD-like_sf.
DR   PANTHER; PTHR21515:SF4; CYLICIN HOMOLOGUE-RELATED; 1.
DR   PANTHER; PTHR21515; MAJOR SPERM PROTEIN; 1.
DR   Pfam; PF00635; Motile_Sperm; 1.
DR   SUPFAM; SSF49354; PapD-like; 1.
DR   PROSITE; PS50202; MSP; 1.
PE   4: Predicted;
KW   Cytoplasm {ECO:0000256|RuleBase:RU003425};
KW   Cytoskeleton {ECO:0000256|RuleBase:RU003425};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008281};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..315
FT                   /note="Major sperm protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003176830"
FT   DOMAIN          204..315
FT                   /note="MSP"
FT                   /evidence="ECO:0000259|PROSITE:PS50202"
FT   REGION          21..209
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        35..50
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        52..202
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   315 AA;  35996 MW;  2950BC173A9ED2ED CRC64;
     MGAGISVFVG ITTSFWLIVG CGGKKKKGGA AKPIQSTPPP APPPPPPNVE SKMKALEEKE
     KKSEKKEEAK KEEEKKEKSK KSEKKEEKKE DKKEEKKEEK KEDDKKEKSK KSEIKEEEKK
     EDKEKKDDKK EDEKEKDDEK KEEKEEEKKE GIKEEKKEEK DEDKKEEEKK EEEKKDEKKE
     EEKKDDEKKE EKKEDPKAGE LKPHITVDPI GDLEFQADKQ EQKKITISNS HDKKIMFKLK
     TSDNNVYLVN PVFGTIEPGK TAEVLITRNK APAKEAKLVI VNSLVSFSGD DKDLAKSFKT
     AKPTGGQVTV KLCAK
//
DBGET integrated database retrieval system