ID A0A0B2VJ21_TOXCA Unreviewed; 375 AA.
AC A0A0B2VJ21;
DT 04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT 04-MAR-2015, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE RecName: Full=Major sperm protein {ECO:0000256|RuleBase:RU003425};
GN Name=ssp-31 {ECO:0000313|EMBL:KHN83496.1};
GN ORFNames=Tcan_15924 {ECO:0000313|EMBL:KHN83496.1};
OS Toxocara canis (Canine roundworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Spirurina; Ascaridomorpha; Ascaridoidea; Toxocaridae; Toxocara.
OX NCBI_TaxID=6265 {ECO:0000313|EMBL:KHN83496.1, ECO:0000313|Proteomes:UP000031036};
RN [1] {ECO:0000313|EMBL:KHN83496.1, ECO:0000313|Proteomes:UP000031036}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PN_DK_2014 {ECO:0000313|EMBL:KHN83496.1};
RA Zhu X.-Q., Korhonen P.K., Cai H., Young N.D., Nejsum P.,
RA von Samson-Himmelstjerna G., Boag P.R., Tan P., Li Q., Min J., Yang Y.,
RA Wang X., Fang X., Hall R.S., Hofmann A., Sternberg P.W., Jex A.R.,
RA Gasser R.B.;
RT "Genetic blueprint of the zoonotic pathogen Toxocara canis.";
RL Submitted (NOV-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Central component in molecular interactions underlying sperm
CC crawling. Forms an extensive filament system that extends from sperm
CC villipoda, along the leading edge of the pseudopod.
CC {ECO:0000256|RuleBase:RU003425}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KHN83496.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JPKZ01001183; KHN83496.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0B2VJ21; -.
DR Proteomes; UP000031036; Unassembled WGS sequence.
DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000535; MSP_dom.
DR InterPro; IPR008962; PapD-like_sf.
DR PANTHER; PTHR22947; MAJOR SPERM PROTEIN; 1.
DR PANTHER; PTHR22947:SF3; MSP DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF00635; Motile_Sperm; 1.
DR SUPFAM; SSF49354; PapD-like; 1.
DR PROSITE; PS50202; MSP; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|RuleBase:RU003425};
KW Cytoskeleton {ECO:0000256|RuleBase:RU003425};
KW Reference proteome {ECO:0000313|Proteomes:UP000031036}.
FT DOMAIN 226..328
FT /note="MSP"
FT /evidence="ECO:0000259|PROSITE:PS50202"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 108..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 194..227
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 127..155
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 197..217
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 375 AA; 41410 MW; 365FB6A186E3C5AD CRC64;
MEMAAIPNGT SESERIAAGV SSRENTSLRN NILGESRIDA AAFSPNTATI TIRRNDLCYC
RTRNGSSRHR LKKSSATIRV PRMVLEARSP GSKPNFLLVQ LEAIKNKNEK MKGRSPGAAV
PLPKPKESKG PLEKEKNSPT SSETKIDERV KNNPEPVKLP DSPQFKSAIL PLLPPTSLGG
ALSTLTARAS QIMCKHTEQP DTKKSAESTK TNDAKKPNQK GTRNNGLTVE PMEAEFSVEG
GMCTLMLLNE SNVRFAIKIK TSNNQFFRVN PVYSFLDSGT MNELEIFRLP GGSARIDKLL
LCYVIAKEED TNAKALFDLR AKIQNLYIFR LPGGSARIDK LLLCYVIAKE EDTNAKALFD
LRAKIQNLYV KLKTV
//