GenomeNet

Database: UniProt
Entry: A0A0B2VJ21_TOXCA
ID   A0A0B2VJ21_TOXCA        Unreviewed;       375 AA.
AC   A0A0B2VJ21;
DT   04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT   04-MAR-2015, sequence version 1.
DT   24-JAN-2024, entry version 16.
DE   RecName: Full=Major sperm protein {ECO:0000256|RuleBase:RU003425};
GN   Name=ssp-31 {ECO:0000313|EMBL:KHN83496.1};
GN   ORFNames=Tcan_15924 {ECO:0000313|EMBL:KHN83496.1};
OS   Toxocara canis (Canine roundworm).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Spirurina; Ascaridomorpha; Ascaridoidea; Toxocaridae; Toxocara.
OX   NCBI_TaxID=6265 {ECO:0000313|EMBL:KHN83496.1, ECO:0000313|Proteomes:UP000031036};
RN   [1] {ECO:0000313|EMBL:KHN83496.1, ECO:0000313|Proteomes:UP000031036}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PN_DK_2014 {ECO:0000313|EMBL:KHN83496.1};
RA   Zhu X.-Q., Korhonen P.K., Cai H., Young N.D., Nejsum P.,
RA   von Samson-Himmelstjerna G., Boag P.R., Tan P., Li Q., Min J., Yang Y.,
RA   Wang X., Fang X., Hall R.S., Hofmann A., Sternberg P.W., Jex A.R.,
RA   Gasser R.B.;
RT   "Genetic blueprint of the zoonotic pathogen Toxocara canis.";
RL   Submitted (NOV-2014) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Central component in molecular interactions underlying sperm
CC       crawling. Forms an extensive filament system that extends from sperm
CC       villipoda, along the leading edge of the pseudopod.
CC       {ECO:0000256|RuleBase:RU003425}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KHN83496.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JPKZ01001183; KHN83496.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A0B2VJ21; -.
DR   Proteomes; UP000031036; Unassembled WGS sequence.
DR   GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000535; MSP_dom.
DR   InterPro; IPR008962; PapD-like_sf.
DR   PANTHER; PTHR22947; MAJOR SPERM PROTEIN; 1.
DR   PANTHER; PTHR22947:SF3; MSP DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   Pfam; PF00635; Motile_Sperm; 1.
DR   SUPFAM; SSF49354; PapD-like; 1.
DR   PROSITE; PS50202; MSP; 1.
PE   4: Predicted;
KW   Cytoplasm {ECO:0000256|RuleBase:RU003425};
KW   Cytoskeleton {ECO:0000256|RuleBase:RU003425};
KW   Reference proteome {ECO:0000313|Proteomes:UP000031036}.
FT   DOMAIN          226..328
FT                   /note="MSP"
FT                   /evidence="ECO:0000259|PROSITE:PS50202"
FT   REGION          1..22
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          108..163
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          194..227
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        127..155
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        197..217
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   375 AA;  41410 MW;  365FB6A186E3C5AD CRC64;
     MEMAAIPNGT SESERIAAGV SSRENTSLRN NILGESRIDA AAFSPNTATI TIRRNDLCYC
     RTRNGSSRHR LKKSSATIRV PRMVLEARSP GSKPNFLLVQ LEAIKNKNEK MKGRSPGAAV
     PLPKPKESKG PLEKEKNSPT SSETKIDERV KNNPEPVKLP DSPQFKSAIL PLLPPTSLGG
     ALSTLTARAS QIMCKHTEQP DTKKSAESTK TNDAKKPNQK GTRNNGLTVE PMEAEFSVEG
     GMCTLMLLNE SNVRFAIKIK TSNNQFFRVN PVYSFLDSGT MNELEIFRLP GGSARIDKLL
     LCYVIAKEED TNAKALFDLR AKIQNLYIFR LPGGSARIDK LLLCYVIAKE EDTNAKALFD
     LRAKIQNLYV KLKTV
//