ID A0A016UBK3_9BILA Unreviewed; 294 AA.
AC A0A016UBK3;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=Major sperm protein {ECO:0000256|RuleBase:RU003425};
GN Name=Acey_s0048.g1668 {ECO:0000313|EMBL:EYC12212.1};
GN ORFNames=Y032_0048g1668 {ECO:0000313|EMBL:EYC12212.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC12212.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- FUNCTION: Central component in molecular interactions underlying sperm
CC crawling. Forms an extensive filament system that extends from sperm
CC villipoda, along the leading edge of the pseudopod.
CC {ECO:0000256|RuleBase:RU003425}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYC12212.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001384; EYC12212.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A016UBK3; -.
DR STRING; 53326.A0A016UBK3; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000535; MSP_dom.
DR InterPro; IPR008962; PapD-like_sf.
DR PANTHER; PTHR21513; MAJOR SPERM PROTEIN; 1.
DR PANTHER; PTHR21513:SF19; MAJOR SPERM PROTEIN; 1.
DR Pfam; PF00635; Motile_Sperm; 1.
DR SUPFAM; SSF49354; PapD-like; 1.
DR PROSITE; PS50202; MSP; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|RuleBase:RU003425};
KW Cytoskeleton {ECO:0000256|RuleBase:RU003425};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT DOMAIN 152..272
FT /note="MSP"
FT /evidence="ECO:0000259|PROSITE:PS50202"
FT REGION 1..140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 275..294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 22..36
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 62..93
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 109..139
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 294 AA; 32162 MW; 1EE50C0EA6051A67 CRC64;
MDVPDSDGEC GLGQGSFLQK EHRIPSKSKE ILENAHKTTT FHTTTAADRG RPKATGMVAE
GGEEPPKDDT GKKPPDAKGA PEQAKDGNKT PGKDGANDGK AAPDAGKAAD APKGDEGKEK
EKEKEKEKEK AKEKETQVPA DVQQAEGTHV LGKAAGPKEK PRNVVFKVPP ERKPVWSDIK
IQNPTNDRKT FKVKCTSAEI FRVQPPFGFI RPLDTARIRV WFQNSNGVPT DGKKHYFAVY
FMNALEGKTV KELWHKTAKH EGICRINAIF EKVGTDNQPI PPKPMDDAAK DKPA
//