ID A0A0V1NIN1_9BILA Unreviewed; 476 AA.
AC A0A0V1NIN1;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE RecName: Full=Major sperm protein {ECO:0000256|RuleBase:RU003425};
GN Name=msp-78 {ECO:0000313|EMBL:KRZ83667.1};
GN ORFNames=T08_11672 {ECO:0000313|EMBL:KRZ83667.1};
OS Trichinella sp. T8.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=92180 {ECO:0000313|EMBL:KRZ83667.1, ECO:0000313|Proteomes:UP000054924};
RN [1] {ECO:0000313|EMBL:KRZ83667.1, ECO:0000313|Proteomes:UP000054924}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS272 {ECO:0000313|EMBL:KRZ83667.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Central component in molecular interactions underlying sperm
CC crawling. Forms an extensive filament system that extends from sperm
CC villipoda, along the leading edge of the pseudopod.
CC {ECO:0000256|RuleBase:RU003425}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRZ83667.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDM01000202; KRZ83667.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0V1NIN1; -.
DR STRING; 92180.A0A0V1NIN1; -.
DR Proteomes; UP000054924; Unassembled WGS sequence.
DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000535; MSP_dom.
DR InterPro; IPR008962; PapD-like_sf.
DR InterPro; IPR029526; PGBD.
DR PANTHER; PTHR47055; DDE_TNP_1_7 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR47055:SF3; DDE_TNP_1_7 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF13843; DDE_Tnp_1_7; 2.
DR Pfam; PF00635; Motile_Sperm; 1.
DR SUPFAM; SSF49354; PapD-like; 1.
DR PROSITE; PS50202; MSP; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|RuleBase:RU003425};
KW Cytoskeleton {ECO:0000256|RuleBase:RU003425};
KW Reference proteome {ECO:0000313|Proteomes:UP000054924}.
FT DOMAIN 359..476
FT /note="MSP"
FT /evidence="ECO:0000259|PROSITE:PS50202"
FT REGION 1..41
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 67..95
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 232..255
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..20
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 22..41
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 240..255
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 476 AA; 53715 MW; 1D69A54C3E34F4AA CRC64;
MKWLRSSRKE TPKKDETEAS TGHGALASSS MDISDSLEGR RCEQVSTLQE AKEVMATSNH
VLNVVVLPPT AGDSGSDDTD QEYLPDDPED ESDPAVLSRY NCQPEAKNYW PTQPDMGAQC
AISCMARNRF MEIKKYLHLA DNQKLVKGDK MSKVTPLYKL LNSSLVKHEA KNYWPTQPDM
GAQCAISCMA RNRFMEIKKY LHLADNQKLV KGDKMSKVTP LYKLLNSSLV KHGSDDTDQE
YLPDDPEDES DPAGELEVEQ EQFIGVIVLS RYNCQPEAKN YWPTQPDMGA QCAISCMARN
RFMEIKKYLH LADNQKLVKG DKMSKVTPLY KLLNSSLVKH DQCSFEESCS CYRMALPTDI
VTVPAEEVWF NAPCLSEQVT ALKLSNPGNR LLGYKVVSKV ENRYVVLPSQ GALQPKKDIT
INIICHPFPF RSESPPVDTL IIEWVDAFES ETDFNEDWFV MGGIVRKKVL SVKFNP
//