GenomeNet

Database: UniProt
Entry: A0A0V1NIN1_9BILA
LinkDB: A0A0V1NIN1_9BILA
Original site: A0A0V1NIN1_9BILA 
ID   A0A0V1NIN1_9BILA        Unreviewed;       476 AA.
AC   A0A0V1NIN1;
DT   16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT   16-MAR-2016, sequence version 1.
DT   24-JAN-2024, entry version 15.
DE   RecName: Full=Major sperm protein {ECO:0000256|RuleBase:RU003425};
GN   Name=msp-78 {ECO:0000313|EMBL:KRZ83667.1};
GN   ORFNames=T08_11672 {ECO:0000313|EMBL:KRZ83667.1};
OS   Trichinella sp. T8.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC   Trichinellida; Trichinellidae; Trichinella.
OX   NCBI_TaxID=92180 {ECO:0000313|EMBL:KRZ83667.1, ECO:0000313|Proteomes:UP000054924};
RN   [1] {ECO:0000313|EMBL:KRZ83667.1, ECO:0000313|Proteomes:UP000054924}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ISS272 {ECO:0000313|EMBL:KRZ83667.1};
RA   Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT   "Evolution of Trichinella species and genotypes.";
RL   Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Central component in molecular interactions underlying sperm
CC       crawling. Forms an extensive filament system that extends from sperm
CC       villipoda, along the leading edge of the pseudopod.
CC       {ECO:0000256|RuleBase:RU003425}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KRZ83667.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JYDM01000202; KRZ83667.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A0V1NIN1; -.
DR   STRING; 92180.A0A0V1NIN1; -.
DR   Proteomes; UP000054924; Unassembled WGS sequence.
DR   GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000535; MSP_dom.
DR   InterPro; IPR008962; PapD-like_sf.
DR   InterPro; IPR029526; PGBD.
DR   PANTHER; PTHR47055; DDE_TNP_1_7 DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR47055:SF3; DDE_TNP_1_7 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF13843; DDE_Tnp_1_7; 2.
DR   Pfam; PF00635; Motile_Sperm; 1.
DR   SUPFAM; SSF49354; PapD-like; 1.
DR   PROSITE; PS50202; MSP; 1.
PE   4: Predicted;
KW   Cytoplasm {ECO:0000256|RuleBase:RU003425};
KW   Cytoskeleton {ECO:0000256|RuleBase:RU003425};
KW   Reference proteome {ECO:0000313|Proteomes:UP000054924}.
FT   DOMAIN          359..476
FT                   /note="MSP"
FT                   /evidence="ECO:0000259|PROSITE:PS50202"
FT   REGION          1..41
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          67..95
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          232..255
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..20
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        22..41
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        240..255
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   476 AA;  53715 MW;  1D69A54C3E34F4AA CRC64;
     MKWLRSSRKE TPKKDETEAS TGHGALASSS MDISDSLEGR RCEQVSTLQE AKEVMATSNH
     VLNVVVLPPT AGDSGSDDTD QEYLPDDPED ESDPAVLSRY NCQPEAKNYW PTQPDMGAQC
     AISCMARNRF MEIKKYLHLA DNQKLVKGDK MSKVTPLYKL LNSSLVKHEA KNYWPTQPDM
     GAQCAISCMA RNRFMEIKKY LHLADNQKLV KGDKMSKVTP LYKLLNSSLV KHGSDDTDQE
     YLPDDPEDES DPAGELEVEQ EQFIGVIVLS RYNCQPEAKN YWPTQPDMGA QCAISCMARN
     RFMEIKKYLH LADNQKLVKG DKMSKVTPLY KLLNSSLVKH DQCSFEESCS CYRMALPTDI
     VTVPAEEVWF NAPCLSEQVT ALKLSNPGNR LLGYKVVSKV ENRYVVLPSQ GALQPKKDIT
     INIICHPFPF RSESPPVDTL IIEWVDAFES ETDFNEDWFV MGGIVRKKVL SVKFNP
//
DBGET integrated database retrieval system