ID A0A016TYH3_9BILA Unreviewed; 546 AA.
AC A0A016TYH3;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=MSP domain-containing protein {ECO:0000259|PROSITE:PS50202};
GN Name=Acey_s0070.g409 {ECO:0000313|EMBL:EYC07393.1};
GN ORFNames=Y032_0070g409 {ECO:0000313|EMBL:EYC07393.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC07393.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYC07393.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001406; EYC07393.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A016TYH3; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000535; MSP_dom.
DR InterPro; IPR008962; PapD-like_sf.
DR Pfam; PF00635; Motile_Sperm; 1.
DR SUPFAM; SSF49354; PapD-like; 1.
DR PROSITE; PS50202; MSP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT DOMAIN 46..159
FT /note="MSP"
FT /evidence="ECO:0000259|PROSITE:PS50202"
FT REGION 1..28
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 166..192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 248..428
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 449..482
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 527..546
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 168..192
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 248..318
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 319..347
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 348..428
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 532..546
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 546 AA; 60130 MW; 22F64D076E2D8D33 CRC64;
MSSRSESPSP KEKSKPSKRT KAGAGESAQA PVIPNIIAAA ESALAKVDMI PTKSVTFFPN
EKRQQTYIVL SNNSDRKVMF KMKSTRPGVY KMKPVFGVVN PGEKYSIRLS YMGIKVGHRI
PINDRITVVL ASVAHKGGET DKEATEGEMK KRKIYILYKG VNDQVDPEAG GDAEKKPAVD
KEQAARSAAD HKAYMDGYDE GYKAAIIESR DSKASANPAE ALERLQKSKG VGKEPKFEEG
FKEGYKKAME LLKTTEKKEM KEEKSEPKET KTPPPPKEPV KEPVKEVVKE PAKETAKGTP
KEPEKIAPKK EAAKVQSGRK SKQKSKPSKK SGKAPAKGKK VAKTPKKKSK VATKEVVKPA
TKEPVKEPAK PPPKAKPEGK TPKKMKSPPK EVAKEASKKE PPKEPAKAPV KEPAKTPEPK
KEPEKVTEVR KELVKEIKKE EVKELKKEFD KAEKEKLSDG TNVTAFTPGG TDPSLGDPLK
RKDIVHVGPT GKVRIILAVF RDLVDNPSEF QKQIVIIMYR DARVPENLLS RSGDEDDDDD
VKGDMP
//