GenomeNet

Database: UniProt
Entry: A0A016TYH3_9BILA
LinkDB: A0A016TYH3_9BILA
Original site: A0A016TYH3_9BILA 
ID   A0A016TYH3_9BILA        Unreviewed;       546 AA.
AC   A0A016TYH3;
DT   11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT   11-JUN-2014, sequence version 1.
DT   27-MAR-2024, entry version 27.
DE   RecName: Full=MSP domain-containing protein {ECO:0000259|PROSITE:PS50202};
GN   Name=Acey_s0070.g409 {ECO:0000313|EMBL:EYC07393.1};
GN   ORFNames=Y032_0070g409 {ECO:0000313|EMBL:EYC07393.1};
OS   Ancylostoma ceylanicum.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC   Ancylostomatinae; Ancylostoma.
OX   NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC07393.1, ECO:0000313|Proteomes:UP000024635};
RN   [1] {ECO:0000313|Proteomes:UP000024635}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX   PubMed=25730766; DOI=10.1038/ng.3237;
RA   Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA   Aroian R.V.;
RT   "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT   ceylanicum identify infection-specific gene families.";
RL   Nat. Genet. 47:416-422(2015).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EYC07393.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JARK01001406; EYC07393.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A016TYH3; -.
DR   Proteomes; UP000024635; Unassembled WGS sequence.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000535; MSP_dom.
DR   InterPro; IPR008962; PapD-like_sf.
DR   Pfam; PF00635; Motile_Sperm; 1.
DR   SUPFAM; SSF49354; PapD-like; 1.
DR   PROSITE; PS50202; MSP; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT   DOMAIN          46..159
FT                   /note="MSP"
FT                   /evidence="ECO:0000259|PROSITE:PS50202"
FT   REGION          1..28
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          166..192
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          248..428
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          449..482
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          527..546
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        168..192
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        248..318
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        319..347
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        348..428
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        532..546
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   546 AA;  60130 MW;  22F64D076E2D8D33 CRC64;
     MSSRSESPSP KEKSKPSKRT KAGAGESAQA PVIPNIIAAA ESALAKVDMI PTKSVTFFPN
     EKRQQTYIVL SNNSDRKVMF KMKSTRPGVY KMKPVFGVVN PGEKYSIRLS YMGIKVGHRI
     PINDRITVVL ASVAHKGGET DKEATEGEMK KRKIYILYKG VNDQVDPEAG GDAEKKPAVD
     KEQAARSAAD HKAYMDGYDE GYKAAIIESR DSKASANPAE ALERLQKSKG VGKEPKFEEG
     FKEGYKKAME LLKTTEKKEM KEEKSEPKET KTPPPPKEPV KEPVKEVVKE PAKETAKGTP
     KEPEKIAPKK EAAKVQSGRK SKQKSKPSKK SGKAPAKGKK VAKTPKKKSK VATKEVVKPA
     TKEPVKEPAK PPPKAKPEGK TPKKMKSPPK EVAKEASKKE PPKEPAKAPV KEPAKTPEPK
     KEPEKVTEVR KELVKEIKKE EVKELKKEFD KAEKEKLSDG TNVTAFTPGG TDPSLGDPLK
     RKDIVHVGPT GKVRIILAVF RDLVDNPSEF QKQIVIIMYR DARVPENLLS RSGDEDDDDD
     VKGDMP
//
DBGET integrated database retrieval system