GenomeNet

Database: UniProt
Entry: A0A1X0NRQ1_9TRYP
LinkDB: A0A1X0NRQ1_9TRYP
Original site: A0A1X0NRQ1_9TRYP 
ID   A0A1X0NRQ1_9TRYP        Unreviewed;       420 AA.
AC   A0A1X0NRQ1;
DT   05-JUL-2017, integrated into UniProtKB/TrEMBL.
DT   05-JUL-2017, sequence version 1.
DT   22-FEB-2023, entry version 16.
DE   RecName: Full=MSP domain-containing protein {ECO:0000259|PROSITE:PS50202};
GN   ORFNames=TM35_000231340 {ECO:0000313|EMBL:ORC87163.1};
OS   Trypanosoma theileri.
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma.
OX   NCBI_TaxID=67003 {ECO:0000313|EMBL:ORC87163.1, ECO:0000313|Proteomes:UP000192257};
RN   [1] {ECO:0000313|EMBL:ORC87163.1, ECO:0000313|Proteomes:UP000192257}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Edinburgh {ECO:0000313|EMBL:ORC87163.1};
RA   Kelly S., Ivens A., Mott A., O'Neill E., Emms D., Macleod O., Voorheis P.,
RA   Matthews J., Matthews K., Carrington M.;
RT   "An alternative strategy for trypanosome survival in the mammalian
RT   bloodstream revealed through genome and transcriptome analysis of the
RT   ubiquitous bovine parasite Trypanosoma (Megatrypanum) theileri.";
RL   Submitted (MAR-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ORC87163.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NBCO01000023; ORC87163.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1X0NRQ1; -.
DR   VEuPathDB; TriTrypDB:TM35_000231340; -.
DR   OrthoDB; 131750at2759; -.
DR   Proteomes; UP000192257; Unassembled WGS sequence.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000535; MSP_dom.
DR   InterPro; IPR008962; PapD-like_sf.
DR   Pfam; PF00635; Motile_Sperm; 1.
DR   SUPFAM; SSF49354; PapD-like; 1.
DR   PROSITE; PS50202; MSP; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000192257}.
FT   DOMAIN          1..154
FT                   /note="MSP"
FT                   /evidence="ECO:0000259|PROSITE:PS50202"
FT   REGION          130..178
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          234..358
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        234..260
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        286..326
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        338..358
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   420 AA;  46449 MW;  C6B711FD9477E937 CRC64;
     MPSTGVSKEL VHFSQDFLYF PLPLTDVTIE NIVQLRSLLP RTGNNGENVI AFKVLCSVRN
     RYSVRPSTGF ILPGESVNIK FLLDAQQLRR NAANNKDGSA ANALPDVNTH DDIVVDLVVV
     PRDQAIMYLQ RQQNNNNNNS KGNTKESNNK ANNNNNNNSN NNNVSNSNNS HSYMNSNNTP
     VCVEEAATFW KERGQVRADD IQATRRKLRC VYGEKNVPDS LVMRFSTEKL VSSEETPVIN
     TPRRSSNNHN NHNNVNIDSL LLPPTVPRTR PHTNHMDLHR RTAPSAPPRS SPSASLSNRE
     KTPSSVGPSS PSKSVGMNNS QAAAVGRTAG SPFAPRMDGG SNNYNTITNT NTTTTTTTAS
     NNNNLFMSIG SGKEDKEDSF WHKYLYYKIP YPMLGVLLVL SFLCALFERG TLLSWLLIGQ
//
DBGET integrated database retrieval system