ID A0A3Q0KBW6_SCHMA Unreviewed; 789 AA.
AC A0A3Q0KBW6;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 2.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Putative sex comb on midleg homolog {ECO:0000313|WBParaSite:Smp_006250.1};
OS Schistosoma mansoni (Blood fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX NCBI_TaxID=6183 {ECO:0000313|Proteomes:UP000008854, ECO:0000313|WBParaSite:Smp_006250.1};
RN [1] {ECO:0000313|Proteomes:UP000008854}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Puerto Rican {ECO:0000313|Proteomes:UP000008854};
RX PubMed=22253936; DOI=10.1371/journal.pntd.0001455;
RA Protasio A.V., Tsai I.J., Babbage A., Nichol S., Hunt M., Aslett M.A.,
RA De Silva N., Velarde G.S., Anderson T.J., Clark R.C., Davidson C.,
RA Dillon G.P., Holroyd N.E., LoVerde P.T., Lloyd C., McQuillan J.,
RA Oliveira G., Otto T.D., Parker-Manuel S.J., Quail M.A., Wilson R.A.,
RA Zerlotini A., Dunne D.W., Berriman M.;
RT "A systematically improved high quality genome and transcriptome of the
RT human blood fluke Schistosoma mansoni.";
RL PLoS Negl. Trop. Dis. 6:E1455-E1455(2012).
RN [2] {ECO:0000313|WBParaSite:Smp_006250.1}
RP IDENTIFICATION.
RC STRAIN=Puerto Rican {ECO:0000313|WBParaSite:Smp_006250.1};
RG WormBaseParasite;
RL Submitted (DEC-2018) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_018654106.1; XM_018788577.1.
DR AlphaFoldDB; A0A3Q0KBW6; -.
DR STRING; 6183.A0A3Q0KBW6; -.
DR EnsemblMetazoa; Smp_006250.1; Smp_006250.1; Smp_006250.
DR GeneID; 8352324; -.
DR KEGG; smm:Smp_006250.1; -.
DR WBParaSite; Smp_006250.1; Smp_006250.1; Smp_006250.
DR InParanoid; A0A3Q0KBW6; -.
DR Proteomes; UP000008854; Unassembled WGS sequence.
DR ExpressionAtlas; A0A3Q0KBW6; baseline.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd20092; MBT_dScm-like_rpt2; 1.
DR CDD; cd09509; SAM_Polycomb; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF132; POLYCOMB PROTEIN SCM; 1.
DR Pfam; PF02820; MBT; 2.
DR Pfam; PF00536; SAM_1; 1.
DR SMART; SM00561; MBT; 2.
DR SMART; SM00454; SAM; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 3.
DR PROSITE; PS51079; MBT; 1.
DR PROSITE; PS50105; SAM_DOMAIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008854};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 165..266
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 722..787
FT /note="SAM"
FT /evidence="ECO:0000259|PROSITE:PS50105"
FT REGION 341..380
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..380
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 789 AA; 88792 MW; CB201B4F56EAD440 CRC64;
MGCHVSADSS NSFTWDDYLK KTNGRPAKLE CFKQSLVPPP NYFEVNTILE AEDQRSAALV
LNPYSNDLIK ARYANNSTSS NQQLLETPCR RSSLNPHSSG TVRRFRAASF SLARVIETCG
PRLRIRLVGT DDRNDYWFLV DSDQIRPYPS GSPLQPPFGY MHNHLVWNRT LKKATEGTRF
ADPSWFISQP PDPEDNYFQV NDKLEAVDRR NTQLICPASV GAVNGHHILI NFDGWSGAFD
YWARFDSREL FPVGWCKSAN YPLQPPGPNV IRSPIIHSTP SFLYAHNSVS QAPSVSPKYL
SKVLTNSCSR ISKTDKTSSC PRKKLGLSRR IKSHANLVRS NLRDETESST HFRNPRELST
KQPLSMSPFS PALSKPNHNS VASEHMDVNK NERVDLFVEP NSLNTSQVDS SIQSLTSPPI
VPPVLEDASD SLNTFYSSSP PEHPPRIEAS EFVRDSADGR HRRFSSPSDC VVRLQAKSSL
MPVSNVLHPY EPELKSWKVV TKTKQKNKQH KLKRRSGNSK KKCPEIKQKF KIANDSLTNS
VLCDYEYVDS VSPQLKGFSD MSQNYEKSIA YPEDNPNLYK NETANSTAFC KNDSSLTCTP
VFMADRKSLF KSENDRTNGS TMGSSLVFQS HISHDIMHTN GCYFDEMNSP THDNLSVETS
DSHLWFSGKL ESNSNKSTYP VYAHQNVDSL DSQSLTTHLP SSTVSPSSLD LGSFPFPNPT
QWTIEEVYNY ITARDATLLE AAEKFKHHEI DGQALLLLSM ESLRNYMKIK LGPALKMVHL
ISRLKRGLL
//