ID A0A267DSX8_9PLAT Unreviewed; 920 AA.
AC A0A267DSX8;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=WW domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BOX15_Mlig027479g3 {ECO:0000313|EMBL:PAA52276.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA52276.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA52276.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA52276.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA52276.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA52276.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01003270; PAA52276.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A267DSX8; -.
DR STRING; 282301.A0A267DSX8; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00201; WW; 2.
DR Gene3D; 2.20.70.10; -; 2.
DR Gene3D; 1.10.10.440; FF domain; 5.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR039726; Prp40-like.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR11864; PRE-MRNA-PROCESSING PROTEIN PRP40; 1.
DR PANTHER; PTHR11864:SF0; PRP40 PRE-MRNA PROCESSING FACTOR 40 HOMOLOG A (YEAST); 1.
DR Pfam; PF01846; FF; 3.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00441; FF; 5.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF81698; FF domain; 5.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS51676; FF; 3.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000215902}.
FT DOMAIN 66..99
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 112..140
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 280..334
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 347..401
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 629..686
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 1..41
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 168..200
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 215..288
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 691..908
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 328..360
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1..21
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 712..755
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 765..784
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..819
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 920 AA; 103164 MW; 3FAB07FCFA5724D8 CRC64;
MSAANGFPVR TQHQQHSSPV VGAPMFPSGA APGAYSAPAP QHQQQPHHAF HAMPHHQMGH
PAAAAAATDK KWAEHTMPDG KTYYYNLVTG KTVWEKPDDL KTEWELIMAR CPWREFRTDA
GKVYYSHSVT NQSVWTKPEE LRLAEQQAEM ARQRAAAAAA IEPLRQMQHL QQSGPPPPPT
AAVGSPAASG GGGGGGGSSA IEEAMKKTLE SFGVIPPTTD AAASDAGSSA SAAGKRGGGK
RRRGGGDMSG ADDDSDSSED SDGSGGSGRK KKKNKPQQQQ QSKEDVFKEL LRDRRVPSSA
TWETAVRMIQ EDPRYQQVKH LQHRRQVFNN YKTQRQKEEK EEQRLKIKKA REDLEQFLLQ
TAPITSSTKY RTAEKMFVDL RVWTAVPERD RRDIFDDAVR EIDKREREAQ RALQRRNIER
FGEILAGMKE MNYLTTWTEA QQMLSNSASF RSDRELLDMD KEDALICFQK HVTRLEKEED
DKKEENQLRQ RRQERKNREA FVVLLDELHD QGRLTANSLW KDCFRDICRD RRFDQMLCQQ
GSTPLDLFKF YVDDLKKRFP EEKRIVKEIM KDRAFSICPD TALEDFLRLI ASDERGRSLD
PGNVERSFDS LQEKARCIEQ EKRAEERRKM QRHAEAFKEL LFSADPPVDA STSWDTVRER
FGQADCFQAI PLESERLIVF KDFLKSLAEP AASAASGGTG AAAGSKKKKK KKDRDREADK
DKDHRSKDKD RKKSRRNSDG DLDGEKADSG GERDKKAKKK KKKQSRHSDD GEGDAGGEKK
KKKKDKKEKE QKKHRKEGKS SSGGGSSRKR KRDDGGGGGD DLPDDGEESG GSRRRQHRSR
RHSGASSSAA VGKRARRHSS RSSAASGDER GGRGGGSSKT AADDAPPKES QAALNASDSM
DLSEGELEKE RERLIAQLHD
//