ID A0A3Q3W3F9_MOLML Unreviewed; 832 AA.
AC A0A3Q3W3F9;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMOP00000006157.1};
OS Mola mola (Ocean sunfish) (Tetraodon mola).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Molidae; Mola.
OX NCBI_TaxID=94237 {ECO:0000313|Ensembl:ENSMMOP00000006157.1, ECO:0000313|Proteomes:UP000261620};
RN [1] {ECO:0000313|Ensembl:ENSMMOP00000006157.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3Q3W3F9; -.
DR STRING; 94237.ENSMMOP00000006157; -.
DR Ensembl; ENSMMOT00000006268.1; ENSMMOP00000006157.1; ENSMMOG00000004808.1.
DR Proteomes; UP000261620; Unplaced.
DR CDD; cd12884; SPRY_hnRNP; 1.
DR Gene3D; 2.60.120.920; -; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 1.10.720.30; SAP domain; 1.
DR InterPro; IPR001870; B30.2/SPRY.
DR InterPro; IPR043136; B30.2/SPRY_sf.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR003034; SAP_dom.
DR InterPro; IPR036361; SAP_dom_sf.
DR InterPro; IPR003877; SPRY_dom.
DR InterPro; IPR035778; SPRY_hnRNP_U.
DR PANTHER; PTHR12381; HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN U FAMILY MEMBER; 1.
DR PANTHER; PTHR12381:SF41; HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN U-LIKE PROTEIN 1; 1.
DR Pfam; PF13671; AAA_33; 1.
DR Pfam; PF02037; SAP; 1.
DR Pfam; PF00622; SPRY; 1.
DR SMART; SM00513; SAP; 1.
DR SMART; SM00449; SPRY; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF68906; SAP domain; 1.
DR PROSITE; PS50188; B302_SPRY; 1.
DR PROSITE; PS50800; SAP; 1.
PE 4: Predicted;
KW Methylation {ECO:0000256|ARBA:ARBA00022481};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000261620}.
FT DOMAIN 5..39
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT DOMAIN 207..405
FT /note="B30.2/SPRY"
FT /evidence="ECO:0000259|PROSITE:PS50188"
FT REGION 42..121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 151..222
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 613..703
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 752..783
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..77
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..113
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 151..217
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 613..631
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 669..703
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 832 AA; 95116 MW; C9C2AE0E876F1D4A CRC64;
MSVDVKKLKV NELREELQRR GLDTRGLKAD LVVRLNAALE AEAQADASEQ GEQEEQGEQG
EQEEQGEQEE QEEQEDDQPA SDSGHSLPDF TADVDIDKSD MMSEEEKKPV PALETATENK
PGEFLLVVNL KIQYETCSVD KVSTFRFVKT EDDQQEDNQG TEDQHRQDEA ARTEQVKVEA
DKGGQYSRKR PYEENRSYSY YEHREEKRSR TPQPPAEDEE ENIDDTLVTI DTYNCDLHFK
VSRDRYSGYP LTIEGFAYLW AGARASHGVT QGRVCYEMKI NEEIPVKHLP SSEPDPHVVR
IGWSINHSST QLGEEPFSFG YGGTGKKSEN CKFADFGEKF RENDVIGCYI DFDSGNEVEM
GFSKNGVWLG VAFRTSKEAL AGRALFPHVL VKNCAVEFNF GQKLQPYFPP PEGYTYIHNL
SMEDKVRGTK GPATKSDCEI LMMVGLPACG KTTWAVKYAE TNPEKKYNIL GTNAIMDKMK
VMGLRRQKNY AGRWDILIQQ ATQCLNRLIE IAARKRRNYI LDQTNVYGSA RRRKMRPFEG
FQRKAIVICP TDEDLKERTL KQTNEQGKDV PDHAVLEMKA NFTLPEPCDF LEAVTFIELQ
CDEAENLLKQ YNEEGRKAGP PPDKRFDNRQ GGFRGRGSGG YQRYDNREMS RGGYPNRSGD
AGSGYRGGYN RGSYSQNRWG NSYRDGSSEA RSGYSRNQQS GGNYNRVAPY KIIFLNFLLY
PQGYSQGYGQ GYNQGSYNHN YYSNYSQYPG YSQSYSQTPA SGQTYNHHQQ QPPQQQQQQQ
QQQQQQSYNQ QYQQYAQQWQ QYYQNQNQWN QYYSQYGSYP GQGSQGSSSG SQ
//