ID A0A443RRW8_9ACAR Unreviewed; 864 AA.
AC A0A443RRW8;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Scm-like with four MBT domains protein 1 isoform X1 {ECO:0000313|EMBL:RWS18017.1};
GN ORFNames=B4U79_08675 {ECO:0000313|EMBL:RWS18017.1};
OS Dinothrombium tinctorium.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Acari;
OC Acariformes; Trombidiformes; Prostigmata; Anystina; Parasitengona;
OC Trombidioidea; Trombidiidae; Dinothrombium.
OX NCBI_TaxID=1965070 {ECO:0000313|EMBL:RWS18017.1, ECO:0000313|Proteomes:UP000285301};
RN [1] {ECO:0000313|EMBL:RWS18017.1, ECO:0000313|Proteomes:UP000285301}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=UoL-WK {ECO:0000313|EMBL:RWS18017.1};
RA Dong X., Chaisiri K., Xia D., Armstrong S.D., Fang Y., Donnelly M.J.,
RA Kadowaki T., McGarry J.W., Darby A.C., Makepeace B.L.;
RT "Genomes of trombidid mites reveal novel predicted allergens and laterally-
RT transferred genes associated with secondary metabolism.";
RL Gigascience 0:0-0(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RWS18017.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NCKU01000006; RWS18017.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A443RRW8; -.
DR STRING; 1965070.A0A443RRW8; -.
DR Proteomes; UP000285301; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR Gene3D; 2.30.30.140; -; 4.
DR Gene3D; 3.90.1150.190; SLED domain; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR021987; SLED.
DR InterPro; IPR038348; SLED_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF129; SAM DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF02820; MBT; 3.
DR Pfam; PF12140; SLED; 1.
DR SMART; SM00561; MBT; 3.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 3.
DR PROSITE; PS51079; MBT; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000285301};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 111..214
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 474..596
FT /note="SLED"
FT /evidence="ECO:0000259|Pfam:PF12140"
FT REGION 646..702
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 717..753
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 646..666
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 667..693
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 717..733
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 864 AA; 98763 MW; 490556E544B1670F CRC64;
MSAADDDSLD YEIAVREDAF WHIEASHSNG FEVGMKLEIE HSDHKYWIAN VTAVFGQLLS
LKWEAANDTF WFDVKNKKCY PYGHFRHQKQ YQMQPPQNVN LSEKCRVDIK TKYLDSSSEN
LALSKPQCLF NLGGVSPVEL FAAGTVVEVS HSIDPEKHWF AEIVRNVGGR LYLKWICRET
NHKSSQQSFW LFFAHPRVHH LGFAKEQGNI SYEPPYLYES WASDIQQFLF AEETDESILR
KYLLNLAKSK RPNLFEPMCF SNVKPNDKVS MIDSKTLCLM PATVKNRVAN NHLLICPRTK
DASYCYPNDD TLAVLPFFWG TDFPNLGINF QLDGDQIGNN DRNTYLHKLG GRAAPLQLAE
CKISDFTVNS KLEVVHSTEP DKICEGKVVR VTSPLIWVQI SSDTVRVLPY NSTEMFPCGW
TKNNEYPLTD LQSSSLRSKQ PENITEKKVT KDERITMHSS VPCTLIPNGS KAWCPPIYFN
HKCFTGPALS KSKICELPRS VGPGPVQLVI QEVISKIISV AYVPSRVLNE LSSKAFDELL
KKNNIKKTIS VEFKAKYQKR CYRDQITVIR SSEQVEEYCR TVCSHLKCCY NLFGTQLYDG
DECPSNCRGL TKSNKVLKRA IYYREKAAEA KLNQEGENKV RKMTLIPSSS DNENSQNQLL
CSEKKQSDES QASDSEENGK KKRKVENSES KDNQITAESK VQNEDLLNLE KVETKKENIE
SRNIEKPIKK SPASECSSKE STNGEVNAEE KDRNKTLVTK PKKSKEFKCE EETIESFDNP
LHWSVKDVYN VIKQSHCSMF ANTLLEHEID GAALLLLDNE TIRTNLLTGY TRRYSAQEVI
KLSTLIQRLK SKWFRNVGKM QCLR
//