ID A0A401P982_SCYTO Unreviewed; 423 AA.
AC A0A401P982;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=DM domain-containing protein {ECO:0000259|PROSITE:PS50809};
GN ORFNames=scyTo_0001096 {ECO:0000313|EMBL:GCB69644.1};
OS Scyliorhinus torazame (Cloudy catshark).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC Elasmobranchii; Galeomorphii; Galeoidea; Carcharhiniformes; Scyliorhinidae;
OC Scyliorhinus.
OX NCBI_TaxID=75743 {ECO:0000313|EMBL:GCB69644.1, ECO:0000313|Proteomes:UP000288216};
RN [1] {ECO:0000313|EMBL:GCB69644.1, ECO:0000313|Proteomes:UP000288216}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=30297745; DOI=10.1038/s41559-018-0673-5;
RA Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA Kuratani S, Sato K, Hyodo S, Kuraku S;
RT "Shark genomes provide insights into elasmobranch evolution and the origin
RT of vertebrates.";
RL Nat. Ecol. Evol. 2:1761-1771(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00070}.
CC -!- SIMILARITY: Belongs to the DMRT family.
CC {ECO:0000256|ARBA:ARBA00006834}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GCB69644.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFAA01000233; GCB69644.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A401P982; -.
DR STRING; 75743.A0A401P982; -.
DR OMA; ELQFMYT; -.
DR Proteomes; UP000288216; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd14417; CUE_DMA_DMRTA1; 1.
DR Gene3D; 4.10.1040.10; DM DNA-binding domain; 1.
DR InterPro; IPR001275; DM_DNA-bd.
DR InterPro; IPR036407; DM_DNA-bd_sf.
DR InterPro; IPR005173; DMA.
DR InterPro; IPR026607; DMRT.
DR InterPro; IPR046472; DMRT5_1_DMB_dom.
DR InterPro; IPR009060; UBA-like_sf.
DR PANTHER; PTHR12322; DOUBLESEX AND MAB-3 RELATED TRANSCRIPTION FACTOR DMRT; 1.
DR PANTHER; PTHR12322:SF71; DOUBLESEX- AND MAB-3-RELATED TRANSCRIPTION FACTOR A1; 1.
DR Pfam; PF00751; DM; 1.
DR Pfam; PF03474; DMA; 1.
DR Pfam; PF20624; DMRT5_DMB; 1.
DR SMART; SM00301; DM; 1.
DR SUPFAM; SSF82927; Cysteine-rich DNA binding domain, (DM domain); 1.
DR SUPFAM; SSF46934; UBA-like; 1.
DR PROSITE; PS40000; DM_1; 1.
DR PROSITE; PS50809; DM_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00070};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723, ECO:0000256|PROSITE-
KW ProRule:PRU00070};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00070}; Reference proteome {ECO:0000313|Proteomes:UP000288216};
KW Signal {ECO:0000256|SAM:SignalP};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PROSITE-ProRule:PRU00070}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..423
FT /note="DM domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019427659"
FT DOMAIN 37..84
FT /note="DM"
FT /evidence="ECO:0000259|PROSITE:PS50809"
FT DNA_BIND 37..84
FT /note="DM"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00070"
FT REGION 165..229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..205
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 423 AA; 45017 MW; 6E8647159AF68C85 CRC64;
MAGSGSLLRP PALLLRAAAA AAAAASAAER YPRTPKCARC RNHGVVSALK GHKRFCRWRD
CMCAKCMLIA ERQRVMAAQV ALRRQQAQEE NEARELQFIY AGGGAAEAGL AMAAAAAAAN
SIIPGSRAVS TPYEIFGAEY QNHKDGQKSP KYELFYNGLV SRSMLQPPHS LAPETEGSDT
NLRDKAGNAE ASENESAQLS LSPDPPSEGA DSPGSMSPSD GESGNECEKP KELAKVMANL
PGSSSNHRAP IDILTKVFPS HKRDKLVCIL AGCKGNVVQA IEQVLNGKEA KAQVKDVECT
APEPGELQRP SHFTFAGLSG GSLGTKSAFS PLQSSAVFGG PANLYGLNPR LGVNPLRLAY
SNPGRGLPTF MSPYVTSGLM PTLPFRPQID YSFPGIVRDI PYFQNKEALC SGGLYSRINP
EKQ
//