ID A0A4U1EMA1_MONMO Unreviewed; 555 AA.
AC A0A4U1EMA1;
DT 31-JUL-2019, integrated into UniProtKB/TrEMBL.
DT 31-JUL-2019, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Doublesex- and mab-3-related transcription factor A2 {ECO:0000256|ARBA:ARBA00034335};
DE AltName: Full=Doublesex- and mab-3-related transcription factor 5 {ECO:0000256|ARBA:ARBA00034363};
GN ORFNames=EI555_021067 {ECO:0000313|EMBL:TKC37448.1};
OS Monodon monoceros (Narwhal) (Ceratodon monodon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC Monodontidae; Monodon.
OX NCBI_TaxID=40151 {ECO:0000313|EMBL:TKC37448.1, ECO:0000313|Proteomes:UP000308365};
RN [1] {ECO:0000313|Proteomes:UP000308365}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=31054839; DOI=10.1016/j.isci.2019.03.023;
RA Westbury M.V., Petersen B., Garde E., Heide-Jorgensen M.P., Lorenzen E.D.;
RT "Narwhal Genome Reveals Long-Term Low Genetic Diversity despite Current
RT Large Abundance Size.";
RL IScience 15:592-599(2019).
CC -!- FUNCTION: May be involved in sexual development.
CC {ECO:0000256|ARBA:ARBA00034300}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00070}.
CC -!- SIMILARITY: Belongs to the DMRT family.
CC {ECO:0000256|ARBA:ARBA00006834}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TKC37448.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; RWIC01001137; TKC37448.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A4U1EMA1; -.
DR Proteomes; UP000308365; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd14418; CUE_DMA_DMRTA2; 1.
DR Gene3D; 4.10.1040.10; DM DNA-binding domain; 1.
DR InterPro; IPR001275; DM_DNA-bd.
DR InterPro; IPR036407; DM_DNA-bd_sf.
DR InterPro; IPR005173; DMA.
DR InterPro; IPR026607; DMRT.
DR InterPro; IPR046472; DMRT5_1_DMB_dom.
DR InterPro; IPR009060; UBA-like_sf.
DR PANTHER; PTHR12322; DOUBLESEX AND MAB-3 RELATED TRANSCRIPTION FACTOR DMRT; 1.
DR PANTHER; PTHR12322:SF76; DOUBLESEX- AND MAB-3-RELATED TRANSCRIPTION FACTOR A2; 1.
DR Pfam; PF00751; DM; 1.
DR Pfam; PF03474; DMA; 1.
DR Pfam; PF20624; DMRT5_DMB; 1.
DR SMART; SM00301; DM; 1.
DR SUPFAM; SSF82927; Cysteine-rich DNA binding domain, (DM domain); 1.
DR SUPFAM; SSF46934; UBA-like; 1.
DR PROSITE; PS40000; DM_1; 1.
DR PROSITE; PS50809; DM_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00070};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723, ECO:0000256|PROSITE-
KW ProRule:PRU00070};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00070}; Reference proteome {ECO:0000313|Proteomes:UP000308365};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PROSITE-ProRule:PRU00070}.
FT DOMAIN 89..136
FT /note="DM"
FT /evidence="ECO:0000259|PROSITE:PS50809"
FT DNA_BIND 89..136
FT /note="DM"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00070"
FT REGION 222..335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..265
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 555 AA; 55015 MW; E8F7A1DAAEFE867C CRC64;
MRSSPQVSPL FPRCSTPGPA MELRSELPSV PGAATAAATT TGPPVASVAS VAAAAAAAAS
LPVSVAGGLL RAPPLLLRAA EKYPRTPKCA RCRNHGVVSA LKGHKRYCRW KDCLCAKCTL
IAERQRVMAA QVALRRQQAQ EENEARELQL LYGTAEGLAL AAANGIIPPR PAYEVFGSVC
AADGGGPGAG APAGTGGGAA GAGSSEAKLQ KFDVFPKTLL QAGRAGSPQP PPGKPLSPDG
ADSGPGTSSP EVRAGSGSEN GDGESFSGSP LARASKEAGG SCPGSAGPGG GAEEDSPGSA
SPLGSESGSE VDKEEAEAAP APGLGGGPGP RQRTPLDILT RVFPGHRRGV LELVLQGCGG
DVVQAIEQVL NHHRGGLAAG LGPAVPPDKA AGGAVVAADD AWPGRVDAAA AGGPGLPAPL
QAGPAAPPHH RPLLAGAMAP GALGSLSSRS AFSPLQPNAS HFGADAGAYP LGAPLGLSPL
RLAYSAAAAH SRGLAFMAPY STAGLVPTLG FRPPMDYAFS DLMRDRSAAA AAVHKEPTYG
GGLYGPMVNG APEKQ
//