ID A0A267FNG9_9PLAT Unreviewed; 722 AA.
AC A0A267FNG9;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=Nuclear receptor domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BOX15_Mlig000893g4 {ECO:0000313|EMBL:PAA74667.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA74667.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA74667.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA74667.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA74667.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA74667.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01000930; PAA74667.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A267FNG9; -.
DR STRING; 282301.A0A267FNG9; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd07179; 2DBD_NR_DBD2; 1.
DR Gene3D; 3.30.50.10; Erythroid Transcription Factor GATA-1, subunit A; 2.
DR InterPro; IPR013087; Znf_C2H2_type.
DR InterPro; IPR001628; Znf_hrmn_rcpt.
DR InterPro; IPR013088; Znf_NHR/GATA.
DR PANTHER; PTHR48092; KNIRPS-RELATED PROTEIN-RELATED; 1.
DR Pfam; PF00105; zf-C4; 3.
DR PRINTS; PR00047; STROIDFINGER.
DR SMART; SM00399; ZnF_C4; 2.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 3.
DR PROSITE; PS00031; NUCLEAR_REC_DBD_1; 1.
DR PROSITE; PS51030; NUCLEAR_REC_DBD_2; 2.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Reference proteome {ECO:0000313|Proteomes:UP000215902};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00042}.
FT DOMAIN 239..343
FT /note="Nuclear receptor"
FT /evidence="ECO:0000259|PROSITE:PS51030"
FT DOMAIN 257..287
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 346..423
FT /note="Nuclear receptor"
FT /evidence="ECO:0000259|PROSITE:PS51030"
FT REGION 202..231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 470..547
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 202..226
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 480..507
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 532..547
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 722 AA; 78471 MW; DA2447AA997EF7F6 CRC64;
MMTACSPVQL NFAKSAPTMA ALEVECGAGQ TSASCSTVCQ QHNEVTKLEA AVDSDWTMTG
SVAPASYLMQ AYVSQSPAAD FYGQYYCQAP QPNLQQQQQQ QHSQVYFDPT YHPVYSDYHH
HQHQQHQDPF QQQQQQHQFV DFPGGASTDV QQQMFLPPVY QVEQQQQQQQ QQQFLQQMQP
PIIDTKNCHQ FKEVDRDVRV ACSSSSSSSS SSSSSSSSSG GGGSGRRSNS AAVAAHSGRH
ACDVCGDTAA GFHCGAFVCE ACKKFFIRSS RAEQQQQLLL LHHGDSAVSS AGGPSGAVKY
ACSKSGSCDI TKETRTHCQF CRYKKCLALR MFPPGKNPAI VASIDEIPCR VCGASSSGFH
FGAITCEGCK GFFRRTIKER DSGKYVCSKG GGCEINKSSR NACKSCRFTK CIRAGMSSEG
SRIGRQPNAV KHMCAQEIQA IKTKRRRLLS GNIFDDDETK KQPQHQLQTF WPETPQPLPP
HHHQQQQQPS SCHSAAANDT SGASDSPREL SELVLPPPPQ QLDCRPTYKS THDPAAAEDP
DATDTEHQAR FRVAAAVGLE VLCDALASTT DSLNGFANAV GRFLETLSGV SDGERAAVTT
GCCGFGGALA AALCALPAAA VGERPVEAAW PNQQLQERAR RLRKRLGEAQ FDRVELGLLC
AAAAGDDSHF PHHQQSVGQR ARASLRAYQT EKYETSRSGM MEWTCNLLQH LQDLSVELSA
AV
//