ID A0A452H7X8_9SAUR Unreviewed; 1386 AA.
AC A0A452H7X8;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=RHD domain-containing protein {ECO:0000259|PROSITE:PS50254};
OS Gopherus agassizii (Agassiz's desert tortoise).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC Testudinoidea; Testudinidae; Gopherus.
OX NCBI_TaxID=38772 {ECO:0000313|Ensembl:ENSGAGP00000010823.1, ECO:0000313|Proteomes:UP000291020};
RN [1] {ECO:0000313|Proteomes:UP000291020}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=28562605;
RA Tollis M., DeNardo D.F., Cornelius J.A., Dolby G.A., Edwards T.,
RA Henen B.T., Karl A.E., Murphy R.W., Kusumi K.;
RT "The Agassiz's desert tortoise genome provides a resource for the
RT conservation of a threatened species.";
RL PLoS ONE 12:e0177708-e0177708(2017).
RN [2] {ECO:0000313|Ensembl:ENSGAGP00000010823.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 38772.ENSGAGP00000010823; -.
DR Ensembl; ENSGAGT00000012400.1; ENSGAGP00000010823.1; ENSGAGG00000008420.1.
DR Proteomes; UP000291020; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProt.
DR GO; GO:0043229; C:intracellular organelle; IEA:UniProt.
DR GO; GO:0043227; C:membrane-bounded organelle; IEA:UniProt.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd07882; RHD-n_TonEBP; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.60.40.340; Rel homology domain (RHD), DNA-binding domain; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR002909; IPT_dom.
DR InterPro; IPR008366; NFAT.
DR InterPro; IPR015646; NFAT5_RHD_DNA-bd.
DR InterPro; IPR008967; p53-like_TF_DNA-bd_sf.
DR InterPro; IPR032397; RHD_dimer.
DR InterPro; IPR011539; RHD_DNA_bind_dom.
DR InterPro; IPR037059; RHD_DNA_bind_dom_sf.
DR PANTHER; PTHR12533; NFAT; 1.
DR PANTHER; PTHR12533:SF10; NUCLEAR FACTOR OF ACTIVATED T-CELLS 5; 1.
DR Pfam; PF16179; RHD_dimer; 1.
DR Pfam; PF00554; RHD_DNA_bind; 1.
DR PRINTS; PR01789; NUCFACTORATC.
DR SMART; SM00429; IPT; 1.
DR SUPFAM; SSF81296; E set domains; 1.
DR SUPFAM; SSF49417; p53-like transcription factors; 1.
DR PROSITE; PS50254; REL_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000291020}.
FT DOMAIN 216..395
FT /note="RHD"
FT /evidence="ECO:0000259|PROSITE:PS50254"
FT REGION 1..52
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 135..172
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 193..229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 900..940
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1039..1062
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1250..1280
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 21..52
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 193..207
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1386 AA; 149451 MW; 177367DB9F665783 CRC64;
MSQTSGGEAG SPPPAVVAAD ASSAPSSSMG GACSSFTTSS SPTIYSTSVT DSKAMQVESC
SSAVGVSNRG VSEKQLTSNT VQQQQSMPKR HTVLYISPPP EDLLDNSQMS CQDEGCGLES
EQSCIMWMED SPSNFSNMST SSYNDNTEVP RKSRKRNPKQ RPGIKRRDCE GSSMDIFDAD
SAKAPHYVLS QLSTDSKGNS KAGNGASENQ KGAGGKKSPM LSGQYPTKSE GKELKIVVQP
ETQHRARYLT EGSRGSVKDR TQQGFPTVKL EGHNEPVVLQ VFVGNDSGRV KPHGFYQACR
VTGRNTTPCK EVDIEGTTVI EVGLDPSNNM TLAVDCVGIL KLRNADVEAR IGIAGSKKKS
TRARLVFRVN ITQKDGSTLT LQTPSSPILC TQPAGVPEIL KKSLHSCSVK GEEEVFLIGK
NFLKGTKVIF QENISDENSW KAEAEIDMEL FHQNHLIVKV PPYHDQKITS SVSVGIYVVT
NAGRSHDVQS FTYTPDTSGT LNVNVKKEIS SPAQPCSFEE AIKAVGATGC NLDKVNILPS
ALITPLMPTS VIKNEDVAPM EVTAEKRSPT IFKATKVVGP TQQTLENMSS ISGNGIFSTA
ASHLPSECEK QQQIQPKVYN PETLTTIQTQ DISQPGSCPA VSAPSQLQNS DALLQQAAQF
QTRESQSREV LQSDSTVVTL SQLTEASQQQ QSTLSEPAQT LQQQISSSIF SPANSVSQLQ
NTIQQLQAGN FPASTASGSS GDVDLVQQVL EAEQQLSSVL FSGSDSSEDV QEQLSADIFQ
QVGQIQTRVT PGIFSSSETA VHSRQENLLS GRAENVHPQP ENSLSNQQQQ QQAMETSAAM
VIGMQQSMCQ AATQMQSDLF SSAASGNGNL QQSPVYQQAS HLLSGLSTSE DMQMQCELFS
SSSGVSGNET TTAQQQVSTN GSTMFQTSSS ADGKEASGQN KQMQNNVFQT MVQMQHSGES
QPQVNLFSST ENMMAVQAGG TQQQGAGLFQ QGGEIMSLQS GSFMQQSPHS QAQLFHSQNP
IGDAQNISQE RQGSIFHSPN SIVHNQTTSS SSSDQLQPPM FHSQNTMGVL QSSSVPQDQQ
SANMFLSQNS MNNPVTQEEQ MSFFTTQNSI SPLQTATNTE QQPSFQQQAQ IPHIQNPMIP
QDQPQAQPTQ QSLFQPQVSL GSLQSSTMPQ NQQGAIFQSQ HSMVAIQSSP PSQEQQQQQQ
NMMFSTQNTS TVASQKQTMI FNPNQNPITN QEQQNQSLFH AQSNMAPMNQ DQQPMQFQSQ
TTVTPLQNPG SSQPETQQPT MFHNSPQIQL VQGSPGSQEQ QVTLFISSAS MSALQNSMSQ
QELQQSPIYS SQNNMTGIQG AASPAQQQSS VFHNTTGSAI NQLQNSPASS QQTSGIFLFG
IQNSKC
//