ID A0A2G8JUL4_STIJA Unreviewed; 680 AA.
AC A0A2G8JUL4;
DT 31-JAN-2018, integrated into UniProtKB/TrEMBL.
DT 31-JAN-2018, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE SubName: Full=Putative nucleolar transcription factor 1-A isoform X3 {ECO:0000313|EMBL:PIK39428.1};
GN ORFNames=BSL78_23728 {ECO:0000313|EMBL:PIK39428.1};
OS Stichopus japonicus (Sea cucumber).
OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; Holothuroidea;
OC Aspidochirotacea; Aspidochirotida; Stichopodidae; Apostichopus.
OX NCBI_TaxID=307972 {ECO:0000313|EMBL:PIK39428.1, ECO:0000313|Proteomes:UP000230750};
RN [1] {ECO:0000313|EMBL:PIK39428.1, ECO:0000313|Proteomes:UP000230750}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Shaxun {ECO:0000313|EMBL:PIK39428.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:PIK39428.1};
RX PubMed=29023486;
RA Zhang X., Sun L., Yuan J., Sun Y., Gao Y., Zhang L., Li S., Dai H.,
RA Hamel J.F., Liu C., Yu Y., Liu S., Lin W., Guo K., Jin S., Xu P.,
RA Storey K.B., Huan P., Zhang T., Zhou Y., Zhang J., Lin C., Li X., Xing L.,
RA Huo D., Sun M., Wang L., Mercier A., Li F., Yang H., Xiang J.;
RT "The sea cucumber genome provides insights into morphological evolution and
RT visceral regeneration.";
RL PLoS Biol. 15:E2003790-E2003790(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PIK39428.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MRZV01001241; PIK39428.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2G8JUL4; -.
DR STRING; 307972.A0A2G8JUL4; -.
DR Proteomes; UP000230750; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21998; HMG-box_UBF1_rpt1-like; 1.
DR CDD; cd21999; HMG-box_UBF1_rpt2; 1.
DR CDD; cd22000; HMG-box_UBF1_rpt3; 1.
DR CDD; cd22003; HMG-box_UBF1_rpt6-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 5.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR46318; UPSTREAM BINDING TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR46318:SF3; UPSTREAM BINDING TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00505; HMG_box; 2.
DR SMART; SM00398; HMG; 4.
DR SUPFAM; SSF47095; HMG-box; 4.
DR PROSITE; PS50118; HMG_BOX_2; 4.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000230750}.
FT DOMAIN 142..210
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 244..291
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 321..385
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 522..588
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 142..210
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT DNA_BIND 244..291
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT DNA_BIND 321..385
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT DNA_BIND 522..588
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 399..457
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 601..680
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 430..457
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 626..661
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 680 AA; 79336 MW; A02F5CECB81471B8 CRC64;
MQQRKVNYFR ARNALGRVFN AFNTLPGRKR SHATDAGEEG AAKVQKVDTP FKMDDRVTLV
SNLKLQLPKR DTRSYKSTIS KVNWEKVAFK QFSPKECQQE WELIQADIRK YKNATEIVTE
ALDMVNNTTY LPKLSALYPN MPKKPLTPFF QYYSAKHKSV KEKNPSFSQT KISQVLSEKY
KTLPEKKKQK YVTKYAAEMK DYDVKKTEFY TEHPELKQSK KGEGSRSPIC PTPYRWWLAK
QLMDRPEVPR SEVESELRAK WKALSDKRRL KWIHMSLQKK DEYEQAIQEY SKTHPNFKPN
SKPLLTKSEK ALYDKQQGKP TRPPMNGYSL FCSHFMKLKN ELTSTQRMQA CSKEWKKLSD
KDRMEYNTRT VQAKKEYFEN LEKYVLTLPP EEAAKYQSEL AASKVPSKSR NKYEERAKEM
NALSSGSEAD SSSDEDDDSE DEDDNEEEEE EEEEEDDVMA RNLWIEHNLH SYMQVHKLKE
ARAKKSLRLA WENLEKARKV PWKKQAQQVT NKMQQLLAEM PKKPHTSAYG IFTSEMMRNQ
DIVKLDQKNK MKAVAEKWKN ISKTEKRKYE AKKDRLWKKH EKDVEKYKKS LSSTDQELLE
KVLTNPIPKT KQKVAKASTD TKDGESSEDS DSEESEEEEE DDEGSSGDDE SDEDGEKGTE
ESAEEEATVQ EKRSSRAIIG
//