ID K7GFW3_PELSI Unreviewed; 1180 AA.
AC K7GFW3;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE SubName: Full=MIS18 binding protein 1 {ECO:0000313|Ensembl:ENSPSIP00000019174.1};
GN Name=MIS18BP1 {ECO:0000313|Ensembl:ENSPSIP00000019174.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000019174.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000019174.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01069308; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01069309; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01069310; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01069311; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01069312; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01069313; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01069314; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; K7GFW3; -.
DR STRING; 13735.ENSPSIP00000019174; -.
DR Ensembl; ENSPSIT00000019264.1; ENSPSIP00000019174.1; ENSPSIG00000017031.1.
DR eggNOG; ENOG502QRUS; Eukaryota.
DR GeneTree; ENSGT00390000007395; -.
DR HOGENOM; CLU_009019_1_0_1; -.
DR OMA; CHSNCQN; -.
DR TreeFam; TF106401; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR CDD; cd00167; SANT; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR039110; KNL2-like.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR015216; SANTA.
DR PANTHER; PTHR16124; MIS18-BINDING PROTEIN 1; 1.
DR PANTHER; PTHR16124:SF3; MIS18-BINDING PROTEIN 1; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR Pfam; PF09133; SANTA; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50090; MYB_LIKE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007267}.
FT DOMAIN 937..979
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT REGION 158..180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 306..340
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 373..399
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 617..679
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 753..920
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 981..1012
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 620..642
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 649..679
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 758..821
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 873..920
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 981..1010
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1180 AA; 133624 MW; 30949F460993D104 CRC64;
MTAAPVKNTH ILKSPISSGR KGAMPLQAVF MSNIPSGLCD FKENSARGTG TFKKREGIFQ
STLITEDTYA KEFLDLSEIR PVSDTVAMQV ASCPQQQILP LKDKNVLKRK ACDPLTRESP
AKIFQRMKAK VSHEKQHPVP YKGKLLETNV NSDLILTPAT NPAHQGRWDK KVSDEDNHQR
DVKLPDKQIQ LRKALHWTTV LQNKNVDSLS VVSESPQKFI LRMKQKVQTQ LQGPAMSNQI
EQSISSRIAN SPLIKSDFAK QVNNFNGEST VNNISRSQDD IFLVEPIDAD DEMSQNTVVD
TVNRNPNPSK TRVQLSERHG SGETIYASPH REGGPLQKSD WKTAQAIEKI SDTDPQRPTQ
CLCNIMFSSP KVHIPRKQKP KEGDCKVLSS TSADKNDGNT HKQQKICLSD WRIKVINNNT
AVCVEGKRID MKELCWHSNA IVERVAYNQV KTISGSIYLL QGNIEPVSMR KDGFPCKFIN
RFKCGFPKLW KQYIENFLEE LKSKEQDTDE AGNEKISSMD AVEVEEELMG DLKKQPITQN
TTYEVALNNE NRYLTPKCHP VQNDPDASYS RSGRRIKPPL NYWCGEREFV DNKLNVTMEE
GGKNYLSLVC SNERSKKKTI SSFPNSREHT AEKSEGKTKS QCKGKIYVKR ANSKREIGPS
DKRDSRRFVS DPDESDYEAE LNNDKRAVVT LTPLKHKKVC ENKLKYNSWT TEKSAAQSIS
KYGNETRNYK TNSGRELKTC QYSLRSQKHF CQDTLSTEDS SSKDEEDSNE DIPLSVKRKT
KPSLEREIYK SSSDARSSQN ETKKKSFEQR KTEDSAATFS HDRQLKIDLS GQKNQSEKEP
QGKAPAGGPS SAPLTDRRVS TRKANINPPK YVFESDSEEE QADREFQRKE KKSKVSVKIH
DHKIINSAKS SAVKSKESDR REMKNFFEPF PGATEDWTEK ELQKLQRAVA SFPKHKNGFW
VDVAMALGTR SAEECQQKYM EDHQTKGSKK AATKTALDKK APKDADKKQP VITARVGTLK
RKQQMRDFLE HLPKDDHDDI FTATPLQNCR VKLPTFWESQ DDDVFQLMDN NPITPSSAVF
PLVKTPQCDH ISPGMLGSIN RRDYDKYVFR MQKNSKDKKG VWSNIKKKSA GTVFTTPTCR
RTAFAFDQGA ANNPVIGKLF AGDAAAPSDE EEQEDSYFST
//