ID G3VA14_SARHA Unreviewed; 789 AA.
AC G3VA14;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 66.
DE RecName: Full=Chromo domain-containing protein {ECO:0000259|PROSITE:PS50013};
GN Name=MPHOSPH8 {ECO:0000313|Ensembl:ENSSHAP00000000018.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000000018.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000000018.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000000018.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3VA14; -.
DR STRING; 9305.ENSSHAP00000000018; -.
DR Ensembl; ENSSHAT00000000020.2; ENSSHAP00000000018.2; ENSSHAG00000000020.2.
DR eggNOG; KOG0504; Eukaryota.
DR eggNOG; KOG1911; Eukaryota.
DR GeneTree; ENSGT00730000111087; -.
DR TreeFam; TF106394; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR CDD; cd18633; CD_MMP8; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR023779; Chromodomain_CS.
DR PANTHER; PTHR24184:SF28; CHROMO DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24184; SI:CH211-189E2.2; 1.
DR Pfam; PF00023; Ank; 1.
DR Pfam; PF12796; Ank_2; 1.
DR Pfam; PF00385; Chromo; 1.
DR SMART; SM00248; ANK; 4.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 2.
DR PROSITE; PS50088; ANK_REPEAT; 3.
DR PROSITE; PS00598; CHROMO_1; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 63..113
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REPEAT 597..629
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 630..662
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 663..695
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REGION 16..62
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 141..197
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 211..442
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 765..789
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 37..52
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 165..191
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 211..262
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 283..332
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 340..378
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..442
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 775..789
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 789 AA; 89661 MW; 04ABD07D1B4E8BCB CRC64;
MAALASAGGG AVSVAAAAAS GSDSCSEDAS GGVGEEAGGL EKDDGGRAGE TVGESEEDEE
DVFEVEKILD VKTEAGKILY KVRWKGYTSD DDTWEPEVHL EDCKEVLLEF RKKIFDGKNK
PIKKDLQRLV LNDDDIFEAE SDSDWQSETK EDISPKKKKK KSRHREDKSP DDLKKRKWKS
GKVKDKNKAQ LETSSENLVF DSKSKKRILE SKEDSKEYKK TKKDDLKEAK KVKKGEIRDS
KGKVRDDFKE SKKKRERLSD SLLESESSTF DDSLSQLADD DSEDLPFDNK GDKQKFKSGK
DKLELDIIQD VISDKQPDDT ASAEEDADSK TKRKKKKFKK VEEYKEEIKK AENKDPYSEK
KNLYKKQKNQ EKVKSSIEID KLAPTPAQIQ KSTKLGSEER GRRSTDSIGE EKETRKNEVK
EKYQKRYDSD KEEKGKKEQK GIKTYKEMRN AFDLFTLTPE EKDYSENNRK REETFTEDYR
TKENKQLYKE RRSTRDETDT WAYIAAEGDQ EVLDNVCQMD ENSDRQQVLS LGMDLQLEWM
KLEDFQKHLN GEDETFTTAD AIPSNLLRDA VKNGDYVTVK IALNSNEDYN LDQEDSSGMT
LVMLAAAGGQ DDLLRLLIKK GAKVNGRQKN GTTALIHAAE KNFLTTVAIL LEAGAFVNVQ
QSNGETALMK ACKRGNSDIV RLVIECGADC NILSKHQNSA LHFAKQCNNV LVYDLLKSHL
ETLSRVAEET IRDYFEARLA LLEPVFPIAC HRLCEGPDFS TDFNYKPPQN IPEGKKPVNK
KRLKVKENR
//