ID G3VJL5_SARHA Unreviewed; 1737 AA.
AC G3VJL5;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 61.
DE SubName: Full=Methyl-CpG binding domain protein 5 {ECO:0000313|Ensembl:ENSSHAP00000003370.2};
GN Name=MBD5 {ECO:0000313|Ensembl:ENSSHAP00000003370.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000003370.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000003370.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000003370.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9305.ENSSHAP00000003370; -.
DR Ensembl; ENSSHAT00000003406.2; ENSSHAP00000003370.2; ENSSHAG00000002961.2.
DR eggNOG; ENOG502QTC7; Eukaryota.
DR GeneTree; ENSGT00530000064137; -.
DR InParanoid; G3VJL5; -.
DR TreeFam; TF106391; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0030496; C:midbody; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR Gene3D; 2.30.30.140; -; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR000313; PWWP_dom.
DR PANTHER; PTHR16112; METHYL-CPG BINDING PROTEIN, DROSOPHILA; 1.
DR PANTHER; PTHR16112:SF18; METHYL-CPG-BINDING DOMAIN PROTEIN 5; 1.
DR SMART; SM00391; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 10..80
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 1623..1661
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 191..283
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 326..349
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 449..521
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 593..641
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1026..1069
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1585..1611
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..211
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 221..235
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 246..270
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 329..345
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 454..470
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 496..512
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 593..635
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1737 AA; 184420 MW; 70F15B1D1D29E2D8 CRC64;
MNGGKECDGG DKDGGLPVQV PVGWQRRVDQ NGVLYISPSG SLLSCLEQVK TYLLTDGTCK
CGLECPLILP KVFNFDPGAA VKQRTAEDVK ADEDVTKLCI HKRKIIAVAT LHKSMEAPHP
SLVLTSPGGG TSATPIVTSR AATPRSMRTK SHEGITNSVM PECKNPFKLM IGTSNTMGRL
YVQELTGSQQ QELHSAYPRQ RLGSSEHGQK SPYRGSHGGL PSPASSGSQI YGDGSISPRT
DPLGSPDVFT RSNTGFHGAP NSSPIHMNRT PLSPPSVMLH GSPVQSSCAM AGRTNIPLSP
TLTTKSPVMK KPMCNFSANM EIPRAMFHHK PPQGPPPPPP PSCALQKKPL TSEKDPLGIL
DPIPSKPVNQ NPILINATNF HSNVHSQVPV MNVSMPPAVV PLPSNLPLPT VKPGHMNHGS
HVQRVQHSAS TSLSPSPVTS PVHMIGTGIG RIEASPQRSR SSSTSSDHGN FMMPPVGPQS
TCGGIKCPPR SPRSTIGSPR PSMPSSPSTK PDGLHQYKDI PNPLIAGMSN VLNAPSNAAF
PAASAGSGPL KSQPGLLGMP LNQILNQHNA ASFPASSLLS AAAKAQLANQ NKLAGNNNSS
SSNSGAVANS GNTEGHSTLN TMFPPTANML LPTSEGQSGR AALRDKLMSQ QKESMRKRKQ
PTTTVLSLLR QSHMDSPAVP KPGPDLIRKQ SQGSFPINSM SQLLQSMSCQ SSHVSSNSST
GCGSSNPALP CSANQLHFTD TNINSSVLQN SLTQNLPLRG EAMHCQNANT NFMHGSSPGP
NHHLAGLINQ IQASGNCGML SQPGMALGNS LHPNPPQSRI SASSTSMIPN SIVSSCNQTS
SDAGGSGPSS SIAIAGTNQP AITKTTSVLQ DGVIVTTAAG TPLQSQLPIG SDFPFVGQEH
VLHFPQNSPS NNNLPHPLNP SLLSSLPISL PVNQQHLLNQ NLLNILQPSA GEGKSEVNLN
PLGFLNPNVN AALAFLSTDM DGQVLQPVHF QLLAALLQNQ AQAAAMLPLP SFNLTISDLL
QQQNNPLPSL TQMTTPPDHL PSSQSDSNRA ETLLTNPLGN PLPSFSGSDT TSNPLLLPAV
TGASGLMALN PQLLGGVLNS ASGNTANHPE VSIATSSQAT TTTTTTSSAV AALTVSTLGG
TAVVSMAETL LNISNNAGNT PGPTKLNSNS VVPQLLNPLL GTGLLGDMSS INNTLNNHQL
THLQSLLNSN QMFPSNQQQQ QLLHGYQNLQ TFQGQPTIPG SGNSSNPMAC LFQNFQVRMQ
EDAALLNKRM STQSGLTALP ENSNSTLPPF QDTACELQQR TEPTLGQQAK DSLNVTGQGD
ASVDAIYKAV VDAASKGMQV VITTAVSSTT QISPIPALSA MSAFTASIGD PLNLSSAVSA
VIHGRNIGGA DHDSRLRNVR GTRLPKNLEH GKNASEGDGF EYFKSAGCNT PKKLWEEEQS
PGGEINRWKC EEFLDHSTHI HSSPCHERPN NVSTLPLLTG EQHPILLPQR NCQGDKMLEE
NFRYNNYKRT MMSFKERLEN TVERCAHING NRPQQSRGFG ELLSASKQDL LMEEQSPSSS
NSLESSLVKD YIHFNGDFNA KSINGCVPSP SDAKSISSED DLRNPESPSS NELIHYRPRT
FNVGDLVWGQ IKGLTSWPGK LVREEEVHNS CQQNTEEGKA YYKMALPAIS MLLWETYPEF
PQGDSIAVEV KLLPGLGKAA CHSQGLEQHG HQQEGRGRQE WACGGRGLYR EQRGRGL
//