ID A0A1D2MHC5_ORCCI Unreviewed; 393 AA.
AC A0A1D2MHC5;
DT 30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Chromobox protein 6 {ECO:0000313|EMBL:ODM92400.1};
GN ORFNames=Ocin01_14279 {ECO:0000313|EMBL:ODM92400.1};
OS Orchesella cincta (Springtail) (Podura cincta).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC Entomobryomorpha; Entomobryoidea; Orchesellidae; Orchesellinae; Orchesella.
OX NCBI_TaxID=48709 {ECO:0000313|EMBL:ODM92400.1, ECO:0000313|Proteomes:UP000094527};
RN [1] {ECO:0000313|EMBL:ODM92400.1, ECO:0000313|Proteomes:UP000094527}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Mixed pool {ECO:0000313|EMBL:ODM92400.1};
RX PubMed=27289101; DOI=10.1093/gbe/evw134;
RA Faddeeva-Vakhrusheva A., Derks M.F., Anvar S.Y., Agamennone V., Suring W.,
RA Smit S., van Straalen N.M., Roelofs D.;
RT "Gene Family Evolution Reflects Adaptation to Soil Environmental Stressors
RT in the Genome of the Collembolan Orchesella cincta.";
RL Genome Biol. Evol. 8:2106-2117(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ODM92400.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LJIJ01001241; ODM92400.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1D2MHC5; -.
DR STRING; 48709.A0A1D2MHC5; -.
DR OrthoDB; 75895at2759; -.
DR Proteomes; UP000094527; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProt.
DR CDD; cd00024; CD_CSD; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR PANTHER; PTHR22812; CHROMOBOX PROTEIN; 1.
DR Pfam; PF00385; Chromo; 1.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000094527}.
FT DOMAIN 133..180
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 6..94
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 190..260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 15..45
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 70..89
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..205
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..260
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 393 AA; 43418 MW; ADA1890C4AA8ED86 CRC64;
MCCNIQTFSL TGEDDTEDSV TRSAETSRAP DGKGGDSEDV EVDGEGTSRS IAIPGPSKPR
ASATNQKARK RTGQGRKAGG KKRKVTKSTA RMRSNASVRM KIKTIESKKL SMDEMKNEDH
PGHVVNAAGE NLYNVERILK KSVAKGVVWY RIKWEGCDKT KNCWKRAENL GNCSVVAEFE
LNEERKKLAK LARRGKGKKG KSRPSSKRTS PPNQPAASDK SEPQPPFVAD PNDPDPNDPD
PNDPDPNDPD PNNDPDLPID EANENAEAAE LEQETNIIMG RTFNTLFEFK GEEGTLFRRE
WNQNEVPVSV VGSTIVDGTI VYLVRIRNHD ADGPVGKPLD FSDCVMVLPT ISLLNFNNKL
MEYYEHRLGP NGLGLPIIAA AKEEILRRCA MMQ
//