ID A0A1D2NIS3_ORCCI Unreviewed; 861 AA.
AC A0A1D2NIS3;
DT 30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Polyhomeotic-like protein 2 {ECO:0000313|EMBL:ODN04866.1};
GN ORFNames=Ocin01_01861 {ECO:0000313|EMBL:ODN04866.1};
OS Orchesella cincta (Springtail) (Podura cincta).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC Entomobryomorpha; Entomobryoidea; Orchesellidae; Orchesellinae; Orchesella.
OX NCBI_TaxID=48709 {ECO:0000313|EMBL:ODN04866.1, ECO:0000313|Proteomes:UP000094527};
RN [1] {ECO:0000313|EMBL:ODN04866.1, ECO:0000313|Proteomes:UP000094527}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Mixed pool {ECO:0000313|EMBL:ODN04866.1};
RX PubMed=27289101; DOI=10.1093/gbe/evw134;
RA Faddeeva-Vakhrusheva A., Derks M.F., Anvar S.Y., Agamennone V., Suring W.,
RA Smit S., van Straalen N.M., Roelofs D.;
RT "Gene Family Evolution Reflects Adaptation to Soil Environmental Stressors
RT in the Genome of the Collembolan Orchesella cincta.";
RL Genome Biol. Evol. 8:2106-2117(2016).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ODN04866.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LJIJ01000035; ODN04866.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1D2NIS3; -.
DR STRING; 48709.A0A1D2NIS3; -.
DR OMA; CAKIELI; -.
DR OrthoDB; 5399754at2759; -.
DR Proteomes; UP000094527; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0032991; C:protein-containing complex; IEA:UniProt.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd09577; SAM_Ph1_2_3; 1.
DR Gene3D; 3.30.60.160; -; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR012313; Znf_FCS.
DR InterPro; IPR038603; Znf_FCS_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF138; POLYHOMEOTIC DISTAL, ISOFORM A-RELATED; 1.
DR Pfam; PF00536; SAM_1; 1.
DR Pfam; PF21319; zf-FCS_1; 1.
DR SMART; SM00454; SAM; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR PROSITE; PS50105; SAM_DOMAIN; 1.
DR PROSITE; PS51024; ZF_FCS; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000094527};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00367}.
FT DOMAIN 670..704
FT /note="FCS-type"
FT /evidence="ECO:0000259|PROSITE:PS51024"
FT DOMAIN 784..848
FT /note="SAM"
FT /evidence="ECO:0000259|PROSITE:PS50105"
FT REGION 287..307
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 348..450
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 457..476
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 512..631
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 709..778
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 348..426
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 436..450
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 520..555
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 575..599
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 600..627
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 709..734
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 735..750
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 751..774
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 861 AA; 93023 MW; 8905F842A7352C71 CRC64;
MASQTHSGGT ATSVAQQQQG VTQQQLQIQQ PTTTATLQGT HQTGGQNQMQ PLQVIHQSLQ
NQTLMPQQFY NTQGQLLMPG NIAFHPGLNP QIQQAQLAPS IQVIAAGKTL QPNQLSTPYS
FNTSYTIPTS AANNQAYIIG QLNSQQQLSS ILQQQQQTKP GEMQKLEMYC FHRTCNTAAD
WNDSANTCPD YVAIPQFLNQ NSILIRGTQP DQMFIQQQQP LQNNTNTHQH FTMPLTTTQL
PATSISSATT MEFNRPRQLC SLNPHYRFNH SLRKTKVEIS NISHNRTAAA ATATTTTTAS
KSTAPAATTT VATAVTANGS SATTNSNDSS RYGYISIKHV AATDFTKSTN ADNSSDAPSP
ASAKTINHTS QGATQQLQPQ SVTLIQQQPA TQQPQPIRPN YSTAPATSTT ISQQQGITTT
ATPQQRAKMR KPGVGRGSSN NASPTQQAKV MFPRPTIPTS ISPLSAPLKP APSSQQSVGA
TLTKVPISSQ GPPQSVNNQP LKTINLAALH TPPTLPDATL TPLMPQSSSQ GQTMQQAPQP
LQVFPAPTTQ LPSPNISITP APPPLKKEEH HNENSSSQAP PPSSQQQQQN NSTQGNQHSE
LKSENEKENT KTETNTEESE ESTGKQKLPK AMVKPQVLTH VIEGFVIQEA SEPFPLGRNG
DSDEPPMKKI ALSGEMGKCE MCGKVDLRSK FKKNKRFCSS QCAKGMKAQQ QQQQNSQQPS
SVQNTTNNHY NNVNDSRKSK GKKWGESGES ELTDETSSSV GEPSLMSPST EQVDEEPKVN
PVKWNVSEVV DFVRSLPGCA EYADDFALQE IDGQALMLLK EDHLMSAMGM KLGPALKLCA
KIELIKAGMS SANGDGNQSK A
//