ID G3VQM4_SARHA Unreviewed; 1003 AA.
AC G3VQM4;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 80.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000005479.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000005479.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000005479.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000005479.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3VQM4; -.
DR Ensembl; ENSSHAT00000005533.2; ENSSHAP00000005479.2; ENSSHAG00000004787.2.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT01100000263534; -.
DR HOGENOM; CLU_002678_17_1_1; -.
DR InParanoid; G3VQM4; -.
DR OrthoDB; 4622522at2759; -.
DR TreeFam; TF343410; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd07765; KRAB_A-box; 1.
DR Gene3D; 6.10.140.140; -; 1.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 23.
DR InterPro; IPR001909; KRAB.
DR InterPro; IPR036051; KRAB_dom_sf.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR24393; ZINC FINGER PROTEIN; 1.
DR PANTHER; PTHR24393:SF167; ZINC FINGER PROTEIN 345-RELATED; 1.
DR Pfam; PF01352; KRAB; 1.
DR Pfam; PF00096; zf-C2H2; 20.
DR SMART; SM00349; KRAB; 1.
DR SMART; SM00355; ZnF_C2H2; 22.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 13.
DR SUPFAM; SSF109640; KRAB domain (Kruppel-associated box); 1.
DR PROSITE; PS50805; KRAB; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 22.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 23.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00042}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1003
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5029668196"
FT DOMAIN 73..144
FT /note="KRAB"
FT /evidence="ECO:0000259|PROSITE:PS50805"
FT DOMAIN 301..328
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 356..383
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 412..439
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 440..467
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 468..495
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 496..523
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 524..551
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 552..579
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 580..607
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 608..635
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 636..663
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 664..691
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 692..719
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 720..747
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 748..775
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 776..803
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 804..831
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 832..859
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 860..887
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 888..915
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 916..943
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 944..971
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 972..999
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 20..56
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 314..352
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 34..56
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 325..352
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1003 AA; 115474 MW; 28FD2BAECA65FD0E CRC64;
MRSGLPSLLL SAVASAAAQA RRPPAGLSSP GLRGRHWRDA KRGVMPHNCP RKRRLEEAGP
PARIPWARSR EAVTFRDVAL DFTWDQWGCL STAQRELYRE VMLENYRNLV SVGLLGPKPK
LISWLEQGEG AAIWDPLGCI GSRPCLCEGG RSSRRPGASG RSCSAHHSCA SIFEALDGAG
WDSKLENRES PLKQKARTPE TSGEGLVKDV LHGSVFIGAP TQEIPLWKQM GNISRNFHDF
KKLAFFHSKI PSVQTGHSCF DNGKHFSRDI SGLESSRNGN EFGDLSALIQ HQRANRQKKP
SDCEELGKAF GPDVYVTQHR RIPSGQEPYQ RKEGEKIQER SHPTPDQTVP KEEKPYKCKD
CEKIFTSRSC LINHERIHSG DKLSEYKECR KAVRKRSPLV QPERVRSEEK PYKCQECGKA
FRHNSVLVQH QNIHSGEKPY QCKECGIAFS QKSYLTKHKS IHTGEKPHQC KECGAAFRQR
SYLTKHQRVH TGERPYPCKE CGSAFSQKSY LTKHQKIHTG ERPYPCKECG SAFRQKSHLR
QHQRIHAGEK PYPCKECGTS FSQKSYLTKH HRIHTGEKPY KCNDCGLAFS QRSYLTKHQR
IHTGERPYQC NECGGAFRQR SHLIHHQRIH TGEKPYQCKE CGKAFRQNSV LIQHQSVHAE
EKPHRCKDCG KSFNCKSHLT RHEKIHTGEK PFECARCGKA FSRNLTLIAH QNIHTGEKPY
KCQECGRAFR HHSGLTQHQG VHTEEKSYRC DDCGTIFSQK SYLAIHQRIH TGEKPYQCKE
CGAAFRQRSH LIQHQRIHTG EKPYKCRECG KTFRQNSVFI QHRKVHTEEK PYQCKECGKA
FRQNSVLIQH QRIHTEEKPY QCKECGKVFR QYSVLVHHQR IHTGEKPHMC IRCGKAFIRR
SELLSHQLIH TGEKPYKCRE CGKAFRQNSV LLQHKRIHTG EKPYHCKECG AAFRKRSYLI
QHQRIHTGEK PYKCKECEKM FSNCSSLYNH EKIHKRAKSY ECE
//