ID G3VT78_SARHA Unreviewed; 849 AA.
AC G3VT78;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 68.
DE RecName: Full=Circadian locomoter output cycles protein kaput {ECO:0000256|ARBA:ARBA00040572};
DE EC=2.3.1.48 {ECO:0000256|ARBA:ARBA00013184};
GN Name=CLOCK {ECO:0000313|Ensembl:ENSSHAP00000006383.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000006383.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000006383.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000006383.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=acetyl-CoA + L-lysyl-[protein] = CoA + H(+) + N(6)-acetyl-L-
CC lysyl-[protein]; Xref=Rhea:RHEA:45948, Rhea:RHEA-COMP:9752,
CC Rhea:RHEA-COMP:10731, ChEBI:CHEBI:15378, ChEBI:CHEBI:29969,
CC ChEBI:CHEBI:57287, ChEBI:CHEBI:57288, ChEBI:CHEBI:61930; EC=2.3.1.48;
CC Evidence={ECO:0000256|ARBA:ARBA00000780};
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytosol
CC {ECO:0000256|ARBA:ARBA00004514}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3VT78; -.
DR Ensembl; ENSSHAT00000006438.2; ENSSHAP00000006383.2; ENSSHAG00000005555.2.
DR GeneTree; ENSGT00940000157580; -.
DR HOGENOM; CLU_010044_2_2_1; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0005667; C:transcription regulator complex; IEA:InterPro.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0032922; P:circadian regulation of gene expression; IEA:InterPro.
DR GO; GO:0006974; P:DNA damage response; IEA:UniProtKB-KW.
DR CDD; cd19734; bHLH-PAS_CLOCK; 1.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 3.30.450.20; PAS domain; 2.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR047230; CLOCK-like.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001067; Nuc_translocat.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR PANTHER; PTHR46055; CIRCADIAN LOCOMOTER OUTPUT CYCLES PROTEIN KAPUT; 1.
DR PANTHER; PTHR46055:SF2; CIRCADIAN LOCOMOTER OUTPUT CYCLES PROTEIN KAPUT; 1.
DR Pfam; PF00010; HLH; 1.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF14598; PAS_11; 1.
DR PRINTS; PR00785; NCTRNSLOCATR.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
PE 4: Predicted;
KW Activator {ECO:0000256|ARBA:ARBA00023159};
KW Biological rhythms {ECO:0000256|ARBA:ARBA00023108};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW Isopeptide bond {ECO:0000256|ARBA:ARBA00022499};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 34..84
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 107..177
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 285..332
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT REGION 420..494
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 609..653
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 816..849
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 525..559
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 427..464
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 475..494
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 849 AA; 95792 MW; EB1FC47EACCA8C69 CRC64;
MYFTISCHKM SSIADRDDGS IFDGLVEEDD KDKAKRVSRN KSEKKRRDQF NVLIKELGSM
LPGNARKMDK STVLQKSIDF LRKHKEITAQ SDASEIRQDW KPTFLSNEEF TQLMLEALDG
FFLAIMTDGS IIYVSESVTP LLEHLPSDLV DQSIFNFIPE GEHSEVYKML STHLLESDSL
TPEYLKSKNQ LEFCCHMLRG TIDPKEPPTY EFVKFIGNFK SLNNVSSSSH NGFEGTIQRS
HRPSYEDRIC FVATVRLATP QFIKEMCTVE EPNEEFTSRH SLEWKFLFLD HRAPPIIGYL
PFEVLGTSGY DYYHVDDLEN LAKCHEHLMQ YGKGKSCYYR FLTKGQQWIW LQTHYYITYH
QWNSRPEFIV CTHTVVSYAE VRAERRRELG IKESLPEIAA DKSQDSGSDN RINTVSLKEA
LERFDHSPTP SASSRSSRKS SHTAVSDPSS TPTKIATDNS TPPRQHLPGH EKIAQRRSSF
SSQSLNSQSV GQSLAQPMMS QATALHIQQG MSQFSAQLGA MQHLKDQLEQ RTRMIEANIH
RQQEELRKIQ EQLQMVHGQG LQMFLQQSSP GLNFGSVQLS SGNSSNMQQL TPINMQSQVV
QANQIQSGLN TGHIGSSQHM MQQQSLQSPA TQQTQPNVLS GHNQQTSLSS QTQSTLTAPL
YNTMVISQPT TGNIVQIPSS IPQNNNQGAA VTTFTQDRQI RFSQGQQLVT KLVTAPVACG
AVMVPSTMFM GQVVTAYPTF AAQQQQPQTL SITQQQQQQQ QQQPSQQEQQ LTTIQQSSQA
QLTQPSQQFL QTSRLLHGNP STQLILSAAF PLQQGTLTQA HHQQHQSQQQ QLARHRTEGL
TDPSKVQPQ
//