ID A0A3P8PB43_ASTCA Unreviewed; 2993 AA.
AC A0A3P8PB43;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Leucine rich repeat containing 71 {ECO:0000313|Ensembl:ENSACLP00000014217.1};
OS Astatotilapia calliptera (Eastern happy) (Chromis callipterus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Astatotilapia.
OX NCBI_TaxID=8154 {ECO:0000313|Ensembl:ENSACLP00000014217.1, ECO:0000313|Proteomes:UP000265100};
RN [1] {ECO:0000313|Ensembl:ENSACLP00000014217.1, ECO:0000313|Proteomes:UP000265100}
RP NUCLEOTIDE SEQUENCE.
RA Datahose.;
RL Submitted (MAY-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACLP00000014217.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8154.ENSACLP00000014217; -.
DR Ensembl; ENSACLT00000014560.1; ENSACLP00000014217.1; ENSACLG00000009699.1.
DR GeneTree; ENSGT00940000156698; -.
DR OMA; DKDNGHE; -.
DR Proteomes; UP000265100; Chromosome 22.
DR Bgee; ENSACLG00000009699; Expressed in liver and 8 other cell types or tissues.
DR GO; GO:0005694; C:chromosome; IEA:UniProt.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:UniProt.
DR CDD; cd04717; BAH_polybromo; 1.
DR CDD; cd05525; Bromo_ASH1; 1.
DR CDD; cd15548; PHD_ASH1L; 1.
DR CDD; cd19174; SET_ASH1L; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR Gene3D; 1.20.920.10; Bromodomain-like; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR017956; AT_hook_DNA-bd_motif.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR043320; Bromo_ASH1L.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR018359; Bromodomain_CS.
DR InterPro; IPR043319; PHD_ASH1L.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR46147; HISTONE-LYSINE N-METHYLTRANSFERASE ASH1; 1.
DR PANTHER; PTHR46147:SF2; SET-BINDING PROTEIN; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF00439; Bromodomain; 1.
DR Pfam; PF20826; PHD_5; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00384; AT_hook; 7.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00439; BAH; 1.
DR SMART; SM00297; BROMO; 1.
DR SMART; SM00249; PHD; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF47370; Bromodomain; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS51038; BAH; 1.
DR PROSITE; PS00633; BROMODOMAIN_1; 1.
DR PROSITE; PS50014; BROMODOMAIN_2; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
PE 4: Predicted;
KW Bromodomain {ECO:0000256|ARBA:ARBA00023117, ECO:0000256|PROSITE-
KW ProRule:PRU00035}; Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 2048..2099
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 2102..2218
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 2226..2242
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT DOMAIN 2409..2479
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 2615..2752
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT REGION 1..809
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 893..1099
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1138..1205
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1374..1407
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1461..1500
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1514..1571
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1586..1725
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1765..1802
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1834..1894
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1907..1952
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2248..2291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2508..2534
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2779..2906
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 25..48
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..185
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 315..334
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 379..494
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 505..528
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 552..588
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 593..615
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 653..678
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 685..699
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 902..955
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 971..989
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 997..1015
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1019..1040
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1060..1090
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1463..1477
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1514..1541
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1557..1571
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1632..1647
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1665..1680
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1681..1697
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1788..1802
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1868..1883
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2512..2534
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2792..2824
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2993 AA; 331385 MW; BD355E5418122578 CRC64;
MDQKIQGRTA TPPPLLSGAP TGEREKEGAG GKKEEEEEKK RDREKEGTPT ASAGAGGTGG
PGGAGGDQSH FSIKESSLSE GNVKLKIGLQ AKRMKKPPKI LENYVCRPAF RATVRHTGRG
GGNARGNRAG ATGDGAGSQS QSPSQGREKE REKSPSVNRP PISSSSTAPT AKAPTPPPLA
PPPASTTTLT PSQMNGNAPA KKGPPKMDCK SDSKSDTKAA TTASERPLNL HRPPPDSKTH
SSGKKAPAPQ PNPRTSSPSP PLSLTKEPSL EGVKPPQCTY STDSQRDKDK GLQSWGAPTV
TEKLAQLIAT CPPSKTPKPA KPAKIDPSPP LPSSGFMAPT AKQRDRALAN RNTYSRMVHL
SPPPPVSRPP GRPYGSRSKD SILENSTTLT PLRNEDLDGT GKAASSNGNN SISMSSTNSS
SRSNSPAFTY GNNRLTVLSN ESNSDSSNSS ISKQPLHPAH SLPHTTSPSL CPISSSSSSS
VEQRTVSAVS HLGSPVPSQG HHARESPALA EESSAGEREQ DRDIPRDLSK CPPSAATGRP
EGKEKGGSNQ SVRVERGKSS SPSKRSPNQD SNRSGWTSSP TESIRQRTPS PLEPDDDDDD
DDDDEDEDDE EESPQTPLRD DSPDSSLDSP AEQDSKPLKK RRGRKPRWTR VMLKAQSQRS
PDSLFTENKT TMPLSSSLEI PTPPVKRPVG RPPNPNKVKP NPVSQNSVSQ LFPPQPKKRG
RPKSKMPRLD APARGNPPNK LQPSKVFSSL LKSKEEQDPP VLHPEVDLNP PKPMPRKRGR
PKRLPPTLPQ EGQPPTLAPE AGERFRNKGN GQLIMKTIIG KINKMKSVKR KRILSQILLG
PRSEDTPKGT TSSMAATAEA ATQSLSSLAA SFGGKLGPQI NVSKKGTIYM GKRRGRKPKV
ATNVSATMSP SETFISPNTT SPLHPHQSQS HQQHQLSSEV FSSPSLSQLS GGHSPISDGS
FVEPGSVHFA GHSHYSNSHH SHNTFSLPPP TFSAPNPRNP VMGSMSSSGV PVASQKKSSC
RGYHHHHHHY RQHYHYHKLS PPRPLHPTSP APLSELKEAT PSPVSESHSE ETVPSDSGIG
TDNNSTSDRG EKAGGAGGLG AIGIPPGMGG GLLMPGVMGS AMGPGMGINS RGRRRHSNVL
MEHPSPSPSP HGTRSSPDPR RSHPAAPATT LVGHKEKHKH KCKRRSHGCP GYDKLKRQKR
KRKKKYLQLR SRRLDPDFLA ELDEIVVRLS EIRITHRTTG HRLGSGIGIA AGSSRVPGVG
GRAAGAIGGT GGPPPHHYVH RDLLPTIFRV NFGSFYSHPP YSCDPLHYVR KPDMKKKRGR
PPKLRESMSE VPFVPGLGFP LSSGGFYHPS YVPYSSGPLG LGYYRGYPPA SALYPHPHHQ
SPHTAPSHHT HHSPSFPPPP TSFLHHHHPS HLLLNHSKFH KKKHKLLRQE YLGAGRPPVL
YPPMSSELSF NWHHKHKHRH KHRERCAEDD REEAARGGSG SRASAGISDS GTSGKGERVS
GLGMAEALQR CRFGRDTSST SASKQAATTS ANSPSSSSSS SAERYKRKES SMSCLGPSRL
ALGNSSKGHS AESWFRIGNS EADYSKLSRS QLAPGQGSFS DGRAEDPAGC SDSEDEEPLT
PTEDVETHDS PKRTNLFASA ITRTSLKGGR SRKTDVVTES SSFSHMDRPM RKEHSTSAER
REMGSSGIQT RGVTLPTSEE PEGSLHHRQQ HHHQSSLFHS HSSTSSSCLS PSQECCLDVS
LSHNSHRVQP SKYSLLHVNK ILRAKKLQRQ ARTGNNMVKK RGPGRPRKHP LPSPPPSPPP
VVELNQVRHR EKGVERLAEG RGWEGDTVTD AIESVVQGQR RKGQKRKHWE REGDEEEEED
VEEDREAEDM EGHLPEREEN LGSLVARSRP GTGRSWLTQD ELQSLHGSVE TKPDGHCSSE
GPGPVSREQA PPMPIATQRE KRPARPPKKK FQKAGLYSDV YKTEDPRSQF LQLKKEKLEY
IPGEHEYGLF PAPIHVGKYL RQKRIDFQLP YDILWLWKHD QLHKRPDVPL YKKIRSNVYV
DVKPLSGYET TTCNCRLPED STEKGCMDEC LNRMSFAECS PSTCPCGDHC DNQHIQRHEW
VQCLERFRTE GKGWGIRTKE SLRSGQFIIE YLGEVVSEQE FRSRMMEQYF SHSGHYCLNL
DSGMVIDSYR MGNEARFINH SCEPNCQMQK WSVNGVYRIG LFALRDISSG TELTYDYNFR
SFNTEEQQVC KCGSEGCRGI IGGKSQRING LPGKSGGTRR LGRLKEKRKS KHQLKKREEE
SSDSSKFYPH LMKPMSNRER NFVLKHRVFL LRNWEKMREK QELLKREGER ERDASSLSIY
TRWGGVIRDD GNIKSDVFLT QFSALQTSRS VRTRRLAAAE ENTEVTRTAR LAHIFKEICD
MITSYKDSAG QTLAAPLVNL PSRKRNSQYY EKVSDPLDLS TIEKQILTGH YKTVEAFDTD
MLKVFRNAEK YYGKKSSIGR DVCRLRKAYY SARNEAAVQI DEIVGETASE ADSSDSLDRD
HGHHHHTGGP GSHDKDDDVI RCICGMYKDE GLMIQCEKCM VWQHFDCMRL ETEVEHYLCE
QCDPRPVDRE VPMIPQPSYA QAGSVYYICL LRDDLLLHQG DCVYLMRDSR RSPEGQPVRQ
SYRLLSHINR DKLDIFRIEK LWKNEKGERF AFGHHYFRPH ETHHSPSRRF YQNELFRMPL
YEIIPLEAVV GTCCVLDLYT YCKGRPKGVK EQDVYICDYR LDKSAHLFYK IHRNRFPVCT
KPYAFNHFPK RLTPKRDFSP HYVPDNYKRN GGRSAWKSER PKEAGGCEDD ASSCDRGEDF
PPEGEDGRGV EDDMDMASED SALLSAKPRR PERERETGEE EDEEDGQEAD ERKGLEERSA
ERIGELLEVP SSSTSSPLHH PALGRREAQR ERLNKILLDL LHRTPSKNGP TPNETLPVQT
FDEYQCTGNA ETDFPALCAL LNVKDIPAVI AKFPASSASQ TEGVTSKQEM VLL
//