ID I3KRU4_ORENI Unreviewed; 3061 AA.
AC I3KRU4;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 56.
DE SubName: Full=Huntingtin {ECO:0000313|Ensembl:ENSONIP00000023839.2};
GN Name=htt {ECO:0000313|Ensembl:ENSONIP00000023839.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000023839.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000023839.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: May play a role in microtubule-mediated transport or vesicle
CC function. {ECO:0000256|ARBA:ARBA00002907}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the huntingtin family.
CC {ECO:0000256|ARBA:ARBA00007153}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8128.ENSONIP00000043030; -.
DR Ensembl; ENSONIT00000023860.2; ENSONIP00000023839.2; ENSONIG00000018930.2.
DR eggNOG; ENOG502QR1D; Eukaryota.
DR GeneTree; ENSGT00390000015863; -.
DR HOGENOM; CLU_000428_0_0_1; -.
DR TreeFam; TF323608; -.
DR Proteomes; UP000005207; Linkage group LG6.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR048412; Htt_bridge.
DR InterPro; IPR048413; Htt_C-HEAT_rpt.
DR InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR InterPro; IPR000091; Huntingtin.
DR InterPro; IPR028426; Huntingtin_fam.
DR InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR Pfam; PF20925; Htt_bridge; 1.
DR Pfam; PF20927; Htt_C-HEAT; 1.
DR Pfam; PF12372; Htt_N-HEAT; 1.
DR Pfam; PF20926; Htt_N-HEAT_1; 1.
DR PRINTS; PR00375; HUNTINGTIN.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 3: Inferred from homology;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207}.
FT REGION 452..515
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 570..613
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1106..1163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1806..1826
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2551..2577
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 452..513
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 572..606
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1142..1163
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3061 AA; 340026 MW; 37CC9F63D49D9758 CRC64;
MATMEKLMKA FESLKSFQQQ QGPPTIPKFS LSLCTHTHSH SSFAFCLIFH RKKEQAATKK
DRVTHCLTIC ENIVAQSLRT SPEFQKLLGI AMEMFLLCSD DSESDVRMVA DECLNKIIKA
LMDSNLPRLQ LELYKEIKKN GASRSLRAAL WRFAELAHLI RPQKCRPYLV NLLPCLTRIT
KRQEETVQET LATAMPKIMS ALGHFANDGE IKVLLKAFVA NLKSSSPTIR RTAASSAVSI
CQHSRRTSYF YTWLLNVLLG LLVPVDDEHP SHLILGVLLT LRYLMPLLQQ HVNSTSLKGS
FGVMRKEADV QPTPEQLLQV YELTLHYTQH WDHNVVTAAL ELLQQVFRTP PPELLHMLIT
AGSIPHATVF RQDTENRSRS GSILEFIGNH VNRLLLSKQC KMLSGEEDGL EDDPERTEVT
TAAFTGEIQT PATSVVGTDS SSAAQVDIIT EQPRSSQHAL QPGDSVDLSS PNDNDEEMLS
RSSSESGTLL GTNDRSLPPS DSSQTTTEGP DSAVTPSDVA ELVLDGSESQ YSGMQIGTLQ
DEEDEGTAPS SQEEPQEPFL QSALALSKPH LFDGRGHNRQ GSDSSVDRFI PKDEPAEPEP
DNKPSRIKGP IGHYTDQGAE PLVHCVRLLA ASFLLTGQKN GLIPDKEVRV SVKALALSCV
GAAAALHPEA FFNSLYLEPL DGVPVQQYIS DVLGLIDHGD PQIRGATAIL CAAIIQAALT
KTRFNIHTWL ASVQSATGNP LSLVDLVPLL QKTLKDESSV TCKMACSAVR HCITTICSST
LSELGLQLVI DLFALKDSSY WLVRTELLET LAELDFRLVN FLERKTETLH KGDHHYTGRL
RLQDRVLNDV VVYLLGDDDP RVRHVAASAV SRLVSRLFYD CDQGQVDPVV AIARDQSSVY
LQLLMHETQP PSQFTVSTIT RTYRGYSMSN TVSDVTLENN LSRVVAAVSH ALTSSTSRAL
TFGCCEALCL LASNFPVGNW STGWHCGYVS SNNAERRTLT VGIANTVLSL LSSAWFPLDL
SAHQDALLLS GNLIAAVAPK CMRNPWVGEE EGSSGTSSGG PSKLEEPWAA LSERSLVVMV
EQLFSHLLKV LNICAHVLDD TPPGPAMKAT LPSLSNTPSL SPIRRKGKEK EGAEPSATPM
SPKKGSEANT AGRPTDSTGS TTVNKSTTLG NFYHLPPYLK LYDVLKATHA NYKVTLDLHS
SQEKFGSFLR ATLDVLSQLL ELATLHDIGK CVEEILGYLK SCFSREPTMA TVCVQQLLKT
LFGTNLASQY EGVLSGPSRS QGKALRLGSS SLRPGLYHYC FMAPYTHFTQ ALADASLRNM
VQAEQEQDTS GWFDVMQKAS NQLRSNITNA ARHRGDKNAI HNHIRLFEPL VIKALKQYTT
STSVALQRQV LDLLAQLVQL RVNYCLLDSD QVFIGFVLKQ FEYIEVGQFR DSEAIIPNIF
FFLVLLSYER YHSKQIISIP KIIQLCDGIM ASGRKAVTHA IPALQPIVHD LFVLRGSNKA
DAGKELDTQK EVVVSMLLRL VQYHQVLEMF ILVLQQCHKE NEDKWKRLSR QIADVILPMI
AKQQMHLDSP EALGVLNTLF ETVAPSSLRP VDMLLKSMFT IPATMASVAT VQLWVSGILA
VLRVLVSQST EDIVLSRVHE LALSPNLLSC HAIHCLQGSN SSCLCTFFHF SLFHLPSRFL
LQLVGVLLDD ISTRQVKVEI TEQQHTFYCQ QLGTLLMCLI HVFKSGMFRR ITAAASRLLK
GESGQTATEA NLFYPLEGLN SMVQCLITTH PSLVLLWCQV LLIINYTNYS WWAEVHQTPR
RHSLSSTKLL SPHSSGEGEE DKPESQLAMV NREIVRRGAV ILFCDYVCQN LHDSEHLTWL
IVNHVRDLIS LSHEPPVQDF ISAVHRNSAA SGLFIQAIQS RCDNLTTPTM LKKTLQCLEG
IHLSQSGSLL MLYVDKLLNT PFRVLARMVD TLACRRVEML LAETLQNSIA QLPVEELDRI
QEYLQNSGLA QRHQRFYSLL DRFRATVVDT SSPTPPVTSH PLDGDPPSAP ELVIADKEWY
VALVKSQCCL RGDVSLLEMT ELLTKLPPAD LFSVMSCKEF NLSLLCPCLS MGMQRLARGQ
GSLLLETALQ VTLEQLAGVT GSLPAPHQSF LPPSQPQPYW DQLGDVYGKT GKVLSLCRAL
SQYLLSVSQL PSSLHIPSDK EHLITTFTLW RLLQDRLPLS VDLQWALSCL CLALQQPCVW
NKLSTPEYTT HTCSLIYCLR LIIVAVAVSP GDQLLHQEKK MAKGEKDDGD QVDWQACEIM
AELVEGLPSI LSLGHRRNSI LPTFLTPTLR NIVISLSRLP LVNSYTRVPP LVWKLGWSPQ
PGGEFGTTLP EIPVDFLQEK DVFREFLYRI NTLGWSSRTQ FEETWATLLG VLVTQPITKD
QEEDPQQEDL ERTQLNVLAV QAITSLVLSA MTLPTPGNPA VSCLEQQPRN KSLKALETRF
GRKLAVIRGE VEREIQALVS KRDNVHTYHP YHAWDPVPSL SAASAGTLIS HEKLLLQINT
EREMGNMEYK LGQVSIHSVW LGNNITPLRE EEWGEDEEDE ADTPAPTSPP LSPINSRKHR
AGVDIHSCSQ FLLELYSQWL IPSSPSNRRT PTILISEVVR SLLAVSDLFT ERNQFDMMFS
TLMELQKHHP PEDEILNQYL VPAICKAAAV LGMDKVIAEP VCRLLETTLR STHLPSRMGA
LHGVLYVLEC DLLDDTAKQL IPTVSEYLLS NLKAVAHCVN LHNQQHVLVM CAVAFYMMEN
YPLDVGAEFM AGIIQLCGVM VSASEDSTPS IIYHCVLRGL ERLLLSEQLS RMDGEALVKL
SVDRVNMPSP HRAMAALGLM LTCMYTGKEK ASPASRPAHS DPQAPDSESI IVAMERVSVL
FDRIRKGLPS EARVVSRILP QFLDDFFPPQ DVMNKVIGEF LSNQQPYPQF MATVVYKVFQ
TLHATGQSSM VRDWVLLSLS NFTQRTPVAM AMWSLSCFFV SASTSQWISA LLPHVISRMG
SSEVVDVNLF CLVAMDFYRH QIDEELDRRA FQSVFETVAS PGSPYYQLLG CLQSVHQDTS
L
//