GenomeNet

Database: UniProt
Entry: I3KRU4_ORENI
LinkDB: I3KRU4_ORENI
Original site: I3KRU4_ORENI 
ID   I3KRU4_ORENI            Unreviewed;      3061 AA.
AC   I3KRU4;
DT   11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 2.
DT   27-MAR-2024, entry version 56.
DE   SubName: Full=Huntingtin {ECO:0000313|Ensembl:ENSONIP00000023839.2};
GN   Name=htt {ECO:0000313|Ensembl:ENSONIP00000023839.2};
OS   Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX   NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000023839.2, ECO:0000313|Proteomes:UP000005207};
RN   [1] {ECO:0000313|Proteomes:UP000005207}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Broad Institute Genome Assembly Team;
RG   Broad Institute Sequencing Platform;
RA   Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT   "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL   Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSONIP00000023839.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: May play a role in microtubule-mediated transport or vesicle
CC       function. {ECO:0000256|ARBA:ARBA00002907}.
CC   -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC       Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the huntingtin family.
CC       {ECO:0000256|ARBA:ARBA00007153}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 8128.ENSONIP00000043030; -.
DR   Ensembl; ENSONIT00000023860.2; ENSONIP00000023839.2; ENSONIG00000018930.2.
DR   eggNOG; ENOG502QR1D; Eukaryota.
DR   GeneTree; ENSGT00390000015863; -.
DR   HOGENOM; CLU_000428_0_0_1; -.
DR   TreeFam; TF323608; -.
DR   Proteomes; UP000005207; Linkage group LG6.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR048412; Htt_bridge.
DR   InterPro; IPR048413; Htt_C-HEAT_rpt.
DR   InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR   InterPro; IPR000091; Huntingtin.
DR   InterPro; IPR028426; Huntingtin_fam.
DR   InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR   PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR   PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR   Pfam; PF20925; Htt_bridge; 1.
DR   Pfam; PF20927; Htt_C-HEAT; 1.
DR   Pfam; PF12372; Htt_N-HEAT; 1.
DR   Pfam; PF20926; Htt_N-HEAT_1; 1.
DR   PRINTS; PR00375; HUNTINGTIN.
DR   SUPFAM; SSF48371; ARM repeat; 2.
PE   3: Inferred from homology;
KW   Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005207}.
FT   REGION          452..515
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          570..613
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1106..1163
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1806..1826
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2551..2577
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        452..513
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        572..606
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1142..1163
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3061 AA;  340026 MW;  37CC9F63D49D9758 CRC64;
     MATMEKLMKA FESLKSFQQQ QGPPTIPKFS LSLCTHTHSH SSFAFCLIFH RKKEQAATKK
     DRVTHCLTIC ENIVAQSLRT SPEFQKLLGI AMEMFLLCSD DSESDVRMVA DECLNKIIKA
     LMDSNLPRLQ LELYKEIKKN GASRSLRAAL WRFAELAHLI RPQKCRPYLV NLLPCLTRIT
     KRQEETVQET LATAMPKIMS ALGHFANDGE IKVLLKAFVA NLKSSSPTIR RTAASSAVSI
     CQHSRRTSYF YTWLLNVLLG LLVPVDDEHP SHLILGVLLT LRYLMPLLQQ HVNSTSLKGS
     FGVMRKEADV QPTPEQLLQV YELTLHYTQH WDHNVVTAAL ELLQQVFRTP PPELLHMLIT
     AGSIPHATVF RQDTENRSRS GSILEFIGNH VNRLLLSKQC KMLSGEEDGL EDDPERTEVT
     TAAFTGEIQT PATSVVGTDS SSAAQVDIIT EQPRSSQHAL QPGDSVDLSS PNDNDEEMLS
     RSSSESGTLL GTNDRSLPPS DSSQTTTEGP DSAVTPSDVA ELVLDGSESQ YSGMQIGTLQ
     DEEDEGTAPS SQEEPQEPFL QSALALSKPH LFDGRGHNRQ GSDSSVDRFI PKDEPAEPEP
     DNKPSRIKGP IGHYTDQGAE PLVHCVRLLA ASFLLTGQKN GLIPDKEVRV SVKALALSCV
     GAAAALHPEA FFNSLYLEPL DGVPVQQYIS DVLGLIDHGD PQIRGATAIL CAAIIQAALT
     KTRFNIHTWL ASVQSATGNP LSLVDLVPLL QKTLKDESSV TCKMACSAVR HCITTICSST
     LSELGLQLVI DLFALKDSSY WLVRTELLET LAELDFRLVN FLERKTETLH KGDHHYTGRL
     RLQDRVLNDV VVYLLGDDDP RVRHVAASAV SRLVSRLFYD CDQGQVDPVV AIARDQSSVY
     LQLLMHETQP PSQFTVSTIT RTYRGYSMSN TVSDVTLENN LSRVVAAVSH ALTSSTSRAL
     TFGCCEALCL LASNFPVGNW STGWHCGYVS SNNAERRTLT VGIANTVLSL LSSAWFPLDL
     SAHQDALLLS GNLIAAVAPK CMRNPWVGEE EGSSGTSSGG PSKLEEPWAA LSERSLVVMV
     EQLFSHLLKV LNICAHVLDD TPPGPAMKAT LPSLSNTPSL SPIRRKGKEK EGAEPSATPM
     SPKKGSEANT AGRPTDSTGS TTVNKSTTLG NFYHLPPYLK LYDVLKATHA NYKVTLDLHS
     SQEKFGSFLR ATLDVLSQLL ELATLHDIGK CVEEILGYLK SCFSREPTMA TVCVQQLLKT
     LFGTNLASQY EGVLSGPSRS QGKALRLGSS SLRPGLYHYC FMAPYTHFTQ ALADASLRNM
     VQAEQEQDTS GWFDVMQKAS NQLRSNITNA ARHRGDKNAI HNHIRLFEPL VIKALKQYTT
     STSVALQRQV LDLLAQLVQL RVNYCLLDSD QVFIGFVLKQ FEYIEVGQFR DSEAIIPNIF
     FFLVLLSYER YHSKQIISIP KIIQLCDGIM ASGRKAVTHA IPALQPIVHD LFVLRGSNKA
     DAGKELDTQK EVVVSMLLRL VQYHQVLEMF ILVLQQCHKE NEDKWKRLSR QIADVILPMI
     AKQQMHLDSP EALGVLNTLF ETVAPSSLRP VDMLLKSMFT IPATMASVAT VQLWVSGILA
     VLRVLVSQST EDIVLSRVHE LALSPNLLSC HAIHCLQGSN SSCLCTFFHF SLFHLPSRFL
     LQLVGVLLDD ISTRQVKVEI TEQQHTFYCQ QLGTLLMCLI HVFKSGMFRR ITAAASRLLK
     GESGQTATEA NLFYPLEGLN SMVQCLITTH PSLVLLWCQV LLIINYTNYS WWAEVHQTPR
     RHSLSSTKLL SPHSSGEGEE DKPESQLAMV NREIVRRGAV ILFCDYVCQN LHDSEHLTWL
     IVNHVRDLIS LSHEPPVQDF ISAVHRNSAA SGLFIQAIQS RCDNLTTPTM LKKTLQCLEG
     IHLSQSGSLL MLYVDKLLNT PFRVLARMVD TLACRRVEML LAETLQNSIA QLPVEELDRI
     QEYLQNSGLA QRHQRFYSLL DRFRATVVDT SSPTPPVTSH PLDGDPPSAP ELVIADKEWY
     VALVKSQCCL RGDVSLLEMT ELLTKLPPAD LFSVMSCKEF NLSLLCPCLS MGMQRLARGQ
     GSLLLETALQ VTLEQLAGVT GSLPAPHQSF LPPSQPQPYW DQLGDVYGKT GKVLSLCRAL
     SQYLLSVSQL PSSLHIPSDK EHLITTFTLW RLLQDRLPLS VDLQWALSCL CLALQQPCVW
     NKLSTPEYTT HTCSLIYCLR LIIVAVAVSP GDQLLHQEKK MAKGEKDDGD QVDWQACEIM
     AELVEGLPSI LSLGHRRNSI LPTFLTPTLR NIVISLSRLP LVNSYTRVPP LVWKLGWSPQ
     PGGEFGTTLP EIPVDFLQEK DVFREFLYRI NTLGWSSRTQ FEETWATLLG VLVTQPITKD
     QEEDPQQEDL ERTQLNVLAV QAITSLVLSA MTLPTPGNPA VSCLEQQPRN KSLKALETRF
     GRKLAVIRGE VEREIQALVS KRDNVHTYHP YHAWDPVPSL SAASAGTLIS HEKLLLQINT
     EREMGNMEYK LGQVSIHSVW LGNNITPLRE EEWGEDEEDE ADTPAPTSPP LSPINSRKHR
     AGVDIHSCSQ FLLELYSQWL IPSSPSNRRT PTILISEVVR SLLAVSDLFT ERNQFDMMFS
     TLMELQKHHP PEDEILNQYL VPAICKAAAV LGMDKVIAEP VCRLLETTLR STHLPSRMGA
     LHGVLYVLEC DLLDDTAKQL IPTVSEYLLS NLKAVAHCVN LHNQQHVLVM CAVAFYMMEN
     YPLDVGAEFM AGIIQLCGVM VSASEDSTPS IIYHCVLRGL ERLLLSEQLS RMDGEALVKL
     SVDRVNMPSP HRAMAALGLM LTCMYTGKEK ASPASRPAHS DPQAPDSESI IVAMERVSVL
     FDRIRKGLPS EARVVSRILP QFLDDFFPPQ DVMNKVIGEF LSNQQPYPQF MATVVYKVFQ
     TLHATGQSSM VRDWVLLSLS NFTQRTPVAM AMWSLSCFFV SASTSQWISA LLPHVISRMG
     SSEVVDVNLF CLVAMDFYRH QIDEELDRRA FQSVFETVAS PGSPYYQLLG CLQSVHQDTS
     L
//
DBGET integrated database retrieval system