ID V4AN43_LOTGI Unreviewed; 2980 AA.
AC V4AN43;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 24-JAN-2024, entry version 44.
DE RecName: Full=Huntingtin {ECO:0008006|Google:ProtNLM};
GN ORFNames=LOTGIDRAFT_160793 {ECO:0000313|EMBL:ESO95031.1};
OS Lottia gigantea (Giant owl limpet).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Gastropoda;
OC Patellogastropoda; Lottioidea; Lottiidae; Lottia.
OX NCBI_TaxID=225164 {ECO:0000313|EMBL:ESO95031.1, ECO:0000313|Proteomes:UP000030746};
RN [1] {ECO:0000313|EMBL:ESO95031.1, ECO:0000313|Proteomes:UP000030746}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
CC -!- FUNCTION: May play a role in microtubule-mediated transport or vesicle
CC function. {ECO:0000256|ARBA:ARBA00002907}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the huntingtin family.
CC {ECO:0000256|ARBA:ARBA00007153}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB201701; ESO95031.1; -; Genomic_DNA.
DR RefSeq; XP_009054232.1; XM_009055984.1.
DR STRING; 225164.V4AN43; -.
DR EnsemblMetazoa; LotgiT160793; LotgiP160793; LotgiG160793.
DR GeneID; 20238497; -.
DR KEGG; lgi:LOTGIDRAFT_160793; -.
DR CTD; 20238497; -.
DR HOGENOM; CLU_000428_0_0_1; -.
DR OMA; PNKMEEP; -.
DR OrthoDB; 6903at2759; -.
DR Proteomes; UP000030746; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR048412; Htt_bridge.
DR InterPro; IPR048413; Htt_C-HEAT_rpt.
DR InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR InterPro; IPR000091; Huntingtin.
DR InterPro; IPR028426; Huntingtin_fam.
DR InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR Pfam; PF20925; Htt_bridge; 1.
DR Pfam; PF20927; Htt_C-HEAT; 2.
DR Pfam; PF12372; Htt_N-HEAT; 1.
DR Pfam; PF20926; Htt_N-HEAT_1; 1.
DR PRINTS; PR00375; HUNTINGTIN.
DR SUPFAM; SSF48371; ARM repeat; 3.
PE 3: Inferred from homology;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000030746}.
FT REGION 367..403
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 899..922
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1031..1095
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1240..1259
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1077..1095
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2980 AA; 336088 MW; F63B44BDDEAC2782 CRC64;
MATIEKLIKA FEALKVFQSA PGGLEEITSS AKKKDQPPSK KDKMVNCNVV ADCICAVNMR
SVPDFPKFLG IAVEAFLTLC DDPESDIRMV ADECLNRTIK TLLETNLGRL QVELYKEIKK
NGSSRTLRGA LWRFADMCHL IRPLKCRPYI VNLLPCLTRI SQRDEEAIQD TLGTSMSKIC
PALMGFANDG EVKSLLKTFL PNLKSTVAVC RRTTATSLVL ICQYSRNPLS FFNYLISVLL
DMVLPVDTEH DIYTLLGVIL CLRHAIPIMN DSDEKDQGMK GSFGLIQKDK EEGVSAMQVE
KIFQLLLFYT SHNDHNVVTA SLEALQQLLR HPPDSLRSIL LVRGSITRTH IFQQDFIDER
ERMRVESSAE LTSEADDQGL EEDADVSMAT DQPIKSNTAT LDDDIEQVEE LQLDSSSQEV
IKDSEENAKK LDPSVHFTTD DDYSNLDIGD LNDDKSEKTT LTRSGSQDTL LSVKSLSPRH
SASVRTLELA QDLNGNPEYV ETSEPPSPMP ISDSNKIFQQ LADFGNITDN EMPLVYCVRL
LCRRFLLSGN KLELILDKFV RVSSKSLALA CISSAIFMCP KIFLLKLLPT DSGDQDIREV
TLYASHPDPV LKGQTGIVIS SFIKAVLIEG RGDFNKWLKM NSPADIGEIC IDNLVNTIIH
VIEDESSVAT RQGLIALQSC LASLLDSCHG GLGLQVLLDI LEVRNNPYWL VKVELLEVLS
SINFKVVSYL ENICEEIEIG KHHYLGKLEI QEYYIQNVVL CLLADEDIRV RHAAANSLVR
LIPNLFYPVD HPQHDPVTAC AQDLTNRVFT SLSHQLIEDL PPLVQGLVKP YHYSPNIKVN
STVESCLSRI IQQLLHQIHI SQSRHLLNGC CHALCLLCEE YTVTCYSSCW GCGSATHIST
KDQTRSGSMK RPPSRSLSSS STFSLEELSS ATGSGPLPMI LSILMSSPVG LDISSHQDVL
QLTGNLMAGS AFKCLRSSEE LASLTSTGDA GKWAAISDIQ LGTLIDQLLV HVARLLNCFL
HVFEEQIPGP PQVKPSLPSL PNAPSLSPVK RKVKGGDKET ASPSGKDNIT DSKTPQKTPQ
KDTKEGEKEK GKKEGLGTFY NLPQYMKLFD VLKGAYSNYK TSLDLSNCDK FCNLLKVTLS
VLSQILEVTT LTDIGKYAEE FLTYFRCTFS IEPTFTVLCV QQLLKALFGT NIGNQWEPGS
VSNYTSQRVK HLTRQAGSNF KPGLYQCCLN QPYTTLTKSL DGTSTTNTDK TQSESDTSQS
NLLWLKKRVE RKVPSILKPG SKVDKSAIAS YIRLFEPLVI KALKQYTVTS SLDLQHQVLN
LLAQLVQLRV NYCLLDSDQI FIGFVLKQFE YIEEGQIRES EMLIPHIFRF LVMLSYERFP
GKVIIDMNGV IHRCDGIMAS GLQPTKHAIP ALRPIVYDLF LLRGNMKSEV GKEQDTQREV
VVSMLLRLIN YHQALEMFVI VLQQCHRESE EKWKKLSRQV MDLVLPALSK QQINLDCSKA
LDVLHLLFEC VSPSVFRPVD ILLKTLLAPP TDISCVEGLQ RWLCLVLAIL RIIISQSKEE
TVLSRLLELN LPLCLTRDLT KTNADPVALT DINPEETCAW FLLQAVGLCS EILNRESAIV
STSQTFCDFT VEQMSHLLLY ITYMFQSGSY RRVATAAMRL INRDRPSCLY SVKEINNFLL
GVSTKWPTLA LHWSNILILL NYDNQKLWTQ ILQTPQKYHI VTPGRYSSGN LESSRRLLQC
CNLEILRRGG LILYCDYVVE NLSNAEHMTW LTVNHVSDLI ELTSESPVQD FISAIHRNPA
ASSMFVQAIN SRCDHVTRPS LIRKTLKSLE AIHLSQSGSL LQLLIDKFLN THQLSIARMC
DTIACRRVEM LLIESEEESK KQLPLEDLDK LLQFMKTNNL IKRHARLASL LSKFRTLFNA
EKELNLSPER THPLVFTSTN ITDINIDKDF YSDVVKDQCF SIDPNVRECA FLLQRLDYAD
VLAITMTKEF CLEILDECIA LGAYRSIVRY NRDKEALPIS FKPEPTIDEL FQAAQLTMFR
HINNIINHLP VPHQLLSYSR RQTSKNLRYY DRIEDLFTDC TWFDMNFDLA TSLCQYLVCI
RRFPWPATVP QEAVSDICRF CVLCLEMISW QLHHNQMPTS EQLQTCLECV ALVLQNKDFT
NIIGHSDSST YVSSITSAVY QLLSSLVVLP GEKVYLLPNK DRLDEQEDDE SSHLITSCDQ
ISELLQCLKT RLRPKSSETQ KLPEFLSSPI RNIVTGLARL PLVNSYARVP PLVWKLGWSP
SPTGNLKTHL PPVPVDYLQE KDVLEEFVFR INTLGWISRH QFEETWMSLL GVLNPVTLTD
THQLSAEEEI ERTQGMVLAV KAITSLLLQS TLIPQAGNPS NSVYEIRPRD KPLAFLHTRC
GKKLGMIRSL IEQEIYSLCC NKSDSQTHHT HSKPTRRLDS YFCEDLFEDN LERRIGSEDF
SLGQISLEGI WSVVGSLETT HSESDTTDST ESPQHDLKKP HIALSNMESK DRSMSINGLD
INSCLQFISE LYGPWLSTES SNKPPLMLLN AVVKSMLSLS DLFMEREQFE MMLDTFLDLY
KQHPSEDEIL LQYLIVGICK ASAVIGVDSS TAERLIKVID SGLKSTHLPS KVSSLHGILY
ILETSTSDLT KTLLPIITEF LTRHLSAVSN FIKVSLLGIY QLYQSKFIRH LSAVSKLYQS
KLIRHLSAVS NAYITSDQFL LTMWATAFYI LENYHEDIKD GDFPSRIMQL VVSTASYNED
SVSTSVFLTL MKGLERLLLA DVLSKTDIES IVKLSLDRLC LPSPQRALAA LGLMFTCMYS
GKSVDQYSPR PRDMEPSFTV SSLDVIYQDP ESLILAMERV TVLFDRIRKG YPYEARVITR
LLPPFLSDFF PPQDIMNKVI GEFLSSQQPY PQLIARVVFQ VFNNLHVQNE VNLVRDWVML
SLSNFTQRSP IAMAIWSLTS FFISASTNSW LKALYPCKYK
//