GenomeNet

Database: UniProt
Entry: H3AAE1_LATCH
LinkDB: H3AAE1_LATCH
Original site: H3AAE1_LATCH 
ID   H3AAE1_LATCH            Unreviewed;      3125 AA.
AC   H3AAE1;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 58.
DE   SubName: Full=Huntingtin {ECO:0000313|Ensembl:ENSLACP00000006612.1};
GN   Name=HTT {ECO:0000313|Ensembl:ENSLACP00000006612.1};
OS   Latimeria chalumnae (Coelacanth).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Coelacanthiformes; Coelacanthidae; Latimeria.
OX   NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000006612.1, ECO:0000313|Proteomes:UP000008672};
RN   [1] {ECO:0000313|Proteomes:UP000008672}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT   "The draft genome of Latimeria chalumnae.";
RL   Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLACP00000006612.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: May play a role in microtubule-mediated transport or vesicle
CC       function. {ECO:0000256|ARBA:ARBA00002907}.
CC   -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC       Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the huntingtin family.
CC       {ECO:0000256|ARBA:ARBA00007153}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AFYH01090972; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090973; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090974; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090975; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090976; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090977; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090978; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090979; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090980; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01090981; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 7897.ENSLACP00000006612; -.
DR   Ensembl; ENSLACT00000006666.1; ENSLACP00000006612.1; ENSLACG00000005863.1.
DR   eggNOG; ENOG502QR1D; Eukaryota.
DR   GeneTree; ENSGT00390000015863; -.
DR   HOGENOM; CLU_000428_0_0_1; -.
DR   InParanoid; H3AAE1; -.
DR   OMA; PNKMEEP; -.
DR   TreeFam; TF323608; -.
DR   Proteomes; UP000008672; Unassembled WGS sequence.
DR   Bgee; ENSLACG00000005863; Expressed in pelvic fin.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR048412; Htt_bridge.
DR   InterPro; IPR048413; Htt_C-HEAT_rpt.
DR   InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR   InterPro; IPR000091; Huntingtin.
DR   InterPro; IPR028426; Huntingtin_fam.
DR   InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR   PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR   PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR   Pfam; PF20925; Htt_bridge; 1.
DR   Pfam; PF20927; Htt_C-HEAT; 1.
DR   Pfam; PF12372; Htt_N-HEAT; 1.
DR   Pfam; PF20926; Htt_N-HEAT_1; 1.
DR   PRINTS; PR00375; HUNTINGTIN.
DR   SUPFAM; SSF48371; ARM repeat; 2.
PE   3: Inferred from homology;
KW   Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008672}.
FT   REGION          18..40
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          387..533
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1142..1200
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2610..2639
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        405..437
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        447..473
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        484..533
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3125 AA;  347677 MW;  27DC9C80DA97BE19 CRC64;
     MATMEKLMKA FESLKSFQQQ QGPAIAEEPA QKQKKDPTTT KKDRLTHCLT ICENIVVQSL
     RNSPEFQKLL GIAMELFLLC SDDSESDVRM VADECLNKVV KALMDSNLPR LQLELYKEIK
     KSMGSQDLNE VVLRSIRLCW MVCPPQARPY LVNLLPCLTR ISKRPEEAVQ ETLAAAIPKI
     MAALGNFAND SEIKVLLKAF VANLKSSSPT IRRTAAGSAV SICQHSRRTQ YFYTWLLNVL
     LGLLVPVEDD RPNPLVLGVL LTLRYLMPLL QQQVKDTSLK GSFGVTRKET EISPSPMQLI
     QVYELTLHYT QHRDHNVVTA ALELLQQLFR TPPPDLLHAL TTSGTMRSIS ITKSEADSRN
     RSGSILELIA GGGSSCSPVL LRKQKGKVLS GEEEALEDES ETRSDVTTTT FTASVNSEAA
     SDVPSSSGAS TGGSVSPVPE SVGHDIITEQ PRSQHNLQST DSVDLSGSDL TSTATDGDDD
     EMLSRSSSQI STVQSDAAPD INGKITQDSS PTSDSSQTTT EGPDSAVTPS DSSELVLDGT
     ESQYSGMQIG QLQDEEEETT NIVAEEFVET FKNSALAMNK PHLLQTMGHS RQSSDSSVDR
     FISKDEGAEI GDLESKPSKI KGDIGHYTDD DAAPLIHCVR LLSASFLLTG QKNGLIQDKE
     VRVSVKALAV SCVGAATALH PEAFFNKLYK IPWESTDKPE EQYVSDVLKY IEHGDPQIRG
     ATAILCGTII HSILSKSRFN VESWMSSVKS STENTITLVD FIPLLQKTLK DESSVTCKLA
     CAAVRHCITS LCSSSYSELG LQLLIDLLTL RNNSYWLVRT ELLETLAEMD FRLVSFLERQ
     AENLHRGDHH YTGLLKLQER VLSDVVIFLL GDEDPRVRHV AATVLPRLVP RLFYDCDQGQ
     ADPVIAVARE QSSVYLQLLM HETQPASHIT VSTITRTYRG YNILQNIPDV TVENSLSRVI
     TAVSHALTSS TSRALTLGCV EALCLLSTIF PVCSWSVGWH CGYIASGSQA SRVHLYKNRG
     RSFSMSQSNP TDESRKSCVV GMANMIQSLL SSAWFPLDLS SHQDALILAG NLLSAAAPKC
     LKNPWTAEDD VNTSTNKPEE PWPALGDRTL VSMVEQLFSH LLKVINICAH IIDDVAPGPT
     LKATLPSLAN PPSLSPIRRK GKEREPVEQT AAPMSPKKNS ETNPVARPAE SVGAAPTSKS
     TSLGGFYHLP SYLKLYDVLK ATHANYKVTL DLQNSNEKFG SFLRSALDVL SQLLELATLH
     DIGKCVEEIL GYLKSCFSRE ATMATVCVQQ LLKTLFGTNL ASQYDGFSSN PSRSLGKAQR
     LGSSSLRPGL YHYCFMAPYT HFTQALADAS LRNMVQAEHE QDTSGWFDVL QKVSSQLKSG
     IVNVTKHRAD KNAIHNHIRL FEPLVIKALK QYTTTTSVQL QRQVLDLLAQ LVQLRVNYCL
     LDSDQVFIGF VLKQFEYIEV GQFRESEAII PSIFFFLVLL SYERYHSKQI IGIPKIIQLC
     DGIMASGRKA VTHAIPALQP IVHDLFVLRG SNKADAGKEL ETQKEVVVSM LLRLIQYHQV
     LEMFILVLQQ CHKENEDKWK RLSRQIADVI LPMLAKQQMH LDSHEALGVL NSLFETVAPS
     SLRPVDMLLR SMFVIPSTLA LVGTVQLWIS GILAILRVLI SQSTEDIVLS RIQELSLSPN
     LILCQNINRL KEGGECVPPV EEQNGETQVK YLPEETLARF LLQLVGILLE EVATKQIKVD
     MSEQQHTFYC QQLGTLLMCV IHIFKSGMFR RITAAATKLF KGDGADGSFY TLENLNSLVQ
     SMIPTYPSLV LLWCQILLLI NYTNYTWWSE VHQTPRRHSL SSTKLLSPQI SGDSDESESK
     SKLGMCNREI VRRGALILFC DYVCQNLHDS EHLTWLIVNH VQDLIGLSHE PPVQDFISAV
     HRNSAASGLF IQAIQSRCEN LTTPTMLKKT LQCLEGIHLS QSGAVLMLYV DKLICTPFRV
     LARMVDTLVC RRVEMLLAET FQNSIAQLPV EELDRIQQYL HSNGLAQRHQ RLYSLLDRFR
     AMIAEDTVSP SPLVSSHPLD GEKPLESIIV DKWGGKEFYL SLVQSQCCCR ADSALLECTK
     LLNKLPQPEL YSIMTTKEFN LSLLAPCLSF GLHSMSGEQG SALFETACKV TLDHVTHSLQ
     HLPGSHQVFQ PLKPMGNVSY WNKLSDLFGD GLYYQTIVTL CRALAQYLTS LSKQPADVHI
     PVDKEADITR FIVLSLEALS WHLMNDQVPL SVDLQAALDC CCLALQQPGI WSLLASAAYV
     TQACTVIICI KFIVEAGNST QVRYTKYIES TALFFEESVA EDCIAKAEPE YITIACEVMA
     EMVESLQSVL CFGHRRNNSI PAFLTPILRN ITISLARLPL VNSYTRLPPL VWKLGWSPRA
     GGDFGTSLPE IPVEFLQEKD IFREFIYRIN TLGWTNRTQF EETWATLLGV LVTQPIVMDQ
     EEERPLEEDT ERTQINVLAV QAITSLVLSA MTLPVAGNPA VSCLEQQPRN KTLKALDTRF
     GRKLSVIRGI VEREIQAMVS KRENIATHYP YQAWDPVPSL SPATSGNLVS HEKLLLQINM
     EREMGSMEYK LGQVSIHSVW LGNSITPLRE EEWDEDEEDE SDVPAPSSPQ TSPINSRKHR
     AGVDIHSCSQ FLLELYSQWI LPSNPGKKSP AILISEVVRS LLAVSDLFTE RNQFEMMYTT
     LTELQKFHPS EDEILNQYLV PAICKAAAVL GMDKAIAEPV SRLLETTLRS THMPSRIGAL
     HGILYVLECD LLDDTAKQLI PIISDYLVSN LRGIAHCVNV HNQQHVLVMC AAAFYLIENY
     PLDVGPEFTA GIIQMCGVMV SGSEESTPSI IYHCVLRGLE RLLLSEQLSR LDGEALVKLS
     VDRLNMHSPH RAMAALGLML TCMYTEKYVK KGKEKSSPGR ISNADPTAPD SESVIVAMER
     VSVLFDRIRK GFPCEARVVA RILPQFLDDF FPPQDVMNKV IGEFLSNQQP YPQFMANVVY
     KVFQTLHTTG QSSMVRDWVM LSLSNFTQRT PVAMAMWSLS CFFVSASTSP WVSALLPHVI
     SRMGKSEQVD VNLFCLVAID FYRHQIDEEL DRRSFQSVFE IVASPGTPYH RLLTCLQSLH
     QITQL
//
DBGET integrated database retrieval system