ID A0A151P9C0_ALLMI Unreviewed; 3096 AA.
AC A0A151P9C0;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE SubName: Full=Huntingtin isoform B {ECO:0000313|EMBL:KYO45676.1};
GN Name=HTT {ECO:0000313|EMBL:KYO45676.1};
GN ORFNames=Y1Q_0021353 {ECO:0000313|EMBL:KYO45676.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO45676.1};
RN [1] {ECO:0000313|EMBL:KYO45676.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO45676.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- FUNCTION: May play a role in microtubule-mediated transport or vesicle
CC function. {ECO:0000256|ARBA:ARBA00002907}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the huntingtin family.
CC {ECO:0000256|ARBA:ARBA00007153}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO45676.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03000533; KYO45676.1; -; Genomic_DNA.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR048412; Htt_bridge.
DR InterPro; IPR048413; Htt_C-HEAT_rpt.
DR InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR InterPro; IPR000091; Huntingtin.
DR InterPro; IPR028426; Huntingtin_fam.
DR InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR Pfam; PF20925; Htt_bridge; 1.
DR Pfam; PF20927; Htt_C-HEAT; 1.
DR Pfam; PF12372; Htt_N-HEAT; 1.
DR Pfam; PF20926; Htt_N-HEAT_1; 1.
DR PRINTS; PR00375; HUNTINGTIN.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 3: Inferred from homology;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT REGION 16..40
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 377..516
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1105..1164
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2571..2599
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 390..458
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 469..516
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1131..1164
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3096 AA; 344916 MW; 8AACD45339C9B296 CRC64;
MATMEKLMKA FESLRSFQQQ QGPATVPEEP LHRPKKELST TKKDRVNHCL TICENIVAQS
LRNSPEFQKL LGIAMELFLL CSDDAESDVR MVADECLNKV IKALMDSSLP RLQLELYKEI
KKNGASRSLR AALWRFAELA HLVRPQKCRP YLVNLLPCLT RISKRPEESV QETLAAAIPK
IMAAFGNFAN DNEIKVLLKA FIANLKSSSP TIRRTAAGSA VSICQHSRRT QYFYAWLLNV
LLGLLVPVED EHPTLLILGV LLTLRYLIPL LQQQVKDTSL KGSFGVTRKE AEISPSPEQL
IQVYELTLHY TQHQDHNVVT GALELLQQLF RTPPPDLLHA LTTLGGIAQV SVSKDESSSR
SRSGSIVELI GKVLLGEEEG LEDDPETRSE VSTTTFTASM KSEITSELAS SSGVSTAGSV
VSSATDPTGH DIITEQPRSQ HTLQSDSVDL SSCDLTSTAT EGEEDDVLSR SSSQISAVQS
DPTMDLNDGT QASSPVSDSS QTTTEGPDSA VTPSDSSEIV LEGAESQYSG MQIGQLQDEE
DEAANLLPDE SSEPFRNSSF ALQQPRLLKN MGHSRQPSDG SVDRFASKDE ALEPGDHENK
PSRIKGDIGH YTDGNSAPLV HCVRLLSASF LLTGEKGALV PDRDVRVSVK ALAVSCVGAA
VALHPESFFS KLYKTPLETM GEEYEEQYVS DILNYIDHGD PQVRGATAIL CGTIVNSILI
KSRFDVENWI AVVRSSTGNL FSLVDCIPLL QKTLKDESSV TCKLACAAVR HCIMSLCSGS
YSELGLQLIT DLLTLRNSSY WLVRTELLET LAEIDFRLVS FLEGKTDNLH RGVHHYIGLL
KLQDRVLNNV VISLLGDEDP RVRHIAAASL MRLVPKLFYS CDQGQADPVV AVARDQSSVY
LKLLMHETQP PSHFAVSTIT RTYRGYNMLQ SPTDVTMENN LSRVVSAISH ALTTSTTRAL
TFGCCEALYL LSTTFPVCTW SVGWHCGVSQ MSPSEESRKS CTIGMAGMVL SLLSSAWFPL
DLSAHQDALI LTGNLLAASA PKCLKNPWTT EEDANTGAAK QEESWPALGD RTLVTLVEQL
FSHLLKVINI CAHVMDDVTP GPAIKAALPS LTNPPSLSPI RRKGKERDSV EQTSVPMSPK
KGSETNPAAR QTDTPGPAPA SKSSSLGSFY HLPSYLKLYD VLKATHANYK VTLDLQNPNE
KFGCFLRSAL DVLSQILELA TLQDIGKCVE EILGYLKSCF SREPMMATVC VQQLLKTLFG
TNLASQYDGL SSNPSKSQGK AQRLGSSNLR PGLYHYCFMA PYTHFTQALA DASLRNMVQA
EQEHDASGWF DVLQKVSTQL KTSISSVTKH RADKNAIHNH IRLFEPLVIK ALKQYTTTTS
VQLQRQVLDL LAQLVQLRVN YCLLDSDQVF IGFVLKQFEY IEVGQFRESE AIIPNIFFFL
VLLSYERYHS KQIIGIPKII QLCDGIMASG RKAVTHAIPA LQPIVHDLFV LRGTNKADAG
KELETQKEVV VSMLLRLIQY HQVLEMFILV LQQCHKENED KWKRLSRQIA DIILPMLAKQ
QMHIDSHEAL GVLNTLFEIL APSSLRPVDM LLRSMFVTPK TMASVSTVQL WISGILAILR
VLISQSTEDI VLSRIQELSF SPYLISCQAI DRLRHGENVS TPEDQFEVKQ AKYMPEETFS
RFLLQLVGIL LEDIVHKQLK VDMNEQQHTF YCQELGTLLM CLIHIFKSGT FRRITAAATR
LFTGDGSDGS FYTLESLSGL VQAMIPTHPS LVLLWCQILL LVNYTNYSWW SEVHQTPKRH
SLSSTKLLSP QISGDSDESD SESKLCMCNR EIVRRGALIL FCDYVCQNLH DSEHLTWLIV
NHVQDLINLS HEPPVQDFIS AVHRNSAASG LFIQAIQSRC ENLSTPTTLK KTLQCLEGIH
LSQSGAVLML YVDKLLCTPF RVLARMVDTL ACRRVEMLLA ATLQNSVAQL PVEELDRIQE
YLQNSGLGAR HQRLYSLLDR FRLTVAPDTN SPSPLVTLHP LDGENRPALE NLTPDKDWYV
SLVRSQCCVR SDSALLEGAE LVNRIPQPEL NSFMTTKEFN LSLLSPCLSL GMNEMSGDQK
TSLFETARRV TLDHVSTIVQ KLPANHQVFQ PLQPIETSAY WNKLSDIFGD VLVYQSVMTL
CRALAQYLLL LSKLPACLRI PPDTGSDILK FVVMSLEALS WHLIHEQVPL STDIQAVLDC
CCLTLQQPAL WNSLASAVYV THACSLINCV RFLIEAVAVK PGDQLLSPER KKNTSKIVGE
DEVDSNDQKM KHVTKACEMV AELVECLQTV LALGHMRNSN IPAFLTPVLK NIIISLARLP
LVNSYTRVPP LVWKLGWSPK PLGDFGTVFP EIPVEFLQEK EIFKEFIYRI NTLGWTSRTQ
FEETWATLLG VLVTQPIVMD QEESQQEEDT ERTQINVLAV QAITSLVLSA MTIPVAGNPA
VSCLEQQPRN KALKALDTRF GRKLSVVRGI VEQEIQAMVS KRDNIATHHL YQAWDPVPSL
SPATTAALIS HEKLLLQINT ERELGNMDYK LGQVSIHSVW LGNNITPLRE EEWDEDEDEE
NDLPAPSSPP TSPINSRKHR AGVDIHSCSQ FLLELYSQWI LPSNPSKRTP VILISEVVRS
LLAVSDLFTE RNQFEMMYTT LTELRKVHPS EDEILIQYLV PATCKAAAVL GMDKAVAEPV
SRLLESTLRS THMPSRIGAL HGILYILECD LLDETAKQLI PIISEYLLSN LRGVAHCVSV
HNQQHILVMC AAAFYLIENY PLDVGPEFSS GIIQMCVIMV SGNDESTPSI IYHCVLRGLE
RLLLSEQLSR LDSESLVKLS VDRVNVHSPH RAMAALGLML TCMYTGKEKI SPSRATDANP
TAPDSESVIV AMERVSVLFD RIRKGFPFEA RVVARILPQF LDDFFPPQDV MNKVIGEFLS
NQQPYPQFMA TVVYKVFQTL HTTGQSSMVR DWVMLSLSNF TQRTPVAMAM WSLSCFFVSA
STSQWISAIL PHIISRMGKS EQVDVNLFCL VAIDFYRHQI DEELDRRAFQ SVFEGDQKRP
AEAKLKPRQT AQEPRELFCW EVRVKEKEVA ENRTFE
//