GenomeNet

Database: UniProt
Entry: A0A151P9C0_ALLMI
LinkDB: A0A151P9C0_ALLMI
Original site: A0A151P9C0_ALLMI 
ID   A0A151P9C0_ALLMI        Unreviewed;      3096 AA.
AC   A0A151P9C0;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   24-JAN-2024, entry version 15.
DE   SubName: Full=Huntingtin isoform B {ECO:0000313|EMBL:KYO45676.1};
GN   Name=HTT {ECO:0000313|EMBL:KYO45676.1};
GN   ORFNames=Y1Q_0021353 {ECO:0000313|EMBL:KYO45676.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO45676.1};
RN   [1] {ECO:0000313|EMBL:KYO45676.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO45676.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- FUNCTION: May play a role in microtubule-mediated transport or vesicle
CC       function. {ECO:0000256|ARBA:ARBA00002907}.
CC   -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC       Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the huntingtin family.
CC       {ECO:0000256|ARBA:ARBA00007153}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO45676.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03000533; KYO45676.1; -; Genomic_DNA.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR048412; Htt_bridge.
DR   InterPro; IPR048413; Htt_C-HEAT_rpt.
DR   InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR   InterPro; IPR000091; Huntingtin.
DR   InterPro; IPR028426; Huntingtin_fam.
DR   InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR   PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR   PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR   Pfam; PF20925; Htt_bridge; 1.
DR   Pfam; PF20927; Htt_C-HEAT; 1.
DR   Pfam; PF12372; Htt_N-HEAT; 1.
DR   Pfam; PF20926; Htt_N-HEAT_1; 1.
DR   PRINTS; PR00375; HUNTINGTIN.
DR   SUPFAM; SSF48371; ARM repeat; 2.
PE   3: Inferred from homology;
KW   Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT   REGION          16..40
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          377..516
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1105..1164
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2571..2599
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        390..458
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        469..516
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1131..1164
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3096 AA;  344916 MW;  8AACD45339C9B296 CRC64;
     MATMEKLMKA FESLRSFQQQ QGPATVPEEP LHRPKKELST TKKDRVNHCL TICENIVAQS
     LRNSPEFQKL LGIAMELFLL CSDDAESDVR MVADECLNKV IKALMDSSLP RLQLELYKEI
     KKNGASRSLR AALWRFAELA HLVRPQKCRP YLVNLLPCLT RISKRPEESV QETLAAAIPK
     IMAAFGNFAN DNEIKVLLKA FIANLKSSSP TIRRTAAGSA VSICQHSRRT QYFYAWLLNV
     LLGLLVPVED EHPTLLILGV LLTLRYLIPL LQQQVKDTSL KGSFGVTRKE AEISPSPEQL
     IQVYELTLHY TQHQDHNVVT GALELLQQLF RTPPPDLLHA LTTLGGIAQV SVSKDESSSR
     SRSGSIVELI GKVLLGEEEG LEDDPETRSE VSTTTFTASM KSEITSELAS SSGVSTAGSV
     VSSATDPTGH DIITEQPRSQ HTLQSDSVDL SSCDLTSTAT EGEEDDVLSR SSSQISAVQS
     DPTMDLNDGT QASSPVSDSS QTTTEGPDSA VTPSDSSEIV LEGAESQYSG MQIGQLQDEE
     DEAANLLPDE SSEPFRNSSF ALQQPRLLKN MGHSRQPSDG SVDRFASKDE ALEPGDHENK
     PSRIKGDIGH YTDGNSAPLV HCVRLLSASF LLTGEKGALV PDRDVRVSVK ALAVSCVGAA
     VALHPESFFS KLYKTPLETM GEEYEEQYVS DILNYIDHGD PQVRGATAIL CGTIVNSILI
     KSRFDVENWI AVVRSSTGNL FSLVDCIPLL QKTLKDESSV TCKLACAAVR HCIMSLCSGS
     YSELGLQLIT DLLTLRNSSY WLVRTELLET LAEIDFRLVS FLEGKTDNLH RGVHHYIGLL
     KLQDRVLNNV VISLLGDEDP RVRHIAAASL MRLVPKLFYS CDQGQADPVV AVARDQSSVY
     LKLLMHETQP PSHFAVSTIT RTYRGYNMLQ SPTDVTMENN LSRVVSAISH ALTTSTTRAL
     TFGCCEALYL LSTTFPVCTW SVGWHCGVSQ MSPSEESRKS CTIGMAGMVL SLLSSAWFPL
     DLSAHQDALI LTGNLLAASA PKCLKNPWTT EEDANTGAAK QEESWPALGD RTLVTLVEQL
     FSHLLKVINI CAHVMDDVTP GPAIKAALPS LTNPPSLSPI RRKGKERDSV EQTSVPMSPK
     KGSETNPAAR QTDTPGPAPA SKSSSLGSFY HLPSYLKLYD VLKATHANYK VTLDLQNPNE
     KFGCFLRSAL DVLSQILELA TLQDIGKCVE EILGYLKSCF SREPMMATVC VQQLLKTLFG
     TNLASQYDGL SSNPSKSQGK AQRLGSSNLR PGLYHYCFMA PYTHFTQALA DASLRNMVQA
     EQEHDASGWF DVLQKVSTQL KTSISSVTKH RADKNAIHNH IRLFEPLVIK ALKQYTTTTS
     VQLQRQVLDL LAQLVQLRVN YCLLDSDQVF IGFVLKQFEY IEVGQFRESE AIIPNIFFFL
     VLLSYERYHS KQIIGIPKII QLCDGIMASG RKAVTHAIPA LQPIVHDLFV LRGTNKADAG
     KELETQKEVV VSMLLRLIQY HQVLEMFILV LQQCHKENED KWKRLSRQIA DIILPMLAKQ
     QMHIDSHEAL GVLNTLFEIL APSSLRPVDM LLRSMFVTPK TMASVSTVQL WISGILAILR
     VLISQSTEDI VLSRIQELSF SPYLISCQAI DRLRHGENVS TPEDQFEVKQ AKYMPEETFS
     RFLLQLVGIL LEDIVHKQLK VDMNEQQHTF YCQELGTLLM CLIHIFKSGT FRRITAAATR
     LFTGDGSDGS FYTLESLSGL VQAMIPTHPS LVLLWCQILL LVNYTNYSWW SEVHQTPKRH
     SLSSTKLLSP QISGDSDESD SESKLCMCNR EIVRRGALIL FCDYVCQNLH DSEHLTWLIV
     NHVQDLINLS HEPPVQDFIS AVHRNSAASG LFIQAIQSRC ENLSTPTTLK KTLQCLEGIH
     LSQSGAVLML YVDKLLCTPF RVLARMVDTL ACRRVEMLLA ATLQNSVAQL PVEELDRIQE
     YLQNSGLGAR HQRLYSLLDR FRLTVAPDTN SPSPLVTLHP LDGENRPALE NLTPDKDWYV
     SLVRSQCCVR SDSALLEGAE LVNRIPQPEL NSFMTTKEFN LSLLSPCLSL GMNEMSGDQK
     TSLFETARRV TLDHVSTIVQ KLPANHQVFQ PLQPIETSAY WNKLSDIFGD VLVYQSVMTL
     CRALAQYLLL LSKLPACLRI PPDTGSDILK FVVMSLEALS WHLIHEQVPL STDIQAVLDC
     CCLTLQQPAL WNSLASAVYV THACSLINCV RFLIEAVAVK PGDQLLSPER KKNTSKIVGE
     DEVDSNDQKM KHVTKACEMV AELVECLQTV LALGHMRNSN IPAFLTPVLK NIIISLARLP
     LVNSYTRVPP LVWKLGWSPK PLGDFGTVFP EIPVEFLQEK EIFKEFIYRI NTLGWTSRTQ
     FEETWATLLG VLVTQPIVMD QEESQQEEDT ERTQINVLAV QAITSLVLSA MTIPVAGNPA
     VSCLEQQPRN KALKALDTRF GRKLSVVRGI VEQEIQAMVS KRDNIATHHL YQAWDPVPSL
     SPATTAALIS HEKLLLQINT ERELGNMDYK LGQVSIHSVW LGNNITPLRE EEWDEDEDEE
     NDLPAPSSPP TSPINSRKHR AGVDIHSCSQ FLLELYSQWI LPSNPSKRTP VILISEVVRS
     LLAVSDLFTE RNQFEMMYTT LTELRKVHPS EDEILIQYLV PATCKAAAVL GMDKAVAEPV
     SRLLESTLRS THMPSRIGAL HGILYILECD LLDETAKQLI PIISEYLLSN LRGVAHCVSV
     HNQQHILVMC AAAFYLIENY PLDVGPEFSS GIIQMCVIMV SGNDESTPSI IYHCVLRGLE
     RLLLSEQLSR LDSESLVKLS VDRVNVHSPH RAMAALGLML TCMYTGKEKI SPSRATDANP
     TAPDSESVIV AMERVSVLFD RIRKGFPFEA RVVARILPQF LDDFFPPQDV MNKVIGEFLS
     NQQPYPQFMA TVVYKVFQTL HTTGQSSMVR DWVMLSLSNF TQRTPVAMAM WSLSCFFVSA
     STSQWISAIL PHIISRMGKS EQVDVNLFCL VAIDFYRHQI DEELDRRAFQ SVFEGDQKRP
     AEAKLKPRQT AQEPRELFCW EVRVKEKEVA ENRTFE
//
DBGET integrated database retrieval system