ID Q9V3N4_DROME Unreviewed; 3583 AA.
AC Q9V3N4;
DT 01-MAY-2000, integrated into UniProtKB/TrEMBL.
DT 01-MAY-2000, sequence version 1.
DT 24-JAN-2024, entry version 173.
DE SubName: Full=Huntingtin, isoform A {ECO:0000313|EMBL:AAF56808.1};
DE SubName: Full=Huntingtin, isoform B {ECO:0000313|EMBL:AGB96421.1};
DE SubName: Full=Huntington disease protein homolog {ECO:0000313|EMBL:AAF03255.1};
GN Name=htt {ECO:0000313|EMBL:AAF56808.1,
GN ECO:0000313|FlyBase:FBgn0027655};
GN Synonyms=dHtt {ECO:0000313|EMBL:AAF56808.1}, dhtt
GN {ECO:0000313|EMBL:AAF56808.1}, Dmel\CG9995
GN {ECO:0000313|EMBL:AAF56808.1}, DmHtt {ECO:0000313|EMBL:AAF56808.1}, HD
GN {ECO:0000313|EMBL:AAF56808.1}, Hsap\HD {ECO:0000313|EMBL:AAF03255.1},
GN HTT {ECO:0000313|EMBL:AAF56808.1}, Htt {ECO:0000313|EMBL:AAF56808.1},
GN huntingtin {ECO:0000313|FlyBase:FBgn0027655};
GN ORFNames=CG9995 {ECO:0000313|EMBL:AAF56808.1,
GN ECO:0000313|FlyBase:FBgn0027655}, Dmel_CG9995
GN {ECO:0000313|EMBL:AAF56808.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:AAF03255.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=10441347; DOI=10.1093/hmg/8.9.1807;
RA Li Z., Karlovich C.A., Fish M.P., Scott M.P., Myers R.M.;
RT "A putative Drosophila homolog of the Huntington's disease gene.";
RL Hum. Mol. Genet. 8:1807-1815(1999).
RN [2] {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3] {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [4] {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [5] {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [6] {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [7] {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [8] {ECO:0000313|EMBL:AAF56808.1}
RP NUCLEOTIDE SEQUENCE.
RA Celniker S., Carlson J., Wan K., Frise E., Hoskins R., Park S.,
RA Svirskas R., Rubin G.;
RL Submitted (AUG-2006) to the EMBL/GenBank/DDBJ databases.
RN [9] {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [10] {ECO:0000313|EMBL:AAF56808.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
RN [11] {ECO:0000313|EMBL:AAF56808.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=26109357; DOI=.1534/g3.115.018929;
RG FlyBase Consortium;
RA Matthews B.B., Dos Santos G., Crosby M.A., Emmert D.B., St Pierre S.E.,
RA Gramates L.S., Zhou P., Schroeder A.J., Falls K., Strelets V., Russo S.M.,
RA Gelbart W.M., null;
RT "Gene Model Annotations for Drosophila melanogaster: Impact of High-
RT Throughput Data.";
RL G3 (Bethesda) 5:1721-1736(2015).
RN [12] {ECO:0000313|EMBL:AAF56808.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=26109356; DOI=.1534/g3.115.018937;
RG FlyBase Consortium;
RA Crosby M.A., Gramates L.S., Dos Santos G., Matthews B.B., St Pierre S.E.,
RA Zhou P., Schroeder A.J., Falls K., Emmert D.B., Russo S.M., Gelbart W.M.,
RA null;
RT "Gene Model Annotations for Drosophila melanogaster: The Rule-Benders.";
RL G3 (Bethesda) 5:1737-1749(2015).
RN [13] {ECO:0000313|EMBL:AAF56808.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25589440;
RA Hoskins R.A., Carlson J.W., Wan K.H., Park S., Mendez I., Galle S.E.,
RA Booth B.W., Pfeiffer B.D., George R.A., Svirskas R., Krzywinski M.,
RA Schein J., Accardo M.C., Damia E., Messina G., Mendez-Lago M.,
RA de Pablos B., Demakova O.V., Andreyeva E.N., Boldyreva L.V., Marra M.,
RA Carvalho A.B., Dimitri P., Villasante A., Zhimulev I.F., Rubin G.M.,
RA Karpen G.H., Celniker S.E.;
RT "The Release 6 reference sequence of the Drosophila melanogaster genome.";
RL Genome Res. 25:445-458(2015).
RN [14] {ECO:0000313|EMBL:AAF56808.1}
RP NUCLEOTIDE SEQUENCE.
RG Berkeley Drosophila Genome Project;
RA Celniker S., Carlson J., Wan K., Pfeiffer B., Frise E., George R.,
RA Hoskins R., Stapleton M., Pacleb J., Park S., Svirskas R., Smith E., Yu C.,
RA Rubin G.;
RT "Drosophila melanogaster release 4 sequence.";
RL Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
RN [15] {ECO:0000313|EMBL:AAF56808.1}
RP NUCLEOTIDE SEQUENCE.
RG FlyBase;
RL Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF146362; AAF03255.1; -; mRNA.
DR EMBL; AF147779; AAF03256.1; -; Genomic_DNA.
DR EMBL; AE014297; AAF56808.1; -; Genomic_DNA.
DR EMBL; AE014297; AGB96421.1; -; Genomic_DNA.
DR RefSeq; NP_001263041.1; NM_001276112.2.
DR RefSeq; NP_651629.1; NM_143372.2.
DR DIP; DIP-61619N; -.
DR IntAct; Q9V3N4; 7.
DR STRING; 7227.FBpp0307764; -.
DR PaxDb; 7227-FBpp0084679; -.
DR EnsemblMetazoa; FBtr0085310; FBpp0084679; FBgn0027655.
DR EnsemblMetazoa; FBtr0336788; FBpp0307764; FBgn0027655.
DR GeneID; 43392; -.
DR KEGG; dme:Dmel_CG9995; -.
DR UCSC; CG9995-RA; d. melanogaster.
DR AGR; FB:FBgn0027655; -.
DR CTD; 3064; -.
DR FlyBase; FBgn0027655; htt.
DR VEuPathDB; VectorBase:FBgn0027655; -.
DR eggNOG; ENOG502QR1D; Eukaryota.
DR GeneTree; ENSGT00390000015863; -.
DR HOGENOM; CLU_224535_0_0_1; -.
DR InParanoid; Q9V3N4; -.
DR OMA; IACFQQI; -.
DR OrthoDB; 6903at2759; -.
DR BioGRID-ORCS; 43392; 0 hits in 1 CRISPR screen.
DR GenomeRNAi; 43392; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0027655; Expressed in mouthpart and 26 other cell types or tissues.
DR ExpressionAtlas; Q9V3N4; baseline and differential.
DR GO; GO:0005737; C:cytoplasm; IDA:FlyBase.
DR GO; GO:0043005; C:neuron projection; IEA:GOC.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0008088; P:axo-dendritic transport; IMP:FlyBase.
DR GO; GO:0000132; P:establishment of mitotic spindle orientation; IMP:FlyBase.
DR GO; GO:0033696; P:heterochromatin boundary formation; IMP:FlyBase.
DR GO; GO:0048489; P:synaptic vesicle transport; IMP:FlyBase.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR048413; Htt_C-HEAT_rpt.
DR InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR InterPro; IPR028426; Huntingtin_fam.
DR InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR Pfam; PF20927; Htt_C-HEAT; 1.
DR Pfam; PF12372; Htt_N-HEAT; 3.
DR Pfam; PF20926; Htt_N-HEAT_1; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
PE 1: Evidence at protein level;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Proteomics identification {ECO:0007829|PeptideAtlas:Q9V3N4};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803}.
FT REGION 401..458
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 472..512
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 564..600
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 660..682
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 698..722
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 863..892
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 971..1010
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1101..1179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1200..1224
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1360..1381
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1434..1453
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1982..2001
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3291..3310
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 401..429
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..492
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 493..512
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 573..588
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 660..679
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 863..885
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 992..1010
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1102..1118
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1119..1133
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1134..1179
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1200..1221
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3583 AA; 395867 MW; 98C83978C4F0A888 CRC64;
MDKSRSSAYD KFVGFVEQLR NTECSQKQKI TCFQQIAECI MSPSLAGHIN YAAHCGTATN
VLLLFCEDVD SVVRMSAEEN LNKILRSLEK TRVSRILMDL YGEIKRNGNQ RSLRICLNLF
SYYAPQIKEK HIKWYAVRLL QCMTTISQRK ETLLQETLCD FVKHFSRHIQ QGLSDSESCK
LFETFLDQIS SDCAVKRRCS AQNCMSLIEN ARNRSLMARH GVNKVMELLL TDQQANSVLG
ALGLLRLLLP QLIRGYPGDS HEDSESLAGK KQQQQQTTTS DCRQIIEIYD YCLHLLSTQH
TANHAIINAT LEVINGILQA VDAASDGQCS QSLGQSLRQL LCNQQLQHNE YLRRRKSLKN
QIFQLKNYEV ATSQHQLEDE DENEDVDELV VGATAMQMKK NSNAKLQQAK CREQQQHQHQ
QQLEVDNSSL GINAGEDAPT EAPSSVADEG GEPESTKLRC HIRNAARSIS ECVASDEDKQ
GQGHRQQRDE DGVVVAEDDD DDDDDDDDDD DMELLSAECD DFTTLSQLNE QQQALSAALK
LPTTTAASSG GAATSQDDKL IDVDADVGGL PKPQHQSSLQ NLLAGSDDKS QHLSDIDNES
FNSIDFDAEI TIAGSKEQQQ QHPPADDSVE SGDATAIGTF FNNLLSHSNA ASESVSKLFR
QSSGSKSTPS KSASTPAPAD KSDAISAASL TLSLTSLASS NLEPPERQPL IAETPTPVED
SCSITASHTA STALMMDAPA VEVAASKPET PQLRGTPNAN PFLVENSPLR QTVVGRALIT
VKIGSILEQS LVYYTARLVA ARFLLSGQAA GLQPDSISRV SIKSLSLAVI AQCVRLAPKI
LQLSLEISEQ ELQLLEEATS QIGSGDSTQV SSPQSSDNSQ VGGEKPPLDS SLVPTSLEEN
LLLLDIKDDH FGPSTCPAYL QSATPTLSRS ADASVLLLEG GTTSSRSAKK SEEMLSKSEI
IESSYRPTVA VEDVPPLSMP PRPPKRTKST RSRVGVLGTS STTESSSPQS RQKLSDILLF
HDHCDPILRG GVQQVVGNFL QSSGAGLFLD LQRGLGLQHL LAILLKGFED EIHTVVIQAL
NAFDKIFPNV VSKYLTEPPC HYHAHQQQQQ QQKEQQQQEQ DNQKLEQDLQ RHSSGQQKRS
GQAQTFGQQT FAKDQDNALS SQRQQQRRPN DAGTCANSSA TDNDELLAAL LNDFQLQSTG
MRQQPKNNST DTGQSGNEPD LEPNPNAAVE PFCVFAISPK LLLSKLRLCH HNKYWLVQNK
YAEVISNLNY VLLRSYYANF RCAIDNKNSG ARKQDSKWPP MDASSVCHSV RDAEGEDIVC
TYEAQFLAEL LHLLGDDDAR VREHAACCLC RFIMQTARQD PSQDQGAGGG GGDDIEGNGN
VNVETQQTNF NLLWDFFDYR IFGSMSVTLR NLFRASSTIV PPLAELDALA TSNSAPSYPD
TGSTSGSSTS TSASSGGSAA AVSAASAYFE ASYGIGIAEG HVFALASASQ RQIAQEEKVL
AKVLYRLTNK LMTLNDKNVQ FGIIYALRLL LRHFNFVDYQ QVWLEFNFVE ICISYAYYNN
ATAADLGCQN DLIDVMGKLM AGAMLSSGEP NTAHLDFLLR HSVKMLNIYY HLVTNQRPPT
AGSQSGSSSS KQPKSELFAR EQPAATLQAL GYFAGDYVYM KLYNILRGAN DSYKITINQE
AGSLLICLLK TCLHAVSLCL EGMASASPPE LKLIEEILHY LTRLINYAPA ECVACLRQLL
KYLFAQNYAS QVRLQPSAIG GNGSEIGHHA AFMRPYFAAK GRGHGASSTL LPTINSKPAV
AVGSQRGAPT DARQPIDAGP LQDMGMLFVH GLQPPTPPAG DCVRLIKLFE PMVIYCLTLF
MKSNALVQAP ILRLLSQLLD LNVTYSILDS KNVIFDQVLS NMDLIEGGID RNAFIMVPPM
LRFLVQLTHK SDRQLITIPK IISITNNLLA NGSVRVVALL ALKTLSYELF FMHSQLEEAL
DTEGHNSGRD ACQSPLSAAP TPETREALLA QRRELDTQRE VVLGMLEKFI EARPSQQVLA
LLLLFERSVQ QLDTPPYRSA QDADAVYGTL CRGLCSRQWR LHNAGDLRLL ESCFRNNGNH
VLADSKRFLQ LLQLFIEQGV GNFGDLALAM VMLSNVILKT EEIYLVNHIK LYLKNNPTAE
RRLQALMPSS PSAAPHWQDE APSTSSAAAA ARAAAASFSA GRSSISEINY FAKVLCEKLL
ACLEVLLGLE PSSSSHAYCQ LTGRFMDALL NVCCRSRHKD ALQSVFRLVL AESEFLCKYY
SLLLMSAAGL VGSYLLDAVL LACLRLVLAM RLEEPVALLE QAAQLPLKTN LQRALLREVC
RASAGCDWSA QQVRRLFEGR YLNFLIADHL EFICELCQER PECGSLLQVA LFRNAHRLSR
QSVRIVLRLL GRLCEPAERE ASGDVGSDAD PTLQAMQLLS RLHQLYEGER SLQLPIERMA
RRLSAQSGQP ARNVIYERLV EGDLAGGEDQ DALRTLLLKD LECRQDNETA TPSRIIDESW
LFAQLIKFAT QHADAPQQQK QLMLLLLEIQ SEPKLQRLLR SLGTEHEAKL LRHAIAGSLA
AMMSAFRQKC IQHAPHINYM QPTPLARVSC ALLMSRVAST EATKCRNPPT GEQLDVARAV
GALMACIRNA EQTALIYIDA RLMEKFVVEH LLRREHLPQL LAYLGWLAGA AKQILAMPTR
QESEQDALGV LLATVNTLLQ QPRVWRELNA SSDPSLRCEL LDLLDSVARC ILQDTIFYRR
HRRDRNKAKG PAPQAIFLAK LIETQIEIES LASGRVLAGV EEARLQFAGQ DLARFQVALS
LVTSIGISLL RTHQFYAYAV TPHELIQQPG DQQQEQQADG KLPSIPVDSL SDVDVLRQFV
KRLSIFGFTT RQQFEEYFMT CLLLINKLYD EHMVDQQEQF QIKQVCLQAI LELLMTYKTF
PIVGLANGQF HHTTRWQRIT CDSISLKKLH KVQLLVDACN VFYQPNLERQ LAYDNVIGTR
TFAPNQYDLN FSWAQMEDQA AAGVGVGVGV GLSGGEQANT ADIKQSCDAD VPDMAMRNYR
HFTQLSGIDF RSSTQLVFDV LQQMIELNHI LVLPNLVKFC EICESRDHIK WIKERCLKLQ
EQVAMDDTIS HQHIIYLLCR SQALLIPSLG ELQVLCSLIG NVYLKSTHSF IRIATLQGLL
CLLECCSKTN TTMGRLSEEL ALLRSLIVGY INRHGIIDES PLPFSVEHTK LVWTLNYSLI
EWTSKFVPQC HLLSNTIIAA NNFLKTTADE ELYLCVLHGL ERMVVNSGVP PPGIQPTGKD
AAAGEPGAEG SKAGVGVVVT PQMRHKIEKL ALELLKMENE KFSIPALKLL LSCMYVGSAA
QLENTELSNG IVQDDPEIIA QQNDKVDILL HCIKSSTRDA AWIYGQVLCQ IIRDLVPPNE
ILTKVIKEFL AINHPHCDVI AMIVYQVFRS AIDSSYLQML QDWLICTLPT FLDQPEQQGV
WGLSVIFLSA SINLHLIKLF PLVLGIGASN SAAAATTATT ATTEAEAAAP AMARKLGQHE
IALFVTAAQD FHAKLSGEQR QRFREAFGSF KRSQVYGRML QCL
//