GenomeNet

Database: UniProt
Entry: A0A0V1LLW2_9BILA
LinkDB: A0A0V1LLW2_9BILA
Original site: A0A0V1LLW2_9BILA 
ID   A0A0V1LLW2_9BILA        Unreviewed;      3142 AA.
AC   A0A0V1LLW2;
DT   16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT   16-MAR-2016, sequence version 1.
DT   24-JAN-2024, entry version 28.
DE   SubName: Full=Huntingtin {ECO:0000313|EMBL:KRZ60178.1};
GN   Name=HTT {ECO:0000313|EMBL:KRZ60178.1};
GN   ORFNames=T02_13920 {ECO:0000313|EMBL:KRZ60178.1};
OS   Trichinella nativa.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC   Trichinellida; Trichinellidae; Trichinella.
OX   NCBI_TaxID=6335 {ECO:0000313|EMBL:KRZ60178.1, ECO:0000313|Proteomes:UP000054721};
RN   [1] {ECO:0000313|EMBL:KRZ60178.1, ECO:0000313|Proteomes:UP000054721}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ISS10 {ECO:0000313|EMBL:KRZ60178.1};
RA   Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT   "Evolution of Trichinella species and genotypes.";
RL   Submitted (MAY-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC       Nucleus {ECO:0000256|ARBA:ARBA00004123, ECO:0000256|PROSITE-
CC       ProRule:PRU00649}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KRZ60178.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JYDW01000032; KRZ60178.1; -; Genomic_DNA.
DR   Proteomes; UP000054721; Unassembled WGS sequence.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR   Gene3D; 2.20.25.10; -; 1.
DR   Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR   Gene3D; 1.10.472.30; Transcription elongation factor S-II, central domain; 1.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR048413; Htt_C-HEAT_rpt.
DR   InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR   InterPro; IPR028426; Huntingtin_fam.
DR   InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR   InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR   InterPro; IPR003618; TFIIS_cen_dom.
DR   InterPro; IPR036575; TFIIS_cen_dom_sf.
DR   InterPro; IPR017923; TFIIS_N.
DR   PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR   PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR   Pfam; PF20927; Htt_C-HEAT; 2.
DR   Pfam; PF12372; Htt_N-HEAT; 1.
DR   Pfam; PF20926; Htt_N-HEAT_1; 1.
DR   Pfam; PF08711; Med26; 1.
DR   Pfam; PF07500; TFIIS_M; 1.
DR   SMART; SM00510; TFS2M; 1.
DR   SUPFAM; SSF48371; ARM repeat; 1.
DR   SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR   SUPFAM; SSF46942; Elongation factor TFIIS domain 2; 1.
DR   PROSITE; PS51321; TFIIS_CENTRAL; 1.
DR   PROSITE; PS51319; TFIIS_N; 1.
PE   4: Predicted;
KW   Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000054721}.
FT   DOMAIN          19..93
FT                   /note="TFIIS N-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51319"
FT   DOMAIN          153..267
FT                   /note="TFIIS central"
FT                   /evidence="ECO:0000259|PROSITE:PS51321"
FT   REGION          755..789
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        775..789
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3142 AA;  357328 MW;  939A1BC7C0E479FF CRC64;
     MTNNVQRQRL EFRVREIENE LSEMLDKNAL NKTKMRKLLE ELRTYGGIDV DMLVSTNIGK
     TVNRIRTTFV KCPELFENAT FLVKKWKKEV SALSTSSRRK SLSKSTSQSS DDMDQFSTSE
     LCSQSTTNFP LFNLSSPVRH NSCRALFDSI RSNLNNCRAV FEDCDANLLN DNNIKTIVQQ
     IEESIYELNG SDETNSKYCS EIRSHAMNLC NSRNCQLLRD ILTGKILPAN FAKMTTEEMA
     PEEVKNMRKA VERDSLKEHM LSNEGSLLHS TTFHCRQCGQ RDCNYTVSYE KDGHEAEAVT
     YVVCNQCGHR MEKLMKNLES LRRPTNMLKK EAVPLSKEKA ALFYQVAEAL SSNTLKQHAE
     YGKAFGNAFT ILFSYCDDPC TDVRILTEET MNIIIRSCIN EWNISRVHSE LCKELKRNGS
     ARCIRSAMGK FVEIVELIGI QKRRQFAITL IPLFEQIIKR PEEMIQDCIG STIARLLQTL
     GRYFRDEEIH GLIHSALANL QLQSTLFRRV AATFITELCH NSRRRSDMIS YWFKYILTKV
     TKQDCDDVFH LLGNLLTLKM FIPLVLKEQQ IDQQYSTNVG GGQRVKQISS ENFVQLCEAI
     LFYTAVPNDV VSSLALEMLV KIFENSTETQ CAIWLSKNSV KKSRFFFPSA DCDLTSTADV
     EPVRSESISS ASCSDDPTAL QQDAAVEQPV LNISFQGDDC LERRVQRFSA SSDDQSQFGE
     RESLESICIG RLKYSGLQRS VESVDERLAL MEMDEEAKSN QQQSNGRKFD EHSHSSSTCA
     SVSGASVNSG KTFDKSTISI VADWAKDEFD SLLAETTSMA QFCCRLICRR FLLSGQAGVT
     LPDEKVRVSH KVLSLATLER LFTLDHALYN LSLYEGCSQN ISDVLLLCKH LDPQVRGFVS
     QAVASLLIST ATAVQSRKWN NDDDQEFKEE MKVLKRFLEI LSNATRDESN VCIHYTLSSL
     RRTFSIMQKL NPSEVVNCAH SLLRLASNPY WLVKVKLAEC FEELDFCLIS LLEGTCYSNL
     TEKLQPAVLH CLIDQLLVDE DRRVSSAAST TLLKLIPRLY FFGNVTLVSL NRQYYGRTFS
     GRQNWLHPDL PTVHNSALDY SSPSDLHYNA LLIDDIEHNL SIVIDLLYST FYKTSAQVFD
     WFGCIVALDE LATAYPPRVF RRAWSTTTIT NGKCLACPLM QHCLQLLRTD CFQLDPLISW
     QRILHFVSTL FAACVVDFES QPTNKKKNIV ELYGLTDVAA DLIAFTMQIT SMYWHVVEQR
     TATPTSTARF VKSGSGSSTT LSSPVIKKLK YPTFIRKEIS IKLHLGRGDS GVGERSGRFS
     EGGYEPQLLK LFEILRRVGS NFRATIDAET EKKFLGLLGS ALQALSAVLE AVSVVDFSGY
     LEDFLFFSKS LMPLMPAQLI STLTQVIKVL FGTNLHFEKQ LRVKRTQDSI VNQAKTLQSC
     SNYEHFLENI QTHIAQKTAR SMMRHDYVED SLICHVGWLE NNYSQYVSQL FQDSERKSFL
     SEAIQLFEPI LMEGLRMYTV QSDGRFQAAF VGLICHLLLV KVNYSVLDPQ RRILNLFCQQ
     LEHYIEDDSC VDREFVCKLF ELLIILSHER QNTVPFVEIP KIIQLVESMY TSKDNPLLGN
     IAMHYVVLDL FLLRQQDGGN NELEIQQEVV IAMLTKALQY AESWDSFNVI MFQLKKFDER
     RWRKISRDLT DRLLPLLLEG KIEIKNSNSV NALYQLLDMV CSRAYSPVDP IFSALCFLAA
     ECEHNTLNKW KHLLAVTCLF RILFLHCPGD SLLDRANEVL PSLKANQQLA PWMNYHTDTK
     PEELIVRLIL RLSEWYLVSF VDSAVADVSP NSHQLELQMI VNFYETIFWI NQSGMHPKLT
     AVFRFEEHFV RIIDQLEIAS RTGEAVLLFQ LCHISAVLCN SEWHLNELIS RVQIKSFPWL
     VILVTCKDLL KNQMSKSTTT MDSLFLKLSS LRVECGTFFA CVEEMAFEQL LKYALKMEHS
     CETMLYNFKR TIDSSSMTIG QQLKMIELLK YSCERYSLIA LHLEISLVQS KHYLVRRCAE
     QSACRRLLSM MSKSTQELQR MTGVELLLKD IIYATSKNSF TLLFKNGELR SLMVKFVNDN
     LHPAFGLSRR LPSELSGRAA AGCSDTGHSF DKLDVLHLLK TDVSADVGAP TDRWSRFAKA
     VQCFPVEDVR SILCDQVNIP GAALRACLKL ARRRFALSCD GGDQNNCGAA PDREFSRNMP
     KLFTAGKEAV IFALQNLIRQ FPADAVARFQ FSQWEVDLTL PIGYREKAFS MHSSVQELFS
     IISAASWRLS TSEISIILDF VHISFLTCLK LLPDLSLHEA TATLATFLTV LANQHCCTFF
     TDQPQNVVHS IIGCTFWMLV KSFRLSALLR RLPSKLPFTQ GSIFFFFSVG SLFCRLSGIG
     FSVNETEKMS FLYCEAMLAF RRSRRLYEQV QDALISDTII GVCRLPQVFS FCLTPFAAFP
     FGWVPDVKRI GDLCTFSNVP VRFLVHADVL KDFIYRVCLL GWSSKQQFEE IWMTLVAVLS
     ATPIGKEMSH RDRLDTVDRV TASSLAVRCI GCIFTMTSPR LVLTPVNNWK RAAQFRPPCS
     FQSEPTWNKR CAEIAHLVKF ELESSSSLEM QADLQLPCSF CDVEKLYESA KLLNRAQIIN
     SNIASGGSIV TPNLRESFDL VSCMYFVIDL FDHWFKDGPD QVPLLLLIST LDAIVCLSDF
     FFEDAHSEWM TNHMNVVFQA RSADDEFITN VILFGILKCA ANLCSSNAES LKMILQVVDH
     GCKIPFQSNK MFMLFGLLQS LRSFPSADVA CFLPNIGELI LHTFQLMSDR QAVNNVQVCG
     LDYEVACCAL ACQMMEKLAD EHGTVDYLKT LLKLAAEAYQ SPLPHCLQNA VATVLQTLVR
     FPAMNFECRQ QILTVTTKCF DVEPGNSYIF TSLTLMLISL HFIKNWFNNQ ADDDQQRQQL
     DASASDQSKA TFHIMVMERF DIILRKIAQS ATEITNVLGV VCCEFLKDNF SPSDVIHKLV
     LDFMSNMKSA DQQKVLIDIL CSVLKDFQSK DEPTIFNWIL LLVPSIVEKR PSNLAISCLT
     CFFLSIFKEP WMEALRIIAL NRLGKMEEFD TELFAFTVKR FKELLPNEIL VNDFLAIFAN
     FSIQYPNTAY SLAHQYCMTK KT
//
DBGET integrated database retrieval system