ID A0A0V1LLW2_9BILA Unreviewed; 3142 AA.
AC A0A0V1LLW2;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE SubName: Full=Huntingtin {ECO:0000313|EMBL:KRZ60178.1};
GN Name=HTT {ECO:0000313|EMBL:KRZ60178.1};
GN ORFNames=T02_13920 {ECO:0000313|EMBL:KRZ60178.1};
OS Trichinella nativa.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6335 {ECO:0000313|EMBL:KRZ60178.1, ECO:0000313|Proteomes:UP000054721};
RN [1] {ECO:0000313|EMBL:KRZ60178.1, ECO:0000313|Proteomes:UP000054721}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS10 {ECO:0000313|EMBL:KRZ60178.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (MAY-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123, ECO:0000256|PROSITE-
CC ProRule:PRU00649}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRZ60178.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDW01000032; KRZ60178.1; -; Genomic_DNA.
DR Proteomes; UP000054721; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR Gene3D; 2.20.25.10; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR Gene3D; 1.10.472.30; Transcription elongation factor S-II, central domain; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR048413; Htt_C-HEAT_rpt.
DR InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR InterPro; IPR028426; Huntingtin_fam.
DR InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR003618; TFIIS_cen_dom.
DR InterPro; IPR036575; TFIIS_cen_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR Pfam; PF20927; Htt_C-HEAT; 2.
DR Pfam; PF12372; Htt_N-HEAT; 1.
DR Pfam; PF20926; Htt_N-HEAT_1; 1.
DR Pfam; PF08711; Med26; 1.
DR Pfam; PF07500; TFIIS_M; 1.
DR SMART; SM00510; TFS2M; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR SUPFAM; SSF46942; Elongation factor TFIIS domain 2; 1.
DR PROSITE; PS51321; TFIIS_CENTRAL; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000054721}.
FT DOMAIN 19..93
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT DOMAIN 153..267
FT /note="TFIIS central"
FT /evidence="ECO:0000259|PROSITE:PS51321"
FT REGION 755..789
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 775..789
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3142 AA; 357328 MW; 939A1BC7C0E479FF CRC64;
MTNNVQRQRL EFRVREIENE LSEMLDKNAL NKTKMRKLLE ELRTYGGIDV DMLVSTNIGK
TVNRIRTTFV KCPELFENAT FLVKKWKKEV SALSTSSRRK SLSKSTSQSS DDMDQFSTSE
LCSQSTTNFP LFNLSSPVRH NSCRALFDSI RSNLNNCRAV FEDCDANLLN DNNIKTIVQQ
IEESIYELNG SDETNSKYCS EIRSHAMNLC NSRNCQLLRD ILTGKILPAN FAKMTTEEMA
PEEVKNMRKA VERDSLKEHM LSNEGSLLHS TTFHCRQCGQ RDCNYTVSYE KDGHEAEAVT
YVVCNQCGHR MEKLMKNLES LRRPTNMLKK EAVPLSKEKA ALFYQVAEAL SSNTLKQHAE
YGKAFGNAFT ILFSYCDDPC TDVRILTEET MNIIIRSCIN EWNISRVHSE LCKELKRNGS
ARCIRSAMGK FVEIVELIGI QKRRQFAITL IPLFEQIIKR PEEMIQDCIG STIARLLQTL
GRYFRDEEIH GLIHSALANL QLQSTLFRRV AATFITELCH NSRRRSDMIS YWFKYILTKV
TKQDCDDVFH LLGNLLTLKM FIPLVLKEQQ IDQQYSTNVG GGQRVKQISS ENFVQLCEAI
LFYTAVPNDV VSSLALEMLV KIFENSTETQ CAIWLSKNSV KKSRFFFPSA DCDLTSTADV
EPVRSESISS ASCSDDPTAL QQDAAVEQPV LNISFQGDDC LERRVQRFSA SSDDQSQFGE
RESLESICIG RLKYSGLQRS VESVDERLAL MEMDEEAKSN QQQSNGRKFD EHSHSSSTCA
SVSGASVNSG KTFDKSTISI VADWAKDEFD SLLAETTSMA QFCCRLICRR FLLSGQAGVT
LPDEKVRVSH KVLSLATLER LFTLDHALYN LSLYEGCSQN ISDVLLLCKH LDPQVRGFVS
QAVASLLIST ATAVQSRKWN NDDDQEFKEE MKVLKRFLEI LSNATRDESN VCIHYTLSSL
RRTFSIMQKL NPSEVVNCAH SLLRLASNPY WLVKVKLAEC FEELDFCLIS LLEGTCYSNL
TEKLQPAVLH CLIDQLLVDE DRRVSSAAST TLLKLIPRLY FFGNVTLVSL NRQYYGRTFS
GRQNWLHPDL PTVHNSALDY SSPSDLHYNA LLIDDIEHNL SIVIDLLYST FYKTSAQVFD
WFGCIVALDE LATAYPPRVF RRAWSTTTIT NGKCLACPLM QHCLQLLRTD CFQLDPLISW
QRILHFVSTL FAACVVDFES QPTNKKKNIV ELYGLTDVAA DLIAFTMQIT SMYWHVVEQR
TATPTSTARF VKSGSGSSTT LSSPVIKKLK YPTFIRKEIS IKLHLGRGDS GVGERSGRFS
EGGYEPQLLK LFEILRRVGS NFRATIDAET EKKFLGLLGS ALQALSAVLE AVSVVDFSGY
LEDFLFFSKS LMPLMPAQLI STLTQVIKVL FGTNLHFEKQ LRVKRTQDSI VNQAKTLQSC
SNYEHFLENI QTHIAQKTAR SMMRHDYVED SLICHVGWLE NNYSQYVSQL FQDSERKSFL
SEAIQLFEPI LMEGLRMYTV QSDGRFQAAF VGLICHLLLV KVNYSVLDPQ RRILNLFCQQ
LEHYIEDDSC VDREFVCKLF ELLIILSHER QNTVPFVEIP KIIQLVESMY TSKDNPLLGN
IAMHYVVLDL FLLRQQDGGN NELEIQQEVV IAMLTKALQY AESWDSFNVI MFQLKKFDER
RWRKISRDLT DRLLPLLLEG KIEIKNSNSV NALYQLLDMV CSRAYSPVDP IFSALCFLAA
ECEHNTLNKW KHLLAVTCLF RILFLHCPGD SLLDRANEVL PSLKANQQLA PWMNYHTDTK
PEELIVRLIL RLSEWYLVSF VDSAVADVSP NSHQLELQMI VNFYETIFWI NQSGMHPKLT
AVFRFEEHFV RIIDQLEIAS RTGEAVLLFQ LCHISAVLCN SEWHLNELIS RVQIKSFPWL
VILVTCKDLL KNQMSKSTTT MDSLFLKLSS LRVECGTFFA CVEEMAFEQL LKYALKMEHS
CETMLYNFKR TIDSSSMTIG QQLKMIELLK YSCERYSLIA LHLEISLVQS KHYLVRRCAE
QSACRRLLSM MSKSTQELQR MTGVELLLKD IIYATSKNSF TLLFKNGELR SLMVKFVNDN
LHPAFGLSRR LPSELSGRAA AGCSDTGHSF DKLDVLHLLK TDVSADVGAP TDRWSRFAKA
VQCFPVEDVR SILCDQVNIP GAALRACLKL ARRRFALSCD GGDQNNCGAA PDREFSRNMP
KLFTAGKEAV IFALQNLIRQ FPADAVARFQ FSQWEVDLTL PIGYREKAFS MHSSVQELFS
IISAASWRLS TSEISIILDF VHISFLTCLK LLPDLSLHEA TATLATFLTV LANQHCCTFF
TDQPQNVVHS IIGCTFWMLV KSFRLSALLR RLPSKLPFTQ GSIFFFFSVG SLFCRLSGIG
FSVNETEKMS FLYCEAMLAF RRSRRLYEQV QDALISDTII GVCRLPQVFS FCLTPFAAFP
FGWVPDVKRI GDLCTFSNVP VRFLVHADVL KDFIYRVCLL GWSSKQQFEE IWMTLVAVLS
ATPIGKEMSH RDRLDTVDRV TASSLAVRCI GCIFTMTSPR LVLTPVNNWK RAAQFRPPCS
FQSEPTWNKR CAEIAHLVKF ELESSSSLEM QADLQLPCSF CDVEKLYESA KLLNRAQIIN
SNIASGGSIV TPNLRESFDL VSCMYFVIDL FDHWFKDGPD QVPLLLLIST LDAIVCLSDF
FFEDAHSEWM TNHMNVVFQA RSADDEFITN VILFGILKCA ANLCSSNAES LKMILQVVDH
GCKIPFQSNK MFMLFGLLQS LRSFPSADVA CFLPNIGELI LHTFQLMSDR QAVNNVQVCG
LDYEVACCAL ACQMMEKLAD EHGTVDYLKT LLKLAAEAYQ SPLPHCLQNA VATVLQTLVR
FPAMNFECRQ QILTVTTKCF DVEPGNSYIF TSLTLMLISL HFIKNWFNNQ ADDDQQRQQL
DASASDQSKA TFHIMVMERF DIILRKIAQS ATEITNVLGV VCCEFLKDNF SPSDVIHKLV
LDFMSNMKSA DQQKVLIDIL CSVLKDFQSK DEPTIFNWIL LLVPSIVEKR PSNLAISCLT
CFFLSIFKEP WMEALRIIAL NRLGKMEEFD TELFAFTVKR FKELLPNEIL VNDFLAIFAN
FSIQYPNTAY SLAHQYCMTK KT
//