ID A0A0V0SMY5_9BILA Unreviewed; 3127 AA.
AC A0A0V0SMY5;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE SubName: Full=Huntingtin {ECO:0000313|EMBL:KRX28072.1};
GN Name=HTT {ECO:0000313|EMBL:KRX28072.1};
GN ORFNames=T07_10080 {ECO:0000313|EMBL:KRX28072.1};
OS Trichinella nelsoni.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6336 {ECO:0000313|EMBL:KRX28072.1, ECO:0000313|Proteomes:UP000054630};
RN [1] {ECO:0000313|EMBL:KRX28072.1, ECO:0000313|Proteomes:UP000054630}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS37 {ECO:0000313|EMBL:KRX28072.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123, ECO:0000256|PROSITE-
CC ProRule:PRU00649}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX28072.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDL01000001; KRX28072.1; -; Genomic_DNA.
DR STRING; 6336.A0A0V0SMY5; -.
DR Proteomes; UP000054630; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR Gene3D; 2.20.25.10; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR Gene3D; 1.10.472.30; Transcription elongation factor S-II, central domain; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR048413; Htt_C-HEAT_rpt.
DR InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR InterPro; IPR028426; Huntingtin_fam.
DR InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR003618; TFIIS_cen_dom.
DR InterPro; IPR036575; TFIIS_cen_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR Pfam; PF20927; Htt_C-HEAT; 2.
DR Pfam; PF12372; Htt_N-HEAT; 1.
DR Pfam; PF20926; Htt_N-HEAT_1; 1.
DR Pfam; PF08711; Med26; 1.
DR Pfam; PF07500; TFIIS_M; 1.
DR SMART; SM00510; TFS2M; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR SUPFAM; SSF46942; Elongation factor TFIIS domain 2; 1.
DR PROSITE; PS51321; TFIIS_CENTRAL; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000054630}.
FT DOMAIN 19..93
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT DOMAIN 153..267
FT /note="TFIIS central"
FT /evidence="ECO:0000259|PROSITE:PS51321"
FT REGION 755..791
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 775..791
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3127 AA; 356362 MW; 0C03AE0DE0628636 CRC64;
MSNNVQRQRL EFRVREIENE LSEMLDKNAL NKTKMRKLLE ELRTYGGIDV DMLVSTNIGK
TVNRIRTTFV KCPELFENAT FLVKKWKKEV NALSTSSRRK SLSKSTSQSS DDMDQFSTSE
LCSQSTTNFP LFNLSSPVRH NSCRALFDSI RSNLNNCRAM FEDCDANLLN DNNIKTIVQQ
IEESIYELNG SDETNSKYCN EIRSHAMNLC NSKNCQLLRD ILTGKILPAN FAKMTTEEMA
PEEVKNMRKA VERDSLKEHM LSNEDSLLHS TTFHCRQCDQ RDCNYTVSYE KDGHEAEAVT
YVVCNQCGHR MEKLMKNLES LRRPTNMLKK EAVPLSKEKA ALFYQVAEAL SSNTLKQHAE
YGKAFGNAFT ILFSYCDDPC TDVRILTEET MNIIIRSCIN EWNISRVHSE LCKELKRNGS
ARCIRSAMGK FVETVELIGI QKRRQFAITL IPLFEQMIKR PEEMIQDCIG STIARLLQTL
GRYFRDEEIH GLIHSALANL QLQSTLFRRV AATFITELCH NSRRRSDMIS YWFKYILTKV
TKQDCDDVFH LLGNLLTLKM FIPLVLKEQQ IDQQYSTNVG GGQRVKQISS ENFVQLCEAI
LFYTAVPNDV VNSLALEMLV KIFENSTQTQ CAIWLSKNSV KKSRFFFPSA DCDLTSTADV
EPMRSDSISS ASCSDDPTAL QQDTAVEQPV LNISFQGDDC LEKRVQRFSA SSDDQSQFGE
RESLESICIG RLKYSGLQRS IESVDERLAL MEMDEEAKSN QQQSNGRKFD EHSHSSSTCA
SVSGASVNSG KTPAFDKSTI STVADWAKDE FDSLLAETTS MAQFCCRLIC RRFLLSGQAG
VTLPDEKVRV SHKVLSLATL ERLFMLDHAL YNLSLYEGCS QNISDVLLLR KHLDPQVRGF
VSQAVASLLI STATAVQSRK WNNDDDQEFK KEMKVLKRFL EILSNATRDE SNVCIHYTLS
SLRRTFSIMQ KLNPSEVVNC AHNLLRLASN PYWLVKVKLA ECFEELDFCL ISLLEETCYS
NLTEKLQPAV LHCLIDQLLV DEDRRVSSAA STSLLKLIPR LYFFGNVTLV SLNRQYYGRT
FSGRQNWLHP DLPTVHNSAL DYSSPSDLHY NALLIDDTEH NLSIVIDLLY STFYKTSAQV
FDWFGCIVAL DELATAYPPR VYRRAWSTTT TVNGKFLACP LMQHCLQLLR TDCFQLDPLI
SWQRILHFVS TLFAACVVDF ESQPTNKKNN IVELYGLTDV AADLIAFTMQ VTSMYWHVVE
QRTATPTSTA RFVKSGSGSS TTLSSPVIKK LKYPTFIRKE ISIKLHLGRG DSGVGERSGR
FSEGGYEPQL LNLFEILRRV GSNFRATIDA ETEKKFLGLL GSALQALSAV LEAISVVDFS
GYLEDFLFFS KSLMPLMPAQ LISTLTQVIK VLFGTNLHFE KQLRVKRTQD SIVNQATTLQ
SCSNYEHFLE NIQTHIAQKT ARSMMRHDFV EDSLICHVGW LENNYSQYVS QLFQDSERKS
FLSEAIQLFE PILMEGLRMY TVQSDGRFQA AFVGLICHLL LVKVNYSVLD PQRRILNLFC
QQLEHYIEDD SCVDREFVCK LFELLIILSH ERQNTVPFVE IPKIIQLVES MYTSKDNPLL
GNIAMHYVVL DLFLLRQQDV GNNELEIQQE VVIAMLTKAL QYAESWDSFN VIMFQLKKFD
ERRWRKISRD LTDRLLPLLL EGKALNALYQ LLDMVCSRAY SPVDPIFSAL CFLAAECEHN
TLNKWKHLLA VTCLFRILFL HCPGDSLLDR ANEVLPSLKA NQQLAPWMNY HTDTKPEELI
VRLILRLSEW YLVSFVDSAV GDVSPNSHQL ELQMIVNFYE TIFWINQSGM HPKLTAVFRF
EEHFVRIIDQ LEIASRTGDA VLLFQLCHIS AILCNSEWHL NELISRVQIN YLYILRSFPW
LVILVTCKDL LKNQMSKSAT TTTTTTTTMD SLFLKWSSLR VECETFFAYV EEMAFEQLLK
YALKMEHSCE TMLYNFKRTI DSSSMTIGQQ LKMIELLKYS CERYSLIALH LEISLVQSKH
YLVRRCAEQS ACRRLLSMMS KSTQELQRMT GVELLLKDII YATSKNSFTL LFKNGELRSL
MVKFVNDNLH PAFGLSRRLP SELSGRAAAG CFDTGHSFDK LDVLHLLKTD VSADVGVPTD
RWSRFAKAVQ CFPVEDVRSI LCDQNIPGSA LRACLKLARR RFALSCDGGD QNNCGAAPDR
EFSRNMPKLF TAGKEAVIFA LQNLIRQFPA DAVARFQFSQ WEVDLTLPIG YREKAFSIHS
SVQELFSIIS AASWRLSTSE ISIILDFVHI SFLTCLKLLP DLSLHEATAT LATFLTVLAN
QHCCTFFTDQ PQNVVHSIIG CTFWMLVKSF RLSALLRRLP SKLPFTQVNE TEKMSFLYCE
AMLAFRRSRR LYEQVQDALI SDTIIGVCRL PQVFSFCLTP FAAFPFGWVP DVKRIGDLCT
FSNVPVRFLV HADVLKDFIY RVCLLGWSSK QQFEEIWMTL VAVLSATPIG KEMSHRDRLD
TVDRVTASSL AVRCIGCIFT MTSPRLVLTP VNNWKRAAQF RPPCSFQSEP TWNKRCAEIA
HLVKFELESS SSLEMQADLQ LPCSFCDVEK LYESAKLLNR AQIINSNIAS GGSIVTPNLR
ESFDLVSCMY FVIDLFDHWF KDGPDQVPLL LLISTLDAIV CLSDFFFEDA HSEWMTNHMN
VIFQARSADD EFITNVILFG ILKCAANLCS SNAESLKMIL QVVDHGCKIP FQSNKMFMLF
GLLQSLRSFP SADVACFLPN IGELILHTFQ LMSDRQAVNN VQVCGLDYEV ACCALACQMM
EKLADEHGTV DYLKTLLKLA AEAYQSPLPH CLQNAVATVL QTLVRFPAMN FECRQQILTV
TTKCFDVEPG NSYIFTSLTL MLISLHFIKN WFNNQADDDQ QRQQLDASAS DQSKATFHIM
VMERFDIILR KIAQSATEIT NVLGVVCCEF LKDNFSPSDV IHKLVLDFMS NMKSADQQKV
LIDILCSVLK DFQSKDESAI FNWILLLVPS IVEKRPSNLA ISCLTCFFLS IFKEPWMEAL
RIIALNRLGK MEEFDTELFA FTVKRFKELL PNEILVNDFL AIFANFSIQY PNTAYSLAHQ
YCMTKKT
//