ID A0A0V1CQJ6_TRIBR Unreviewed; 3108 AA.
AC A0A0V1CQJ6;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE SubName: Full=Huntingtin {ECO:0000313|EMBL:KRY51540.1};
GN Name=HTT {ECO:0000313|EMBL:KRY51540.1};
GN ORFNames=T03_15745 {ECO:0000313|EMBL:KRY51540.1};
OS Trichinella britovi (Parasitic roundworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=45882 {ECO:0000313|EMBL:KRY51540.1, ECO:0000313|Proteomes:UP000054653};
RN [1] {ECO:0000313|EMBL:KRY51540.1, ECO:0000313|Proteomes:UP000054653}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS120 {ECO:0000313|EMBL:KRY51540.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123, ECO:0000256|PROSITE-
CC ProRule:PRU00649}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRY51540.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDI01000123; KRY51540.1; -; Genomic_DNA.
DR Proteomes; UP000054653; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR Gene3D; 2.20.25.10; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR Gene3D; 1.10.472.30; Transcription elongation factor S-II, central domain; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR048413; Htt_C-HEAT_rpt.
DR InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR InterPro; IPR028426; Huntingtin_fam.
DR InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR InterPro; IPR003618; TFIIS_cen_dom.
DR InterPro; IPR036575; TFIIS_cen_dom_sf.
DR InterPro; IPR017923; TFIIS_N.
DR PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR Pfam; PF20927; Htt_C-HEAT; 2.
DR Pfam; PF12372; Htt_N-HEAT; 1.
DR Pfam; PF20926; Htt_N-HEAT_1; 1.
DR Pfam; PF08711; Med26; 1.
DR Pfam; PF07500; TFIIS_M; 1.
DR SMART; SM00510; TFS2M; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
DR SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR SUPFAM; SSF46942; Elongation factor TFIIS domain 2; 1.
DR PROSITE; PS51321; TFIIS_CENTRAL; 1.
DR PROSITE; PS51319; TFIIS_N; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000054653}.
FT DOMAIN 19..93
FT /note="TFIIS N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51319"
FT DOMAIN 153..267
FT /note="TFIIS central"
FT /evidence="ECO:0000259|PROSITE:PS51321"
FT REGION 737..771
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 757..771
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3108 AA; 353331 MW; 2C3A2C6170448B78 CRC64;
MTNNVQRQRL EFRVREIENE LSEMLDKNAL NKTKMRKLLE ELRTYGGIDV DMLVSTNIGK
TVNRIRTTFV KCPELFENAT FLVKKWKKEV SALSTSSRRK SLSKSTSQSS DDMDQFSTSE
LCSQSTTNFP LFNLSSPVRH NSCRALFDSI RSNLNNCRAV FEDCDANLLN DNNIKTIVQQ
IEESIYELNG SDETNSKYCS EIRSHAMNLC NSKNCQLLRD ILTGKILPAN FAKMTTEEMA
PEEVKNMRKA VERDSLKEHM LSNEGSLLHS TTFHCRQCGQ RDCNYTVSYE KDGHEAEAVT
YVVCNQCGHR KKEAVPLSKE KAALFYQVAE ALSSNTLKQH AEYGKAFGNA FTILFSYCDD
PCTDVRILTE ETMNIIIRSC INEWNISRVH SELCKELKRN GSARCIRSAM GKFVEIVELI
GIQKRRQFAI TLIPLFEQII KRPEEMIQDC IGSTIARLLQ TLGRYFRDEE IHGLIHSALA
NLQLQSTLFR RVAATFITEL CHNSRRRSDM ISYWFKYILT KVTKQDCDDV FHLLGNLLTL
KMFIPLVLKE QQIDQQYSTN VGGGQRVKQI SSENFVQLCE AILFYTAVPN DVVNTLALEM
LVKIFENSTE TQCAIWLSKN SVKKSRFFFP SADCDLTSTA DVEPVRSESI SSASCSDDPT
ALQQDAAVEQ PVLNISFQGD DCLERRIQRF SASSDDQSQF GERESLESIC IGRLKYSGLQ
RSVESVDERL ALMEMDEEAK SNQQQSNGRK FDEHSHSSST CASVSGASVN SGKTFDKSTI
SIVADWAKDE SETTSMAQFC CRLICRRFLL SGQAGVTLPD EKVRVSHKVL SLATLERLFT
LDHALYNLSL YEGCSQNISD VLLLCKHLDP QVRGFVSQAV ASLLISTATA VQSRKWNNDD
DQEFKEEMKV LKRFLEILSN ATRDESNVCI HYTLSSLRRT FSIMQKLNPS EVVNCAHSLL
RLASNPYWLV KVKLAECFEE LDFCLISLLE ETCYSNLTEK LQPAVLHCLI DQLLVDEDRR
VSSAASTSLL KLIPRLYFFG NVTLVSLNRQ YYGRTFSGRQ NWLHPDLPTV HNSALDYSSP
SDLHYNALLI DDIEHNLSIV IDLLYSTFYK TSAQVFDWFG CIVALDELAT AYPPRVYRRA
WSTTTTTNGK CLACPLMQHC LQLLRTDCFQ LDPLISWQRI LHFVSTLFAA CVVDFESQPT
NKKKNIVELY GLTDVAADLI AFTMQVTSMY WHVVEQRTAT PTSTARFVKS GSGSSTTLSS
PVIKKLKYPT FIRKEISIKL HLGRGDSGVG ERSGRFSEGG YEPQLLKLFE ILRRVGSNFR
ATIDAETEKK FLGLLGSALQ ALSAVLEAVS VVDFSGYLED FLFFSKSLMP LMPAQLISTL
TQVIKVLFGT NLHFEKQLRV KRTQDSIVNQ AKTLQSCSNY EHFLENIQTH ITQKTARSMM
RHNYVEDSLI CHVGWLENNY SQYVSQLFQD SERKSFLSEA IQLFEPILME GLRMYTVQSD
GRFQAAFVGL ICHLLLVKVN YSVLDPQRRI LNLFCQQLEH YIEDDSCVDR EFVCKLFELL
IILSHERQNT VPFVEIPKII QLVESMYTSK DNPFLGNIAM HYVVLDLFLL RQQDVGNNEL
EIQQEVVIAM LTKALQYAEV GGDHLMSWDS FNVIMFQMKK FDERRWRKIS RDLTDRLLPL
LLEGKIEIKN SNSVNALYQL LDMVCSRAYS PVDPIFSALC FLAAECEHNT LNKWKHLLAV
TCLFRILFLH CPGDSLLDRA NEVLPSLKAN QQLAPWMNYH TDTKPEELIV RLILRLSEWY
LVSFVDSAVA DVSPNNHQLE LQMIVNFYET IFWINQSGMH PKLTAVFRFE EHFVRIIDQL
EIASRTGEAV LLFQLCHISA VLCNSEWHLN ELISRVQIKS FPWLVILVTC KDLLKNQMSK
STTTMDSLFL KLSSLGVECG TFFACVEEMA FEQLLKYALK MEHSCETMLY NFKRTIDSSS
MTIGQQLKMI GLLKYSCERY SLIALHLEIS LVQSKHYLVR RCAEQSACRR LLSMMSKSTQ
ELQRMTGVEL LLKDIIYATS KNSFTLLFKN GELRSLMVKF VNDNLHPAFG LSRRLPSELS
GRAAAGCSDT GHSFDKLDVL HLLKTDVSAD VGAPTDRWSR FAKAAQCFPV EDVRSILCDQ
NIPGAALRAC LKLARRRFAL SCDGGDQNNC GAAPDREFSR NMPKLFTAGK EAVIFALQNL
IRQFPADAVA RFQFSQWEVD LTLPIGYREK AFSMHSSVQE LFSIISAASW RLSTSEISII
LDFVHISFLT CLKLLPDLSL HEATATLATF LTVLANQHCC TFFTDQPQNV VHSIIGCTFW
MLVKSFRLSA LLRRLPSKLP FTQGIGFSVN ETEKMSFLYC EAMLAFRRSR RLYEQVQDAL
ISDTIIGVCR LPQVFSFCLT PFAAFPFGWV PDVKRIGDLC TFSNVPVRFL VHADVLKDFI
YRVCLLGWSS KQQFEEIWMT LVAVLSATPI GKEMSHRDRL DTVDRVTASS LAVRCIGCIF
TMTSPRLVLT PVNNWKRAAQ FRPPCSFQSE PTWNKRCAEI AHLVKFELES SSSLEMQADL
QLPCSFCDVE KLYESAKLLN RAQIINSNIA SGGSIVTPNL RESFDLVSCM YFVIDLFDHW
FKDGPDQVPL LLLISTLDAI VCLSDFFFED AHSEWMTNHM NVVFQARSAD DEFITNVILF
GILKCAANLC SSNAESLKMI LQVVDHGCKI PFQSNKMFML FGLLQSLRSF PSADVACFLP
NIGELILHTF QLMSDRQAVN NVQVCGLDYE VACCALACQM MEKLADEHGT VDYLKTLLKL
AAEAYQSPLP HCLQNAVATV LQTLVRFPAM NFECRQQILT VTTKCFDVEP GNSYIFTSLT
LMLISLHFIK NWFNNQADDD QQRHQLDASA SDQSKATFHI MVMERFDIIL RKIAQSATEI
TNVLGVVCCE FLKDNFSPSD VIHKLVLDFM SNMKSADQQK VLIDILCSVL KDFQSKDEPT
IFNWILLLVP SIVEKRPSNL AISCLTCFFL SIFKEPWMEA LRIIALNRLG KMEEFDTELF
AFTVKRFKEL LPNEILVNDF LAIFANFSIQ YPNTAYSLAH QYCMTKKT
//