GenomeNet

Database: UniProt
Entry: A0A0V0SMY5_9BILA
LinkDB: A0A0V0SMY5_9BILA
Original site: A0A0V0SMY5_9BILA 
ID   A0A0V0SMY5_9BILA        Unreviewed;      3127 AA.
AC   A0A0V0SMY5;
DT   16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT   16-MAR-2016, sequence version 1.
DT   24-JAN-2024, entry version 27.
DE   SubName: Full=Huntingtin {ECO:0000313|EMBL:KRX28072.1};
GN   Name=HTT {ECO:0000313|EMBL:KRX28072.1};
GN   ORFNames=T07_10080 {ECO:0000313|EMBL:KRX28072.1};
OS   Trichinella nelsoni.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC   Trichinellida; Trichinellidae; Trichinella.
OX   NCBI_TaxID=6336 {ECO:0000313|EMBL:KRX28072.1, ECO:0000313|Proteomes:UP000054630};
RN   [1] {ECO:0000313|EMBL:KRX28072.1, ECO:0000313|Proteomes:UP000054630}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ISS37 {ECO:0000313|EMBL:KRX28072.1};
RA   Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT   "Evolution of Trichinella species and genotypes.";
RL   Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC       Nucleus {ECO:0000256|ARBA:ARBA00004123, ECO:0000256|PROSITE-
CC       ProRule:PRU00649}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KRX28072.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JYDL01000001; KRX28072.1; -; Genomic_DNA.
DR   STRING; 6336.A0A0V0SMY5; -.
DR   Proteomes; UP000054630; Unassembled WGS sequence.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR   Gene3D; 2.20.25.10; -; 1.
DR   Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR   Gene3D; 1.10.472.30; Transcription elongation factor S-II, central domain; 1.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR048413; Htt_C-HEAT_rpt.
DR   InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR   InterPro; IPR028426; Huntingtin_fam.
DR   InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR   InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR   InterPro; IPR003618; TFIIS_cen_dom.
DR   InterPro; IPR036575; TFIIS_cen_dom_sf.
DR   InterPro; IPR017923; TFIIS_N.
DR   PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR   PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR   Pfam; PF20927; Htt_C-HEAT; 2.
DR   Pfam; PF12372; Htt_N-HEAT; 1.
DR   Pfam; PF20926; Htt_N-HEAT_1; 1.
DR   Pfam; PF08711; Med26; 1.
DR   Pfam; PF07500; TFIIS_M; 1.
DR   SMART; SM00510; TFS2M; 1.
DR   SUPFAM; SSF48371; ARM repeat; 2.
DR   SUPFAM; SSF47676; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR   SUPFAM; SSF46942; Elongation factor TFIIS domain 2; 1.
DR   PROSITE; PS51321; TFIIS_CENTRAL; 1.
DR   PROSITE; PS51319; TFIIS_N; 1.
PE   4: Predicted;
KW   Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00649}; Reference proteome {ECO:0000313|Proteomes:UP000054630}.
FT   DOMAIN          19..93
FT                   /note="TFIIS N-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51319"
FT   DOMAIN          153..267
FT                   /note="TFIIS central"
FT                   /evidence="ECO:0000259|PROSITE:PS51321"
FT   REGION          755..791
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        775..791
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3127 AA;  356362 MW;  0C03AE0DE0628636 CRC64;
     MSNNVQRQRL EFRVREIENE LSEMLDKNAL NKTKMRKLLE ELRTYGGIDV DMLVSTNIGK
     TVNRIRTTFV KCPELFENAT FLVKKWKKEV NALSTSSRRK SLSKSTSQSS DDMDQFSTSE
     LCSQSTTNFP LFNLSSPVRH NSCRALFDSI RSNLNNCRAM FEDCDANLLN DNNIKTIVQQ
     IEESIYELNG SDETNSKYCN EIRSHAMNLC NSKNCQLLRD ILTGKILPAN FAKMTTEEMA
     PEEVKNMRKA VERDSLKEHM LSNEDSLLHS TTFHCRQCDQ RDCNYTVSYE KDGHEAEAVT
     YVVCNQCGHR MEKLMKNLES LRRPTNMLKK EAVPLSKEKA ALFYQVAEAL SSNTLKQHAE
     YGKAFGNAFT ILFSYCDDPC TDVRILTEET MNIIIRSCIN EWNISRVHSE LCKELKRNGS
     ARCIRSAMGK FVETVELIGI QKRRQFAITL IPLFEQMIKR PEEMIQDCIG STIARLLQTL
     GRYFRDEEIH GLIHSALANL QLQSTLFRRV AATFITELCH NSRRRSDMIS YWFKYILTKV
     TKQDCDDVFH LLGNLLTLKM FIPLVLKEQQ IDQQYSTNVG GGQRVKQISS ENFVQLCEAI
     LFYTAVPNDV VNSLALEMLV KIFENSTQTQ CAIWLSKNSV KKSRFFFPSA DCDLTSTADV
     EPMRSDSISS ASCSDDPTAL QQDTAVEQPV LNISFQGDDC LEKRVQRFSA SSDDQSQFGE
     RESLESICIG RLKYSGLQRS IESVDERLAL MEMDEEAKSN QQQSNGRKFD EHSHSSSTCA
     SVSGASVNSG KTPAFDKSTI STVADWAKDE FDSLLAETTS MAQFCCRLIC RRFLLSGQAG
     VTLPDEKVRV SHKVLSLATL ERLFMLDHAL YNLSLYEGCS QNISDVLLLR KHLDPQVRGF
     VSQAVASLLI STATAVQSRK WNNDDDQEFK KEMKVLKRFL EILSNATRDE SNVCIHYTLS
     SLRRTFSIMQ KLNPSEVVNC AHNLLRLASN PYWLVKVKLA ECFEELDFCL ISLLEETCYS
     NLTEKLQPAV LHCLIDQLLV DEDRRVSSAA STSLLKLIPR LYFFGNVTLV SLNRQYYGRT
     FSGRQNWLHP DLPTVHNSAL DYSSPSDLHY NALLIDDTEH NLSIVIDLLY STFYKTSAQV
     FDWFGCIVAL DELATAYPPR VYRRAWSTTT TVNGKFLACP LMQHCLQLLR TDCFQLDPLI
     SWQRILHFVS TLFAACVVDF ESQPTNKKNN IVELYGLTDV AADLIAFTMQ VTSMYWHVVE
     QRTATPTSTA RFVKSGSGSS TTLSSPVIKK LKYPTFIRKE ISIKLHLGRG DSGVGERSGR
     FSEGGYEPQL LNLFEILRRV GSNFRATIDA ETEKKFLGLL GSALQALSAV LEAISVVDFS
     GYLEDFLFFS KSLMPLMPAQ LISTLTQVIK VLFGTNLHFE KQLRVKRTQD SIVNQATTLQ
     SCSNYEHFLE NIQTHIAQKT ARSMMRHDFV EDSLICHVGW LENNYSQYVS QLFQDSERKS
     FLSEAIQLFE PILMEGLRMY TVQSDGRFQA AFVGLICHLL LVKVNYSVLD PQRRILNLFC
     QQLEHYIEDD SCVDREFVCK LFELLIILSH ERQNTVPFVE IPKIIQLVES MYTSKDNPLL
     GNIAMHYVVL DLFLLRQQDV GNNELEIQQE VVIAMLTKAL QYAESWDSFN VIMFQLKKFD
     ERRWRKISRD LTDRLLPLLL EGKALNALYQ LLDMVCSRAY SPVDPIFSAL CFLAAECEHN
     TLNKWKHLLA VTCLFRILFL HCPGDSLLDR ANEVLPSLKA NQQLAPWMNY HTDTKPEELI
     VRLILRLSEW YLVSFVDSAV GDVSPNSHQL ELQMIVNFYE TIFWINQSGM HPKLTAVFRF
     EEHFVRIIDQ LEIASRTGDA VLLFQLCHIS AILCNSEWHL NELISRVQIN YLYILRSFPW
     LVILVTCKDL LKNQMSKSAT TTTTTTTTMD SLFLKWSSLR VECETFFAYV EEMAFEQLLK
     YALKMEHSCE TMLYNFKRTI DSSSMTIGQQ LKMIELLKYS CERYSLIALH LEISLVQSKH
     YLVRRCAEQS ACRRLLSMMS KSTQELQRMT GVELLLKDII YATSKNSFTL LFKNGELRSL
     MVKFVNDNLH PAFGLSRRLP SELSGRAAAG CFDTGHSFDK LDVLHLLKTD VSADVGVPTD
     RWSRFAKAVQ CFPVEDVRSI LCDQNIPGSA LRACLKLARR RFALSCDGGD QNNCGAAPDR
     EFSRNMPKLF TAGKEAVIFA LQNLIRQFPA DAVARFQFSQ WEVDLTLPIG YREKAFSIHS
     SVQELFSIIS AASWRLSTSE ISIILDFVHI SFLTCLKLLP DLSLHEATAT LATFLTVLAN
     QHCCTFFTDQ PQNVVHSIIG CTFWMLVKSF RLSALLRRLP SKLPFTQVNE TEKMSFLYCE
     AMLAFRRSRR LYEQVQDALI SDTIIGVCRL PQVFSFCLTP FAAFPFGWVP DVKRIGDLCT
     FSNVPVRFLV HADVLKDFIY RVCLLGWSSK QQFEEIWMTL VAVLSATPIG KEMSHRDRLD
     TVDRVTASSL AVRCIGCIFT MTSPRLVLTP VNNWKRAAQF RPPCSFQSEP TWNKRCAEIA
     HLVKFELESS SSLEMQADLQ LPCSFCDVEK LYESAKLLNR AQIINSNIAS GGSIVTPNLR
     ESFDLVSCMY FVIDLFDHWF KDGPDQVPLL LLISTLDAIV CLSDFFFEDA HSEWMTNHMN
     VIFQARSADD EFITNVILFG ILKCAANLCS SNAESLKMIL QVVDHGCKIP FQSNKMFMLF
     GLLQSLRSFP SADVACFLPN IGELILHTFQ LMSDRQAVNN VQVCGLDYEV ACCALACQMM
     EKLADEHGTV DYLKTLLKLA AEAYQSPLPH CLQNAVATVL QTLVRFPAMN FECRQQILTV
     TTKCFDVEPG NSYIFTSLTL MLISLHFIKN WFNNQADDDQ QRQQLDASAS DQSKATFHIM
     VMERFDIILR KIAQSATEIT NVLGVVCCEF LKDNFSPSDV IHKLVLDFMS NMKSADQQKV
     LIDILCSVLK DFQSKDESAI FNWILLLVPS IVEKRPSNLA ISCLTCFFLS IFKEPWMEAL
     RIIALNRLGK MEEFDTELFA FTVKRFKELL PNEILVNDFL AIFANFSIQY PNTAYSLAHQ
     YCMTKKT
//
DBGET integrated database retrieval system