ID A0A162PA16_9CRUS Unreviewed; 2896 AA.
AC A0A162PA16;
DT 06-JUL-2016, integrated into UniProtKB/TrEMBL.
DT 06-JUL-2016, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE SubName: Full=Putative Huntingtin {ECO:0000313|EMBL:KZS18666.1};
GN ORFNames=APZ42_015190 {ECO:0000313|EMBL:KZS18666.1};
OS Daphnia magna.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda;
OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia.
OX NCBI_TaxID=35525 {ECO:0000313|EMBL:KZS18666.1, ECO:0000313|Proteomes:UP000076858};
RN [1] {ECO:0000313|EMBL:KZS18666.1, ECO:0000313|Proteomes:UP000076858}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Xinb3 {ECO:0000313|EMBL:KZS18666.1,
RC ECO:0000313|Proteomes:UP000076858};
RC TISSUE=Complete organism {ECO:0000313|EMBL:KZS18666.1};
RA Gilbert D.G., Choi J.-H., Mockaitis K., Colbourne J., Pfrender M.;
RT "EvidentialGene: Evidence-directed Construction of Genes on Genomes.";
RL Submitted (MAR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: May play a role in microtubule-mediated transport or vesicle
CC function. {ECO:0000256|ARBA:ARBA00002907}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the huntingtin family.
CC {ECO:0000256|ARBA:ARBA00007153}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KZS18666.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LRGB01000512; KZS18666.1; -; Genomic_DNA.
DR STRING; 35525.A0A162PA16; -.
DR Proteomes; UP000076858; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR048413; Htt_C-HEAT_rpt.
DR InterPro; IPR048411; Htt_N_HEAT_rpt-1.
DR InterPro; IPR028426; Huntingtin_fam.
DR InterPro; IPR024613; Huntingtin_N_HEAT_rpt-2.
DR PANTHER; PTHR10170:SF10; HUNTINGTIN; 1.
DR PANTHER; PTHR10170; HUNTINGTON DISEASE PROTEIN; 1.
DR Pfam; PF20927; Htt_C-HEAT; 2.
DR Pfam; PF12372; Htt_N-HEAT; 1.
DR Pfam; PF20926; Htt_N-HEAT_1; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 3: Inferred from homology;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000076858};
KW Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..15
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 16..2896
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013243950"
FT TRANSMEM 330..352
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
SQ SEQUENCE 2896 AA; 324212 MW; 9B849D862189F20E CRC64;
MRLLIGFALV SLAVGSTFFP TEKDEVDLVS TDQNDKVSGR IAVPPPNNLR KLYSDLSTCF
IRKIITSLLA PTSAPIRNGG IVNAAPILNP VAAPIVNTGP VGALVSGGAS QVFFSPVRIN
IDQSSQLYVP PGTDGRITVL LKNDGVGDFF LLSGGDDKNF FIQFDYSQIQ LGPGQTLAVP
GRIRVPQYAN NVVSTLTIIV QRRTDNAISR RQVYIRTKSE PGEKWAPWCT IRTVTQCDSY
LDESICSSRY WNMRAEIQDT EAGLLSVSIK PDGRFLGEDF IIGTNESIAV EQSVSCCTTG
VDITAVDVRG NTATCRADQS DSIYLSSGDV AAITLGVLLL ILLLVIIILA ICSPNVQNHK
DFVSYIASSI DSLIRLCDSK DSDIRLASDE GLYKVIKALL PLHSNRILVE LCKQIKKSES
PRILKLSMNR FAELCHLARR CAANAITVII STNRKSDVLL ALVVEYLTDQ ILLSEDISMT
DTSVITGTLL TLKLLLPLFP SPFVPVKENN NHHLGSQSSK SVATMNAVAV ERIVQIFELG
VHYSFSSDHN VVNAALELLQ QLLKMRTLLL AIPALKSANG LQGTRISLRG LSKENSQWNL
TALPVAEESL LLEAEVHLVE FEKTNLDSTE IEVEIAEDSI ITVGRDEQDE LLKGTALTEN
DMLPAEKDLE ETDVEQESCD VVDAPQVIDI GSLYDKTGCS PLLYCARHLS YSFLLSERKG
QMKSDRSVRV SVKSLALNCL AHIGSLEPLI WNSYLSDSER STDGRIAIID VLQYCTHEDP
QLRGQVSLLA CMVLSSVISG FNLTGIESKT LVRIIDETLK DREAAASRLA LQGLQHFLPM
ALEGSFCIET TPLLRSLLKL AENPYWLVRV DLLEVFTSFS WAALEFGIKN KTSFSLPMFQ
EAFLRKVAFT MIGDEDQRVR SAVANCIKCS VESWTTTKSP PSVVRLKSLA SACNFSHQKL
SSKTFAGVPL SINGLAEAYC DGPVNGNVVE NLACFIDELY KLLVSSNSKF VKMGCLQALA
ELSRSYPPAT YMEIYGCSPK GTCSLLKVCI ALMTNSTLML DLANHQSLIT LSTQLFAGCV
KVDLNAPESE IEDKEWHFLT GPLATIASHL LSHISRILNA FCCVLEDVNP LSVSIKAPLV
TLPNPSALSP IRRKLRNSEG PILDKEDKSD EFARKSKPHI LNPSNIGFFA SQSLYVKFYE
LLKSTNATCK LNFDTVTSER LSSFIQSTLK ALAALVEVYN FQDIGKHSEE VLNYLRIIIT
AEPVYCLQAV QQLLKALFGT NLAAVWYDEA SRFKRGCISP NFSVTMKKDK SMSLSYGFDD
YLYRDTVRQM TFNLTGERPS LITKFRTSVN KIGPDKTALT SYIRLFEPMV IKALKLYTVT
SCPHVQKQVL DLLCELIHLC VNYCLLDADQ VFLNFVIQQF EFLEEGFIPQ VETLLPSMFR
FLSLLSYDRF HSKTIISVPR IMQLAGGLMA TGKDAELHVL PTLRPMVEDL FLSRERMSDN
FKELETQKEV LINILLRLLQ YHKVYPLITA ILLQVQFENA SKWKSLSKQI CDALIPHLNK
HSLFLDSDEA LQMLYKLISH MDPPVVHESL LGLLQTFTPK QSEDLNSSAT SNYHRVLACR
LTVFKTLISV FDEAFLLDCL NEFNLSYLVN QRSDPLNVTP TENSPPDHIL SSLLVDTLKL
AFSHWKSKSF ADSCGSTSIS WKIDGFSFSV HLVLDLIALC RQLVNAEDFP VLRTAIRAEL
ARDECKYPLF EVARELIPNN PIVFARVCAL LAPEIHCFEA DDMQQFFTGL NGSIAEAGYF
LVLPGIMPKQ EEKLKAFVSV HADLILSYSS ETPVKKFFQN LVVDENLIKP LVKCVLAKSG
LSLVAAAVSL FNIIEALQIA NVEEILLEVI SSFQSLSNGV VHKKLERLLL KMNFSSAESQ
LQCLEKAQEI ILQSKNERWY SELMTKHRAS LAYIPEMLRK IDKAFLLEFA LSRNLKPIMW
QEIVTQIHLD DLPGWFSNYS FLSFALGTGR KSKPDIFVRA KTYFLNMIDT TVFDNIKILP
IARILSGMCK TIQPSELKDI FTPLEISILA RVSLTFMAAM DSLQRSLVLL PASSAIVSIF
CLELPSVWTN KETLPAVYKA IDALESLCIR LRNNTTFEHP SVTGLEHMNC PDPNFVSSAQ
QLSRIVTFLE QETHIPEVLH KDFLLIAIAF CRMPLFYSYT CIPPSVWNKG WNPVLAFDVE
RITFDLPIVP SYFLHDEVVL KEFIFRIERI CWITKKKFEE IWMTFLGILN LQNEDNVRPE
EQASVIQATC EAVHALTMVL QKNLPYRTSH TAPRANNLDE IPDFMHTELV KLRGKKLKEL
CNLLGDDCDV PSSCNSIYYF NRLSFRHIQR SIDQGSPSKK KAVSPEEITD IDIQSCVRFL
VDLYSQWLQS STTPYPLLVD IFRSLLHISD FFVDAQQFEW LLDNSFALYK QLLPDDPTSL
ESALTGCILK SASFTTLNSE YWDLIRKHCE ASLYSQHIST RASGLDGLLY LLQKLARNPD
LARTGSLVNL AMDYVGKWLN AENSGPTWYQ SVVWSMAFYC GEHFAVLPYQ HNAVVDFVVQ
CAIGWLAQNQ QQPTEIQKEI LQGLERLSVL GLLTPTWRFT VKREILAIIR NVDRDQILKG
ALRTLLSLVY SDFGSQLYQP GADPETMLAA MEWVGALFQR MKRGTDREAE ILGQIMPVIT
CDIFPPADIL NRILSEFITA RPSVACCLTP TIFKVFGTAR SQGQQSLVVE WILLSLSNFT
RSLSDESANS VWNVLCFFTA ASSNPWLQSV FPLVNQPVNP KNPMEVEIFV ASALDFRNQL
DDQQRSTLVS IFFSARDNSS FFAAVDVALK GKTYSWRMIH LFLVTSSVKQ MVNWQTPIHR
ACVIDDIHSG HDCVVN
//