ID A0A0B4JCV6_DROME Unreviewed; 2930 AA.
AC A0A0B4JCV6;
DT 01-APR-2015, integrated into UniProtKB/TrEMBL.
DT 01-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 65.
DE RecName: Full=E3 ubiquitin-protein ligase {ECO:0000256|RuleBase:RU369009};
DE EC=2.3.2.26 {ECO:0000256|RuleBase:RU369009};
GN Name=ctrip {ECO:0000313|EMBL:ADV37265.1,
GN ECO:0000313|FlyBase:FBgn0260794};
GN Synonyms=CG14656 {ECO:0000313|EMBL:ADV37265.1}, CG17735
GN {ECO:0000313|EMBL:ADV37265.1}, CTRIP {ECO:0000313|EMBL:ADV37265.1},
GN CTRIP/TRIP12 {ECO:0000313|EMBL:ADV37265.1}, Dmel\CG42574
GN {ECO:0000313|EMBL:ADV37265.1};
GN ORFNames=CG42574 {ECO:0000313|EMBL:ADV37265.1,
GN ECO:0000313|FlyBase:FBgn0260794}, Dmel_CG42574
GN {ECO:0000313|EMBL:ADV37265.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [3] {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [5] {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [6] {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [7] {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [8] {ECO:0000313|EMBL:ADV37265.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=S-ubiquitinyl-[E2 ubiquitin-conjugating enzyme]-L-cysteine +
CC [acceptor protein]-L-lysine = [E2 ubiquitin-conjugating enzyme]-L-
CC cysteine + N(6)-ubiquitinyl-[acceptor protein]-L-lysine.;
CC EC=2.3.2.26; Evidence={ECO:0000256|ARBA:ARBA00000885,
CC ECO:0000256|RuleBase:RU369009};
CC -!- PATHWAY: Protein modification; protein ubiquitination.
CC {ECO:0000256|RuleBase:RU369009}.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleoplasm
CC {ECO:0000256|ARBA:ARBA00004642}.
CC -!- SIMILARITY: Belongs to the UPL family. K-HECT subfamily.
CC {ECO:0000256|ARBA:ARBA00006331, ECO:0000256|RuleBase:RU369009}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014297; ADV37265.1; -; Genomic_DNA.
DR RefSeq; NP_001189173.1; NM_001202244.2.
DR EnsemblMetazoa; FBtr0303914; FBpp0292917; FBgn0260794.
DR GeneID; 40596; -.
DR AGR; FB:FBgn0260794; -.
DR CTD; 40596; -.
DR FlyBase; FBgn0260794; ctrip.
DR VEuPathDB; VectorBase:FBgn0260794; -.
DR OrthoDB; 1093891at2759; -.
DR UniPathway; UPA00143; -.
DR BioGRID-ORCS; 40596; 1 hit in 1 CRISPR screen.
DR GenomeRNAi; 40596; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0260794; Expressed in cleaving embryo and 28 other cell types or tissues.
DR ExpressionAtlas; A0A0B4JCV6; baseline and differential.
DR GO; GO:0016607; C:nuclear speck; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; ISS:FlyBase.
DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central.
DR GO; GO:0004842; F:ubiquitin-protein transferase activity; ISS:FlyBase.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006974; P:DNA damage response; IBA:GO_Central.
DR GO; GO:0006281; P:DNA repair; IEA:UniProtKB-KW.
DR GO; GO:0042753; P:positive regulation of circadian rhythm; IMP:FlyBase.
DR GO; GO:0045732; P:positive regulation of protein catabolic process; IMP:FlyBase.
DR GO; GO:0043161; P:proteasome-mediated ubiquitin-dependent protein catabolic process; IBA:GO_Central.
DR GO; GO:0000209; P:protein polyubiquitination; IBA:GO_Central.
DR GO; GO:0009966; P:regulation of signal transduction; IEA:UniProt.
DR GO; GO:0006511; P:ubiquitin-dependent protein catabolic process; ISS:FlyBase.
DR Gene3D; 3.30.720.50; -; 1.
DR Gene3D; 3.30.2410.10; Hect, E3 ligase catalytic domain; 1.
DR Gene3D; 3.90.1750.10; Hect, E3 ligase catalytic domains; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR000569; HECT_dom.
DR InterPro; IPR035983; Hect_E3_ubiquitin_ligase.
DR InterPro; IPR045322; HECTD1/TRIP12-like.
DR InterPro; IPR004170; WWE-dom.
DR InterPro; IPR018123; WWE-dom_subgr.
DR InterPro; IPR037197; WWE_dom_sf.
DR PANTHER; PTHR45670; E3 UBIQUITIN-PROTEIN LIGASE TRIP12; 1.
DR PANTHER; PTHR45670:SF1; E3 UBIQUITIN-PROTEIN LIGASE TRIP12; 1.
DR Pfam; PF00632; HECT; 1.
DR Pfam; PF02825; WWE; 1.
DR SMART; SM00119; HECTc; 1.
DR SMART; SM00678; WWE; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF56204; Hect, E3 ligase catalytic domain; 1.
DR SUPFAM; SSF117839; WWE domain; 1.
DR PROSITE; PS50237; HECT; 1.
DR PROSITE; PS50918; WWE; 1.
PE 1: Evidence at protein level;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0A0B4JCV6};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW Transferase {ECO:0000256|RuleBase:RU369009};
KW Ubl conjugation pathway {ECO:0000256|ARBA:ARBA00022786,
KW ECO:0000256|PROSITE-ProRule:PRU00104}.
FT DOMAIN 1186..1263
FT /note="WWE"
FT /evidence="ECO:0000259|PROSITE:PS50918"
FT DOMAIN 2631..2930
FT /note="HECT"
FT /evidence="ECO:0000259|PROSITE:PS50237"
FT REGION 1..177
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 211..584
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 622..655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 669..693
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 703..722
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 741..832
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1299..1322
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1451..1480
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1581..1604
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1623..1793
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1857..1884
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2207..2229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2498..2545
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..55
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 73..120
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 129..174
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 219..268
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..363
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 394..416
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 419..441
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 457..488
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 511..584
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 756..821
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1623..1647
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1695..1756
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1757..1771
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1773..1793
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1862..1884
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2207..2226
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 2897
FT /note="Glycyl thioester intermediate"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00104"
SQ SEQUENCE 2930 AA; 314167 MW; 56318021F699A64E CRC64;
MAESVKSQSL SALTEGQHDD GGSTATSATL ANKTSGTNTT SASNHSSQRR RNNNTNSNRN
KSHHKNEKRN APATSGPAVS SSVDVVPATS TASATSTRRS RSQGRHSSAA ALSSKESQIP
KRSVGAIRSP SSPAFNSIPV VSDQSLSPGS RKRQLNQHNN SNNKASSSHQ SAPGGDQLVC
APLKKRRLQQ SLGSGICEGA VEINLGAASF DHTPRQASGD CVAQRTRSKT ISPEELPSTS
SAAAARHHQI RPSASSTSFS SSHRNKRKAS IGGSIAAVET TPHRGTAAAR AGRSGNLLNY
YRKTRKVSHT RSSQKSEKQS IPAEDTTANR NGSAGSGSSS KSGVIPKNRK SLRHSQQNLL
DTEAAEVESA AAETGGEQQS EHGNLDVIDQ LPTPEAGPNQ SEASGIHNQS QLEADRQPVV
QDEDDEEDDE EEEEEEEEEV GFYGIVNSAG SSYEEDTQIV AEEDEITTEE EVDDEDDDEI
EEEDLSESEF AQQLIGELGG AGPPPALFQQ QAPPQLTRYT PATATTAATF VPPQQQQVVY
NPQQQQHSSL RRSSRGKTGS CVSSAAAAQQ QQHHQQQQHS SAAAAVQQQL LPPPGTYQYQ
QVGGNTVVVA VRHQQQLQQQ QQQQQQLALH HQHQQQQHQQ QQQQHQQQQH QQQQRATLVQ
PGFLFSYRSN QPQQQQQPAQ QIHQGPKVTH SSAASDALTY SLMAQQPPSG PPHAGGQQIT
PGANSANLSI VAAALSAARD VGGGSDGGGS AGGATPATGA SASSVGNTSA VGASSSSNSS
AGQAASSNSN NVTATGSGSA PGGGPTSTGT TSGTQHGSGS GAAAAVDSES DDSEVGRLQA
LLEARGLPPH LFGALGPRVT HILHRTIGNS SSSKANQLLQ GLQSHDESQQ LQAAIEMCQM
LVMGNEDTLA GFPIKQVVPA LIQLLRMEHN FDIMNNACRA LAYMLEALPR SSGTVVEAVP
VFLEKLQVIQ CMDVAEQSLS ALEILSRRHN KAILQANGIS ACLTYLDFFS IVAQRAALAI
AANCCLNMHP EEFHFVAESL PLLARLLSQQ DKKCIESVCS AFCRLVESFQ HDGQRLQQIA
SPDLLKNCQQ LLLVTPAILN TGTFTAVVRM LSLMCCSCPD LAISLLRNDI AATLLYLLTG
NAEPAAASAT HVELISRSPS ELYELTCLIG ELMPRLPLDG IFAVDSLLDR PTLNTQDQVH
WQWRDDRGSW HNYSTIDSRL IEAANQSSED EISLSTFGRT YTVDFHAMQQ INEDTGTTRP
VQRRLNHNYV APMSAGQDLT TTSAGSAAAG GASTSAAAAA ASSNNNNNNN NNPPGNSVNL
NQVKRRPSLD ARIACLKEER GLAADFIKHI FNVLYEVYSS SAGPNVRYKC LRALLRMVYY
ATPELLRQVL KYQLVSSHIA GMLGSNDLRI VVGALQMAEI LMRQLPDVFG THFRREGVIY
QFTQLTDPNN PICANPSPKP LSATATPTAN AGGSQSAPAS ANSLQVNPFF MDSAPGLSSA
STTPSSSKHQ SYSVKSFSHA MNALTASAKG TPSGALDATS SSTTAGGYNY SSSAPSSSSG
APAAYFVTQQ GDPRQYVHFQ QPAVPAPPPQ QELLPSGVQQ QGQQVPQVIY QPHHQQPAHL
VLASTSSGAA SSSSSSSSSS SASALQHKMT DMLKRKAPPK RKSQSGGRAK SRQEDAAVAP
AGSGPGGAPP SSSGSAMHEL LSRATSLGSG NGGRSTPNSG GGSGSSKSRF NAGNSSNAGS
SKSSFLASLN PARWGRQTHQ NHHHHHQQSQ QQHHGLSKDS GNSNSTGSGS GAGLAYTVSQ
HGAGSGAGGL NAAAVAASIS KSISHANLLA AANRERARQW VREQAVDFVK RYTEQEAKRS
KAASESGATQ SGSSGVGLSS TGNTPLSTAG STNVLERLSS ILFKLNGSYH DCLDALLELK
TILLESDISP FEVNHSGLIK AMLNYMTSET GLVERDARLR SFMHVFAGLP LEPLLQNVGQ
MPTIEPIAFG AFVAKLNGCV TQLEQFPVKV HDFPAGPGGR SNQSALRFFN THQLKCNLQR
HPQCNNLRQW KGGTVKIDPL AMVQAIERYL VVRGYGGIRA DSDDDSEEDM DDNVAAVVLS
QASFRHKLQF TIGDHVLPYN MTVYQAVKQF SPLVSEQPET DNESETLLGN ASIWVQQHTI
HYRPVEEEVT SGAAAGAASS SSSCSSGVQK QQSSSSSASS CVNATSSCSS SSGVASGGGS
LTKKAHKSSS KFMRKKTELW HEGIAPVVIS ALKPFLSSSL PADVVTVQDA SLDALCMLRV
IHALNRHWDH LYGCVVRQNI IPQSDFIHPK IMAKANRQLQ DPLVIMTGNL PQWLPQIGMA
CPFLFPFETR HLLFYATSFD RDRALQRLLD TTPDLNAAES SERVAPRLDR RKRAISRTEI
LKQAEHILQD FGHSKALLEI QYENEVGTGL GPTLEFYALV SAELQRTDLG LWNGSDSYKQ
NSVTIVDVVK ANSAVLHIED ALEATTTDQN TPAVAGASLV SSSTTTTTTT AQQHQHPPTR
SSSRSHVLRS GAGQQPVEHS SSSAGANENA LNMVIAQQFS DTNSANPAAI DNPSSTTTAT
TVVQHNTTTN NSSIITTTTT TSYVHAVHGL FPLPLGKSSK LPQMTKAKAK FKFLGKFMAK
AVMDSRMLDL PFSLPFYRWL VSEEHSIGLA DLMRVAPEVQ NTLVRLQDLV RQREYILSDP
NIDAMEKTEK IEQLDLDGCP IADLGLDFVL PGHANIELCR GGRDTPVTVH NLHQYISLVT
YWFLIEGVQK QFEALREGFD SVFPIQRLRM FYPEELECVF CGSGSEQQHS RWEIKMLQES
CRTDHGFHQD SQAIQYLYEI LASYNRDEQR AFLQFVTGSP RLPTGGFKAL TPPLTIVRKT
LDENQNPNDY LPSVMTCVNY LKLPDYSSRE VMRQKLKVAA NEGSMSFHLS
//