ID Q7KSE8_DROME Unreviewed; 2556 AA.
AC Q7KSE8;
DT 05-JUL-2004, integrated into UniProtKB/TrEMBL.
DT 05-JUL-2004, sequence version 1.
DT 27-MAR-2024, entry version 155.
DE SubName: Full=Osa, isoform C {ECO:0000313|EMBL:AAS65166.1};
GN Name=osa {ECO:0000313|EMBL:AAS65166.1,
GN ECO:0000313|FlyBase:FBgn0261885};
GN Synonyms=anon-WO0118547.314 {ECO:0000313|EMBL:AAS65166.1},
GN anon-WO0172774.126 {ECO:0000313|EMBL:AAS65166.1}, C819
GN {ECO:0000313|EMBL:AAS65166.1}, Dmel\CG7467
GN {ECO:0000313|EMBL:AAS65166.1}, E(E2F)3C {ECO:0000313|EMBL:AAS65166.1},
GN eld {ECO:0000313|EMBL:AAS65166.1}, en(lz)4F/4H
GN {ECO:0000313|EMBL:AAS65166.1}, eyelid {ECO:0000313|EMBL:AAS65166.1},
GN l(3)00090 {ECO:0000313|EMBL:AAS65166.1}, l(3)04539
GN {ECO:0000313|EMBL:AAS65166.1}, l(3)j9C3 {ECO:0000313|EMBL:AAS65166.1},
GN OSA {ECO:0000313|EMBL:AAS65166.1}, Osa {ECO:0000313|EMBL:AAS65166.1},
GN p300 {ECO:0000313|EMBL:AAS65166.1};
GN ORFNames=CG7467 {ECO:0000313|EMBL:AAS65166.1,
GN ECO:0000313|FlyBase:FBgn0261885}, Dmel_CG7467
GN {ECO:0000313|EMBL:AAS65166.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [3] {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [5] {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [6] {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [7] {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [8] {ECO:0000313|EMBL:AAS65166.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014297; AAS65166.1; -; Genomic_DNA.
DR RefSeq; NP_996228.1; NM_206506.2.
DR SMR; Q7KSE8; -.
DR EnsemblMetazoa; FBtr0089583; FBpp0088962; FBgn0261885.
DR GeneID; 42130; -.
DR UCSC; CG7467-RC; d. melanogaster.
DR AGR; FB:FBgn0261885; -.
DR CTD; 42130; -.
DR FlyBase; FBgn0261885; osa.
DR VEuPathDB; VectorBase:FBgn0261885; -.
DR GeneTree; ENSGT00940000169092; -.
DR OrthoDB; 5477968at2759; -.
DR BioGRID-ORCS; 42130; 1 hit in 3 CRISPR screens.
DR ChiTaRS; osa; fly.
DR GenomeRNAi; 42130; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0261885; Expressed in eye disc (Drosophila) and 26 other cell types or tissues.
DR ExpressionAtlas; Q7KSE8; baseline and differential.
DR GO; GO:0035060; C:brahma complex; IEA:InterPro.
DR GO; GO:0016514; C:SWI/SNF complex; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0006338; P:chromatin remodeling; IEA:InterPro.
DR CDD; cd16865; ARID_ARID1A-like; 1.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR021906; BAF250/Osa.
DR InterPro; IPR033388; BAF250_C.
DR PANTHER; PTHR12656; BRG-1 ASSOCIATED FACTOR 250 BAF250; 1.
DR PANTHER; PTHR12656:SF5; TRITHORAX GROUP PROTEIN OSA; 1.
DR Pfam; PF01388; ARID; 1.
DR Pfam; PF12031; BAF250_C; 1.
DR SMART; SM01014; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SUPFAM; SSF46774; ARID-like; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51011; ARID; 1.
PE 1: Evidence at protein level;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Proteomics identification {ECO:0007829|PeptideAtlas:Q7KSE8};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803}.
FT DOMAIN 1000..1091
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT REGION 1..969
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1108..1606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1754..1776
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1885..1964
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2360..2457
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2524..2556
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..20
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 45..66
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 67..86
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..109
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..211
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 233..251
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 259..293
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 294..348
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 350..386
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 410..470
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 471..503
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 511..549
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 552..570
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 571..593
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 594..623
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 651..684
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 749..764
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 803..826
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 827..846
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 857..872
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 881..932
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1126..1146
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1206..1223
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1265..1282
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1344..1364
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1371..1386
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1430..1444
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1475..1496
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1754..1770
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1885..1916
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1917..1933
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2380..2457
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2556 AA; 268969 MW; 7F09E49CCE81D21E CRC64;
MNEKIKSPQT QQQQQGGAPA PAATPPSAGA APGAATPPTS GPPTPNNNSN NGSDPSIQQQ
QQNVAPHPYG APPPPGSGPG GPPGPDPAAV MHYHHLHQQQ QQHPPPPHMQ QQQHHGGPAP
PPPGGAPEHA PGVKEEYTHL PPPHPHPAYG RYHADPNMDP YRYGQPLPGG KPPQQQQPHP
QQQPPQQPGP GGSPNRPPQQ RYIPGQPPQG PTPTLNSLLQ SSNPPPPPQH RYANTYDPQQ
AAASAAAAAA AQQQQAGGPP PPGHGPPPPQ HQPSPYGGQQ GGWAPPPRPY SPQLGPSQQY
RTPPPTNTSR GQSPYPPAHG QNSGSYPSSP QQQQQQQQQQ QQQAGQQPGG PVPGGPPPGT
GQQPPQQNTP PTSQYSPYPQ RYPTPPGLPA GGSNHRTAYS THQYPEPNRP WPGGSSPSPG
SGHPLPPASP HHVPPLQQQP PPPPHVSAGG PPPSSSPGHA PSPSPQPSQA SPSPHQELIG
QNSNDSSSGG AHSGMGSGPP GTPNPQQVMR PTPSPTGSSG SRSMSPAVAQ NHPISRPASN
QSSSGGPMQQ PPVGAGGPPP MPPHPGMPGG PPQQQQSQQQ QASNSASSAS NSPQQTPPPA
PPPNQGMNNM ATPPPPPQGA AGGGYPMPPH MHGGYKMGGP GQSPGAQGYP PQQPQQYPPG
NYPPRPQYPP GAYATGPPPP PTSQAGAGGA NSMPSGAQAG GYPGRGMPNH TGQYPPYQWV
PPSPQQTVPG GAPGGAMVGN HVQGKGTPPP PVVGGPPPPQ GSGSPRPLNY LKQHLQHKGG
YGGSPTPPQG PQGYGNGPTG MHPGMPMGPP HHMGPPHGPT NMGPPTSTPP QSQMLQGGQP
QGQGASGGPE SGGPEHISQD NGISSSGPTG AAGMHAVTSV VTTGPDGTSM DEVSQQSTLS
NASAASGEDP QCTTPKSRKN DPYSQSHLAP PSTSPHPVVM HPGGGPGEEY DMSSPPNWPR
PAGSPQVFNH VPVPQEPFRS TITTTKKSDS LCKLYEMDDN PDRRGWLDKL RAFMEERRTP
ITACPTISKQ PLDLYRLYIY VKERGGFVEV TKSKTWKDIA GLLGIGASSS AAYTLRKHYT
KNLLTFECHF DRGDIDPLPI IQQVEAGSKK KTAKAASVPS PGGGHLDAGT TNSTGSSNSQ
DSFPAPPGSA PNAAIDGYPG YPGGSPYPVA SGPQPDYATA GQMQRPPSQN NPQTPHPGSP
YPSQPGAYGQ YGSSDQYNAT GPPGQPFGQG PGQYPPQNRN MYPPYGPEGE APPTGANQYG
PYGSRPYSQP PPGGPQPPTQ TVAGGPPAGG APGAPPSSAY PTGRPSQQDY YQPPPDQSPQ
PRRHPDFIKD SQPYPGYNAR PQIYGAWQSG TQQYRPQYPS SPAPQNWGGA PPRGAAPPPG
APHGPPIQQP AGVAQWDQHR YPPQQGPPPP PQQQQQPQQQ QQQPPYQQVA GPPGQQPPQA
PPQWAQMNPG QTAQSGIAPP GSPLRPPSGP GQQNRMPGMP AQQQQSQQQG GVPQPPPQQA
SHGGVPSPGL PQVGPGGMVK PPYAMPPPPS QGVGQQVGQG PPGGMMSQKP PPMPGQAMQQ
QPLQQQPPSH QHPHPHQHPQ HQHPHQMPPN QTAPGGYGPP GMPGGGAQLV KKELIFPHDS
VESTTPVLYR RKRLMKADVC PVDPWRIFMA MRSGLLTECT WALDVLNVLL FDDSTVQFFG
ISNLPGLLTL LLEHFQKNLA EMFDERENEE QSALLAEDAD DDADSGTVMC EKLRTSGRQP
RCVRSISSYN RRRHYENMDR SGKDGAGNGS DSEDADEGID LGQVRVQPNP EERSLLLSFT
PNYTMVTRKG VPVRIQPAEN DIFVDERQKA WDIDTNRLYE QLEPVGSDAW TYGFTEPDPL
DGIIDVFKSE IVNIPFARYI RSDKKGRKRT ELASSSRKPE IKTEENSTEE QTFNKKRRLV
SGGSSSSGAH AEGKKSKLTS EEFAQPNAEV KKEPGTADSD CRPVDMDIEA PQQRLTNGVA
PCSSTPAIFD PRTTAKDEAR VLQRRRDSSF EDECYTRDEA SLHLVSESQD SLARRCIALS
NIFRNLTFVP GNETVLAKST RFLAVLGRLL LLNHEHLRRT PKTRNYDREE DTDFSDSCSS
LQGEREWWWD YLITIRENML VAMANIAGHL ELSRYDELIA RPLIDGLLHW AVCPSAHGQD
PFPSCGPNSV LSPQRLALEA LCKLCVTDAN VDLVIATPPF SRLEKLCAVL TRHLCRNEDQ
VLREFSVNLL HYLAAADSAM ARTVALQSPC ISYLVAFIEQ AEQTALGVAN QHGINYLREN
PDSMGTSLDM LRRAAGTLLH LAKHPDNRSL FMQQEQRLLG LVMSHILDQQ VALIISRVLY
QVSRGTGPIH SVEFRLLQQR QQQQLRPGPA GKQAASAGGS ATVKAETAST ETSSTEAKPA
PAATTAVVND ENSNSSQQLP PAATFNDVSN SSTNSNSCGT ASSNQTNNST TNSSHSSSAI
SSQSAITVAA PSAAATGAGS ATAAAIASDQ QQVSKVAAAA AAAAALSNAS AAAAAAAAAA
AASVGPPTSS SVSAGAAVAQ PAAPPPTNAG TTTAVA
//