ID M9PDJ5_DROME Unreviewed; 2123 AA.
AC M9PDJ5;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 24-JAN-2024, entry version 73.
DE SubName: Full=Hormone receptor 4, isoform I {ECO:0000313|EMBL:AGB95007.1};
GN Name=Hr4 {ECO:0000313|EMBL:AGB95007.1,
GN ECO:0000313|FlyBase:FBgn0264562};
GN Synonyms=CG16902 {ECO:0000313|EMBL:AGB95007.1}, CG3600
GN {ECO:0000313|EMBL:AGB95007.1}, CG42527 {ECO:0000313|EMBL:AGB95007.1},
GN CG43692 {ECO:0000313|EMBL:AGB95007.1}, DHR4
GN {ECO:0000313|EMBL:AGB95007.1}, Dmel\CG43934
GN {ECO:0000313|EMBL:AGB95007.1}, EG:133E12.2
GN {ECO:0000313|EMBL:AGB95007.1}, EG:BACH61I5.1
GN {ECO:0000313|EMBL:AGB95007.1}, EP(X)1232
GN {ECO:0000313|EMBL:AGB95007.1}, EP1232 {ECO:0000313|EMBL:AGB95007.1},
GN GRF {ECO:0000313|EMBL:AGB95007.1}, HR4 {ECO:0000313|EMBL:AGB95007.1},
GN hr4 {ECO:0000313|EMBL:AGB95007.1}, NR6A2
GN {ECO:0000313|EMBL:AGB95007.1}, null {ECO:0000313|EMBL:AGB95007.1};
GN ORFNames=CG43934 {ECO:0000313|EMBL:AGB95007.1,
GN ECO:0000313|FlyBase:FBgn0264562}, Dmel_CG43934
GN {ECO:0000313|EMBL:AGB95007.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [3] {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [5] {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [6] {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [7] {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [8] {ECO:0000313|EMBL:AGB95007.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014298; AGB95007.1; -; Genomic_DNA.
DR RefSeq; NP_001259161.1; NM_001272232.1.
DR EnsemblMetazoa; FBtr0333343; FBpp0305535; FBgn0264562.
DR GeneID; 31162; -.
DR AGR; FB:FBgn0264562; -.
DR CTD; 31162; -.
DR FlyBase; FBgn0264562; Hr4.
DR VEuPathDB; VectorBase:FBgn0264562; -.
DR GeneTree; ENSGT00940000157936; -.
DR OrthoDB; 5397661at2759; -.
DR BioGRID-ORCS; 31162; 0 hits in 1 CRISPR screen.
DR GenomeRNAi; 31162; -.
DR Proteomes; UP000000803; Chromosome X.
DR Bgee; FBgn0264562; Expressed in saliva-secreting gland and 15 other cell types or tissues.
DR ExpressionAtlas; M9PDJ5; baseline and differential.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd07168; NR_DBD_DHR4_like; 1.
DR CDD; cd06953; NR_LBD_DHR4_like; 1.
DR Gene3D; 3.30.50.10; Erythroid Transcription Factor GATA-1, subunit A; 1.
DR Gene3D; 1.10.565.10; Retinoid X Receptor; 1.
DR InterPro; IPR035500; NHR-like_dom_sf.
DR InterPro; IPR000536; Nucl_hrmn_rcpt_lig-bd.
DR InterPro; IPR001723; Nuclear_hrmn_rcpt.
DR InterPro; IPR001628; Znf_hrmn_rcpt.
DR InterPro; IPR013088; Znf_NHR/GATA.
DR PANTHER; PTHR48092; KNIRPS-RELATED PROTEIN-RELATED; 1.
DR PANTHER; PTHR48092:SF18; NUCLEAR RECEPTOR SUBFAMILY 6 GROUP A MEMBER 1; 1.
DR Pfam; PF00104; Hormone_recep; 1.
DR Pfam; PF00105; zf-C4; 1.
DR PRINTS; PR00398; STRDHORMONER.
DR PRINTS; PR00047; STROIDFINGER.
DR SMART; SM00430; HOLI; 1.
DR SMART; SM00399; ZnF_C4; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF48508; Nuclear receptor ligand-binding domain; 1.
DR PROSITE; PS51843; NR_LBD; 1.
DR PROSITE; PS00031; NUCLEAR_REC_DBD_1; 1.
DR PROSITE; PS51030; NUCLEAR_REC_DBD_2; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Receptor {ECO:0000256|ARBA:ARBA00023170, ECO:0000313|EMBL:AGB95007.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 918..993
FT /note="Nuclear receptor"
FT /evidence="ECO:0000259|PROSITE:PS51030"
FT DOMAIN 1250..1527
FT /note="NR LBD"
FT /evidence="ECO:0000259|PROSITE:PS51843"
FT REGION 31..53
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 148..327
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 379..595
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 663..851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 887..912
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1009..1101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1131..1211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1340..1382
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1527..1608
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1645..1731
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1875..1897
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1985..2009
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2074..2123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1760..1792
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 34..49
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..218
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..327
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 399..431
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 432..527
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 542..589
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 680..713
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 740..805
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 816..840
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1022..1058
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1059..1074
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1075..1101
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1148..1195
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1581..1596
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1645..1678
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1688..1731
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2074..2089
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2090..2105
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2123 AA; 226152 MW; 9E9AB14D18075871 CRC64;
MTLSRGPYSE LDKMSLFQDL KLKRRKIDSR CSSDGESIAD TSTSSPDLLA PMSPKLCDSG
SAGASLGASL PLPLALPLPM ALPLPMSLPL PLTAASSAVT VSLAAVVAAV AETGGAGAGG
AGTAVTASGA GPCVSTSSTT AAAATSSTSS LSSSSSSSSS TSSSTSSASP TAGASSTATC
PASSSSSSGN GSGGKSGSIK QEHTEIHSSS SAISAAAAST VMSPPPAEAT RSSPATPEGG
GPAGDGSGAT GGGNTSGGST AGVAINEHQN NGNGSGGSSR ASPDSLEEKP STTTTTGRPT
LTPTNGVLSS ASAGTGISTG SSAKLSEAGM SVIRSVKEER LLNVSSKMLV FHQQREQETK
AVAAAAAAAA AGHVTVLVTP SRIKSEPPPP ASPSSTSSTQ RERERERDRE RDRERERERD
RDREREREQS ISSSQQHLSR VSASPPTQLS HGSLGPNIVQ THHLHQQLTQ PLTLRKSSPP
TEHLLSQSMQ HLTQQQAIHL HHLLGQQQQQ QQASHPQQQQ QQQHSPHSLV RVKKEPNVGQ
RHLSPHHQQQ SPLLQHHQQQ QQQQQQQQQH LHQQQQQQQH HQQQPQALAL MHPASLALRN
SNRDAAILFR VKSEVHQQVA AGLPHLMQSA GGAAAAAAAA VAAQRMVCFS NARINGVKPE
VIGGPLGNLR PVGVGGGNGS GSVQCPSPHP SSSSSSSQLS PQTPSQTPPR GTPTVIMGES
CGVRTMVWGY EPPPPSAGQS HGQHPQQQQQ SPHHQPQQQQ QQQQQQSQQQ QQQQQQQSLG
QQQHCLSSPS AGSLTPSSSS GGGSVSGGGV GGPLTPSSVA PQNNEEAAQL LLSLGQTRIQ
DMRSRPHPFR TPHALNMERL WAGDYSQLPP GQLQALNLSA QQQQWGSSNS TGLGGVGGGM
GGRNLEAPHE PTDEDEQPLV CMICEDKATG LHYGIITCEG CKGFFKRTVQ NRRVYTCVAD
GTCEITKAQR NRCQYCRFKK CIEQGMVLQA VREDRMPGGR NSGAVYNLYK VKYKKHKKTN
QKQQQQAAQQ QQQQAAAQQQ HQQQQQHQQH QQHQQQQLHS PLHHHHHQGH QSHHAQQQHH
PQLSPHHLLS PQQQQLAAAV AAAAQHQQQQ QQQQQQQQQA KLMGGVVDMK PMFLGPALKP
ELLQAPPMHS PAQQQQQQQQ QQQQQQASPH LSLSSPHQQQ QQQQGQHQNH HQQQGGGGGG
AGGGAQLPPH LVNGTILKTA LTNPSEIVHL RHRLDSAVSS SKDRQISYEH ALGMIQTLID
CDAMEDIATL PHFSEFLEDK SEISEKLCNI GDSIVHKLVS WTKKLPFYLE IPVEIHTKLL
TDKWHEILIL TTAAYQALHG KRRGEGGGSR HGSPASTPLS TPTGTPLSTP IPSPAQPLHK
DDPEFVSEVN SHLSTLQTCL TTLMGQPIAM EQLKLDVGHM VDKMTQITIM FRRIKLKMEE
YVCLKVYILL NKAEVELESI QERYVQVLRS YLQNSSPQNP QARLSELLSH IPEIQAAASL
LLESKMFYVP FVLNSASIRX QMLEHGLLPT ANHQSAPPPG PDTIPERQQQ QQQERRRSRQ
QQHQLLRNLP KIKIEIGSQR DPEEEESEQE LEQEPEQEQD VQSRPSAAST SILKTSLLSG
AAATLATLTA AAEQIARSSA SCDTARDYSQ SSNSSSNSNA ATGATGTMAS GRTGSSNSLE
ESKRLEHQQQ QQQQQQQPQQ HQQQQYTSIL HTALMSQRSM PPATSATSAA AAVTSTPTTA
ATTSPTPAHV LKTAVTLASL SEIATARQQQ QEQQQQQQQQ QQQQQLLAMQ QQQQQLTKLP
RRIINFGSNH TANTATKALG AGSEAGAGAG VGMATATATA TVGRNRQLGL SQGQVVNVRT
ARLLNGNCCG RLQATAGTGA SRGTNTAGSN TVNMQTQTDD ELLPLLAMLP QDAPTTTGAA
AGAAVSLSLL ATATATTTSS SSSSSSSLSS SSTISLATRA AAVMQQQQQW TAVSTTPATG
VMMTTTTTQS QSQATLAPQQ QQLSLPQQKQ QFNTRINRKT TITTATTTAP TATTATTTTT
RNSNINFNSN NNYMNNNNIN AATARASAFA SASAMATATP TGTPTATGPR QRPAPQPPPP
PRTPTATSAS VTVVVFKTML PDD
//