ID A0A0B4LG27_DROME Unreviewed; 3603 AA.
AC A0A0B4LG27;
DT 01-APR-2015, integrated into UniProtKB/TrEMBL.
DT 01-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=Starry night, isoform G {ECO:0000313|EMBL:AHN56089.1};
GN Name=stan {ECO:0000313|EMBL:AHN56089.1,
GN ECO:0000313|FlyBase:FBgn0024836};
GN Synonyms=CT20776 {ECO:0000313|EMBL:AHN56089.1}, Dmel\CG11895
GN {ECO:0000313|EMBL:AHN56089.1}, Flam {ECO:0000313|EMBL:AHN56089.1}, FMI
GN {ECO:0000313|EMBL:AHN56089.1}, Fmi {ECO:0000313|EMBL:AHN56089.1}, fmi
GN {ECO:0000313|EMBL:AHN56089.1}, fmi/stan {ECO:0000313|EMBL:AHN56089.1},
GN STAN {ECO:0000313|EMBL:AHN56089.1}, Stan
GN {ECO:0000313|EMBL:AHN56089.1};
GN ORFNames=CG11895 {ECO:0000313|EMBL:AHN56089.1,
GN ECO:0000313|FlyBase:FBgn0024836}, Dmel_CG11895
GN {ECO:0000313|EMBL:AHN56089.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [3] {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [5] {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [6] {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [7] {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [8] {ECO:0000313|EMBL:AHN56089.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004651};
CC Multi-pass membrane protein {ECO:0000256|ARBA:ARBA00004651}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE013599; AHN56089.1; -; Genomic_DNA.
DR RefSeq; NP_001286291.1; NM_001299362.1.
DR SMR; A0A0B4LG27; -.
DR EnsemblMetazoa; FBtr0339463; FBpp0308549; FBgn0024836.
DR GeneID; 36125; -.
DR AGR; FB:FBgn0024836; -.
DR CTD; 36125; -.
DR FlyBase; FBgn0024836; stan.
DR VEuPathDB; VectorBase:FBgn0024836; -.
DR GeneTree; ENSGT00940000168029; -.
DR OrthoDB; 4006628at2759; -.
DR BioGRID-ORCS; 36125; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 36125; -.
DR Proteomes; UP000000803; Chromosome 2R.
DR Bgee; FBgn0024836; Expressed in eye disc (Drosophila) and 24 other cell types or tissues.
DR ExpressionAtlas; A0A0B4LG27; baseline and differential.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004930; F:G protein-coupled receptor activity; IEA:UniProtKB-KW.
DR GO; GO:0048513; P:animal organ development; IEA:UniProt.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProt.
DR GO; GO:0007166; P:cell surface receptor signaling pathway; IEA:InterPro.
DR GO; GO:0016043; P:cellular component organization; IEA:UniProt.
DR GO; GO:0001736; P:establishment of planar polarity; IEA:UniProt.
DR GO; GO:0007163; P:establishment or maintenance of cell polarity; IEA:UniProt.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR GO; GO:0048731; P:system development; IEA:UniProt.
DR CDD; cd15441; 7tmB2_CELSR_Adhesion_IV; 1.
DR CDD; cd11304; Cadherin_repeat; 8.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00055; EGF_Lam; 1.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.60.220.50; -; 1.
DR Gene3D; 2.60.40.60; Cadherins; 9.
DR Gene3D; 4.10.1240.10; GPCR, family 2, extracellular hormone receptor domain; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 1.20.1070.10; Rhodopsin 7-helix transmembrane proteins; 1.
DR Gene3D; 2.170.300.10; Tie2 ligand-binding domain superfamily; 1.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR020894; Cadherin_CS.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR032471; GAIN_dom_N.
DR InterPro; IPR046338; GAIN_dom_sf.
DR InterPro; IPR017981; GPCR_2-like_7TM.
DR InterPro; IPR036445; GPCR_2_extracell_dom_sf.
DR InterPro; IPR001879; GPCR_2_extracellular_dom.
DR InterPro; IPR000832; GPCR_2_secretin-like.
DR InterPro; IPR000203; GPS.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR002049; LE_dom.
DR PANTHER; PTHR24026; FAT ATYPICAL CADHERIN-RELATED; 1.
DR PANTHER; PTHR24026:SF51; PROTOCADHERIN-LIKE WING POLARITY PROTEIN STAN; 1.
DR Pfam; PF00002; 7tm_2; 1.
DR Pfam; PF00028; Cadherin; 8.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF16489; GAIN; 1.
DR Pfam; PF00053; Laminin_EGF; 1.
DR Pfam; PF02210; Laminin_G_2; 2.
DR PRINTS; PR00205; CADHERIN.
DR PRINTS; PR00249; GPCRSECRETIN.
DR SMART; SM00112; CA; 8.
DR SMART; SM00181; EGF; 5.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00180; EGF_Lam; 1.
DR SMART; SM00303; GPS; 1.
DR SMART; SM00008; HormR; 1.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49313; Cadherin-like; 9.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF81321; Family A G protein-coupled receptor-like; 1.
DR PROSITE; PS00232; CADHERIN_1; 5.
DR PROSITE; PS50268; CADHERIN_2; 8.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 1.
DR PROSITE; PS50227; G_PROTEIN_RECEP_F2_3; 1.
DR PROSITE; PS50261; G_PROTEIN_RECEP_F2_4; 1.
DR PROSITE; PS50221; GPS; 1.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00043}; Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW G-protein coupled receptor {ECO:0000256|ARBA:ARBA00023040};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transducer {ECO:0000256|ARBA:ARBA00023224};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..3603
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002092765"
FT TRANSMEM 2817..2837
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2849..2868
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2888..2908
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2920..2940
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2960..2980
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3001..3023
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3029..3052
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 360..464
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 465..581
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 582..689
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 690..794
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 795..897
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 898..1007
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1008..1113
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1114..1220
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1482..1518
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1556..1753
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1756..1792
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1796..1963
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1965..2000
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2095..2142
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 2115..2201
FT /note="G-protein coupled receptors family 2 profile 1"
FT /evidence="ECO:0000259|PROSITE:PS50227"
FT DOMAIN 2812..3053
FT /note="G-protein coupled receptors family 2 profile 2"
FT /evidence="ECO:0000259|PROSITE:PS50261"
FT REGION 2553..2582
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2610..2635
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2654..2684
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3111..3225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3343..3377
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3458..3486
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3530..3564
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2560..2582
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2665..2679
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3111..3164
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3165..3187
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3195..3210
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3211..3225
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3472..3486
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3536..3553
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1508..1517
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1782..1791
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1969..1979
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1990..1999
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2095..2107
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 2097..2114
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 2116..2125
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ SEQUENCE 3603 AA; 400298 MW; 339C39C77B66142F CRC64;
MQTREFPQRP LGLLLVLLVV LLQSSLIKSY LIIVHEDTPP GTVIFNASVY KLGSERHYKI
NAHKSANFVH HLVSVNHKDG QIQLRKALKC DGIYYPNLFT FYVDSTSNRL RSIDYYSLPV
RIFVSGHSCN EDRRIEQELH HHHYEEEDNT GYSKRRRRRS TQEMIQLNGN QLEEVFRQNS
TEFRAGDLIF GDSFDNEMRH RILSRKRRAV GSPDPLHLQP ALHRRISDAK QWISETYASY
AIHTTDKWNQ ICLRRSQFIN SLNAFLPRSV CQHCKVSFLD VNDERFAIEH QSRDLVASRD
VCIAESMWKV SITFNIRCDR RDIVDSDHRL KIVYHHQEFN DTDIARRVRR ELRNQSPYFE
QALYVASVLE EQPAGAAVTT VRARDPEDSP VVYSMVSLLD SRSQSLFKVD SRTGVVTTSA
SLDRELMDVH YFRVVATDDS FPPRSGTTTL QVNVLDCNDH SPTFEAEQFE ASIREGATVG
STVITLRATD QDIGKNAEIE YGIEAVTDGA GLAQDQEMPI FRIDSRSGVI STRSSLDRET
SDSYHLLVTA ADLASAQSER RTATASVQVK VLDDNDNYPQ FSERTYTVQV PEDQWGGTED
NTVAHIRATD ADQGNNAAIR YAIIGGNTQS QFSIDSMSGD VSLVKPLDYE SVRSYRLVIR
AQDGGSPSRS NTTQLLVNVI DANDNAPRFY TSQFQESVLE NVPVGYNIIR VQAYDSDEGA
NAEITYSISE RDDNFPLAVD PRTGWVQTIK PLDREEQGRF AFQVVAKDGG VPPKSASSSV
VITVQDVNDN DPAFNPKYYE ANVGEDQPPG TPVTTVTATD PDEDSRLHYE ITTGNTRGRF
AITSQNGRGL ITIAQSLDYK QEKRFLLTVA ATDSGGRSDT ATVHINITDA NNFAPIFENA
PYSASVFEDA PVGTTVLVVS ATDSDVGVNA QITYSLNEES INGLGSPDPF SINPQTGAIV
TNAPLDRETT SGYLLTVTAK DGGNPSLSDT TDVEIGVTDV NDNAPAFKSP LYQASILEDA
LVGTSVIQVA ASDPDVGLNG RIKYLLSDRD IEDGSFVIDP TSGTIRTNKG LDRESVAVFH
LTAIAVDKGS PPLSSTVEVQ IRLEDVNDSP PTFASDKITL YVPENSPVGS VVGEIHAHDP
DEGVNAVVHY SIIGGDDSNA FSLVTRPGSE RAQLLTMTEL DYESTRKRFE LVVRAASPPL
RNDAHIEILV TDVNDNAPVL RDFQVIFNNF RDHFPSGEIG RIPAFDADVS DKLHYRILSG
NNANLLRLNS SSGGLVLSPQ LNTNVPKFAT MEVSVSDGIN EAKAIMQLSV RLITEDMLFN
SVTVRLNEMT EEAFLSPLLN FFLDGLAAII PCPKEHIFVF SIQDDTDVSS RILNVSFSAR
RPDVSHEEFY TPQYLQERVY LNRAILARLA TVEVLPFDDN LCVREPCLNF EECLTVLKFG
NASEFIHSDT VLFRPIYPVN TFACSCPEGF TGSKEHYLCD TEVDLCYSDP CQNGGTCVRR
EGGYTCVCPS THTGQNCETG VGHLRPCPSE TCEGGLSCLS NYPSSQPPPY TATCELRARA
FGRNSFLTFE SLKQRHRFNL KLRFATVQEN GLLLYNGRYN ELHDFIALEI HEGHVSFSFS
LGDHSERISV IQEAKVSDGK WHQVEVVYLN RSVTLVLDNC DTAIALSGQL GDRWSCANRT
TLKLDKRCSL LTETCHRFLD LTGPLQVGGL PRIPAHFPVT NRDFVGCISD LRIDDRFVDL
NSYVADNGTL AGCPQKAPLC QSEPCFNGGT CREGWGTYSC ECPEGYAGNS CQDNIPAPWR
FSGDGSLSFN PLLRPIQLPW TTSFSLRTRQ KEAFLLQIQI GQNSSAAVCL RQGVLYYIFD
GEPMYLAGAF LSDGEWHRVE IRWQQGSEIH FSVDYGQRSG SVPMSQKVQG LYVGKIVMGS
PDGSIGAVPE ASPFEGCIQD VRIGAGQSVL SRPTIRENVE DGCESRAQCP DHCPNHSSCQ
SSWDLSTCEC DSGYVGTDCA PICTVRPCAS GVCRANTSLP RGYDCECNSS SRHGDYCEKE
LQQPCPGGWW GERVCGPCRC DLAQGYHPDC NKTTGQCYCK TNHYQPPNET ACLSCDCYSI
GSFSGACNPL TGQCECREGV IGRRCDSCSN PYAEVTLSGC EVVYDACPRS FAGGVWWPRT
PLGGVAIEGC PPPARGKGQR SCDVQSGSWN TPDMYNCTSE PFVELRRQLS QLEKLELELN
SFVAIKMAEQ LRKACEAVDR RGASKDQKIS GNGRPNRRYK MESSFLLSNG GNVWSHELEM
DYLSDELKFT HDRLYGADLL VTEGLLQELI NYELMQSGLN LSHSQDKYFI KNLVDAASVI
LDRKYEAEWR RATELIQRGP DDLVDAFNKY LVVLARSQHD TYTSPFEIVQ PNMALGLDIV
TTESLFGYEP EQLSEYHRSK YLKPNAFTTE SVVLPDTSGF LQHSARQRPV ISFPKYNNYI
LDRRKFDQHT KVLVPLEMLG ITPPESDEIS QSGRRGSSHD HRAIVAYAQY KDVGQLLPDL
YDETITRRWG VDVELATPIL SLQILVPSME REQETQRLEI PSRKIFSSSS PSSSSSSGST
EQQFVEVFDV PKAPTSSSEQ QIEDIRITAH EIPPPVSSVE QQEASSDEDG EEREPHIRLN
LDDIEFHGNS GEEVISPDSP EMLNPNYEGV SSTGSDEQPK GENEAVYRDR RLVKRQVEIT
YPSEQMQQTE QVVYRSLGSP HLAQPIKLQM WLDVDSARFG PRSNPQCVRW NSFTNQWTRL
GCQTEIPDFD GDFNPAAQQA ILVNCSCTHI SSYAVIVDVI DPEDIPEPSL LVQITSYSAF
LVSLPLLLGV LLALALLRGQ QTNSNTIHQN IVLCVFCAEL LFFVGMQSRR QLLESEFPCK
LTAICLHYFW LAAFAWTTVD CVHLYRMLTE MRDINHGPMG FYFAMGYGAP AIVVGLSVGV
RAHEYGNSLF CWLSVYEPVV WWLVGPIAGM SVVNLLILFV SVKAAFTLKD HVLGFGNLRT
LLWLSVVSLP LMGVMWVLAV LAASEHSQLL SLLLSGVVLL HALFCLIGYC IINKRVRENL
QRTCLRCMGR KVPLLDSSMV VSNSSHNVNA AARPSNFLAS GYDTTTRRNI GISASSTTSR
STAKTSSSPY SDGQLRQTST STSNYNSASD APSFLRGFES STTGRSRGGE EKPSRRQRKD
SDSGSETDGR SLELASSHSS DDDESRTARS SGTHRSTAVS STPAYLPNIT EHVQATTPPE
LNVVQSPQLF PSVNKPVYAP RWSSQLPDAY LQSPPNIGRW SQDTGSDNEH VHGQAKMTIS
PNPLPNPDLT DTSYLQQHHN KINMPPSILE NIRDAREGYE DSLYGRRGEY PDKYGSYKPP
SHYGSEKDYP GGGSGSQTIG HMRSFHPDAA YLSDNIYDKQ RTLGSGYLGA KSESPYLSKD
RITPDIYGSR DGHYSLKRQP AYATDSLHSV HSLLKNDYHQ QQQQQQQHHL QDRLSEGSDK
NGYHFPYTAE EDHLPARKLS HTQPPSLHGS QLMQPPGVGL VNDVNNPGLM GRHTLNGGSR
HSSRASSPPS TMVAPMQPLG PLTSITDTDS EFDIIWLHRR QQRQQYPWSL TIWRNIDDDE
TTV
//