ID A7S7E0_NEMVE Unreviewed; 999 AA.
AC A7S7E0;
DT 02-OCT-2007, integrated into UniProtKB/TrEMBL.
DT 02-OCT-2007, sequence version 1.
DT 24-JAN-2024, entry version 51.
DE RecName: Full=Glutamine-rich protein 2 {ECO:0008006|Google:ProtNLM};
GN ORFNames=NEMVEDRAFT_v1g243361 {ECO:0000313|EMBL:EDO40381.1};
OS Nematostella vectensis (Starlet sea anemone).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Actiniaria;
OC Edwardsiidae; Nematostella.
OX NCBI_TaxID=45351 {ECO:0000313|EMBL:EDO40381.1, ECO:0000313|Proteomes:UP000001593};
RN [1] {ECO:0000313|EMBL:EDO40381.1, ECO:0000313|Proteomes:UP000001593}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CH2 X CH6 {ECO:0000313|Proteomes:UP000001593};
RX PubMed=17615350; DOI=10.1126/science.1139158;
RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., Salamov A.,
RA Terry A., Shapiro H., Lindquist E., Kapitonov V.V., Jurka J.,
RA Genikhovich G., Grigoriev I.V., Lucas S.M., Steele R.E., Finnerty J.R.,
RA Technau U., Martindale M.Q., Rokhsar D.S.;
RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire and
RT genomic organization.";
RL Science 317:86-94(2007).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS469592; EDO40381.1; -; Genomic_DNA.
DR RefSeq; XP_001632444.1; XM_001632394.1.
DR AlphaFoldDB; A7S7E0; -.
DR STRING; 45351.A7S7E0; -.
DR EnsemblMetazoa; EDO40381; EDO40381; NEMVEDRAFT_v1g243361.
DR KEGG; nve:5512092; -.
DR eggNOG; ENOG502R9P3; Eukaryota.
DR HOGENOM; CLU_300021_0_0_1; -.
DR InParanoid; A7S7E0; -.
DR OMA; DQFDAAC; -.
DR OrthoDB; 4491024at2759; -.
DR PhylomeDB; A7S7E0; -.
DR Proteomes; UP000001593; Unassembled WGS sequence.
DR InterPro; IPR032013; DUF4795.
DR PANTHER; PTHR46766; GLUTAMINE-RICH PROTEIN 2; 1.
DR PANTHER; PTHR46766:SF1; GLUTAMINE-RICH PROTEIN 2; 1.
DR Pfam; PF16043; DUF4795; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000001593}.
FT REGION 56..114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 284..377
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 804..977
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 213..266
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 446..513
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 585..612
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 61..97
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 98..114
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..316
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..376
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 855..872
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 909..923
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 951..970
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 999 AA; 112288 MW; E9A87EBBE3A002A1 CRC64;
MVATDLELHK LVDLSLRTPE VGVVNFNHLH KLLHAILDHI GLGYDGKNIT KVYASKSPVS
ESKTRTDHTS EAEDVKDEGD TKATNETEKE KEYAGTEEST VVSEQETLKS PTNAVRAVQE
DEIERLQSVQ RYMQSKISEL EEKLKILDAL PTNRSIIEKA QQSESTNQLT SKDNALAGIW
QFLKINSRLL AAEEAIDRIM NILNEFLGSG KSVGELNDNL QDIAKELSNL KSQINNNNDA
ISSTEHQEAM GKIEEIQRKL SGLASKEDLR EYVKWPALEN ALNAGKQRNS SAKNETKTHT
EDAKLDDSEL ESSTPEDGRP KTVPVKPTQS PVPTPEVPKT AMTRSEFTQV TSNDLPDKEP
DKDADDAEDS SHPSQEMREC LRQIGELSDK NRIVEEQVKV IKEVLPNKID KTELDIPEDL
KNRIDAMQAD LDSISQKMYG NSSGELDELR QLYQENRDKI EAIKRELAAM ARQAKQAYIQ
RPSSGKGVDN EAMDNIRALL KDVQDEQEKA AKKMDYKLTD LQDDLSQKRE HIEALYNYVQ
KLQDSKADKD NVAIEMDIKA DKEALDSKVS LTTFDNSFNM LDEGLREALQ KMDDYMNEEL
ALKQALKQLS CEMKEKMDGQ AFKAMQDFLE RRIADVQKSR NMIAREAKVE LTGAAGIRRP
LLNFHCISCN RPVEVPYNRD AQQSLPSPQA VKLKRSKGPY ISYEMDQIRH YQRTFQQPAL
HPIEKPMNEF VAKRPCGGSH TVLNHTIRKP TRIVSSQLSN VLAASVRDDP TAHVPNWSRE
ISVIDLLKGK DGHVYRGRYK AIPPISRSHK HTPSESDLPP LHKDPVMPGP RWNSEPQLSR
GGDEFQPLES FAPRQGETID TNESQVTTPP PTTNQPSVLP PREKQGSPLR VQGQLIAVPS
PSPPAAQKNE HQHRIEREEF LQSQQPPRLE GEPQGRISPG QRLFSPRHAR NGSPCAQQIS
PTSQMQHQPG QRLGGDVVKV TDEPVDGRLS VDSIDEVVE
//