ID A0A1Q9EU00_SYMMI Unreviewed; 1077 AA.
AC A0A1Q9EU00;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE SubName: Full=Pepsin A {ECO:0000313|EMBL:OLQ10902.1};
GN Name=PGA {ECO:0000313|EMBL:OLQ10902.1};
GN ORFNames=AK812_SmicGene5323 {ECO:0000313|EMBL:OLQ10902.1};
OS Symbiodinium microadriaticum (Dinoflagellate) (Zooxanthella
OS microadriatica).
OC Eukaryota; Sar; Alveolata; Dinophyceae; Suessiales; Symbiodiniaceae;
OC Symbiodinium.
OX NCBI_TaxID=2951 {ECO:0000313|EMBL:OLQ10902.1, ECO:0000313|Proteomes:UP000186817};
RN [1] {ECO:0000313|EMBL:OLQ10902.1, ECO:0000313|Proteomes:UP000186817}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP2467 {ECO:0000313|EMBL:OLQ10902.1,
RC ECO:0000313|Proteomes:UP000186817};
RA Aranda M., Li Y., Liew Y.J., Baumgarten S., Simakov O., Wilson M., Piel J.,
RA Ashoor H., Bougouffa S., Bajic V.B., Ryu T., Ravasi T., Bayer T.,
RA Micklem G., Kim H., Bhak J., Lajeunesse T.C., Voolstra C.R.;
RT "Genome analysis of coral dinoflagellate symbionts highlights evolutionary
RT adaptations to a symbiotic lifestyle.";
RL Submitted (FEB-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- PATHWAY: Protein modification; protein ubiquitination.
CC {ECO:0000256|ARBA:ARBA00004906}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OLQ10902.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSRX01000069; OLQ10902.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Q9EU00; -.
DR OrthoDB; 2419066at2759; -.
DR Proteomes; UP000186817; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR033121; PEPTIDASE_A1.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR001841; Znf_RING.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR InterPro; IPR024766; Znf_RING_H2.
DR PANTHER; PTHR45676; RING-H2 FINGER PROTEIN ATL51-RELATED; 1.
DR PANTHER; PTHR45676:SF171; RING-TYPE E3 UBIQUITIN TRANSFERASE; 1.
DR Pfam; PF00026; Asp; 1.
DR Pfam; PF12678; zf-rbx1; 1.
DR SMART; SM00184; RING; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF57850; RING/U-box; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS50089; ZF_RING_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00175};
KW Reference proteome {ECO:0000313|Proteomes:UP000186817};
KW Signal {ECO:0000256|SAM:SignalP};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00175};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00175}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..1077
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012593251"
FT DOMAIN 741..789
FT /note="RING-type"
FT /evidence="ECO:0000259|PROSITE:PS50089"
SQ SEQUENCE 1077 AA; 118715 MW; 1097A45C771ED795 CRC64;
MRRISLASAF LQYASAMSCV EFVTYSLLPD GSCCFQRQIF EDLRTYTWDL FNGSSLSAPP
WARAPQRKLL LSADHMEGRE GLEGLGGCVA LALAFLRLLE AQLALASALP LFASESPVAK
QLSRGTVEEG ARSLRTSMKE IRGSRERGDW PWQRVLRHAR RLLWVLDRSI EFPDFLRHVG
WLLTRLGTKS LLLPPARQSM QHHVPVVQLT SLQLHRALHS CLTALGRAQA QYFPSGGTLI
GILREGELHG KQSPGRVHVL DRDLEWFVRA DTPERWLELA QNISTSLISQ GWFRCELIGH
GANVWVGEKL LMRCVRAATE PYIAKAEFHA FSVMNRQSLL TGTCECKHLV SFGSLNLNMQ
GPCFCPGQSF AAKNVLPLRR CRVRKRSVPC PRRPVGFLAE LYGNERCFPL PRAPAPPRHL
PSHPRHGQRN FSDAALRRYA RFGNAAASHF IHGHPRCCLH SLDAHSQTRS FHKLWSDMLV
LCDTTEPSDP TQLNWAPVHQ PEDGYWQFAL AGLRIGNRTI SCGRNCRGIV DTSAAGLGLP
SSLHATLQST LGAAVCSGPD IYFDVKAADS TPAFSLKLTA KEYRSQDCQA MITATDLPEA
FADVLILGQP KNNSHILTEA QAPSKVGISH ALKDEGGVEL PILSTLEAEA LEEEVLRREV
LAADPEFTVG GMSMQAHALS VLLLQVLIMQ ESLRARLLFS PLGVALQCWF GVSLPPNSQL
SNWLIAIRPV PAAEAPEASE CVVCLGVREE DLKPGCCRPK WCKLRCGHAF HQECISQWLP
KVAQCPVCRS NIFDGKRTFD AHNPAVTSSA SRTLRGGALA VDQDFGELRH LWADLKKMIS
EGRPSIGRRL DWLSQLSGAA VKVVQPGPTK SMDAERELAD SFGRSAWNAV AKETLGAELS
EYIGGAGDAG TPRTCVDSLP ALVEVGSTTE SEEGWDSGSA DGHDVFRQLN ELNWRQLDMK
QEWRTTLRLR NLTWRLCDED TLRSFLDRSG LLECIEQFRV KPGAGHRAGS ALISVKSVSH
VSRVAKFFHG RQFPGARSPV AVSFASCRLV RRDRSSPRTL SWEGGLVAAS GMATAAG
//