ID A0A3R7NMZ6_PENVA Unreviewed; 2308 AA.
AC A0A3R7NMZ6;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE RecName: Full=SHSP domain-containing protein {ECO:0000259|PROSITE:PS01031};
GN ORFNames=C7M84_019848 {ECO:0000313|EMBL:ROT62303.1};
OS Penaeus vannamei (Whiteleg shrimp) (Litopenaeus vannamei).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Multicrustacea;
OC Malacostraca; Eumalacostraca; Eucarida; Decapoda; Dendrobranchiata;
OC Penaeoidea; Penaeidae; Penaeus.
OX NCBI_TaxID=6689 {ECO:0000313|EMBL:ROT62303.1, ECO:0000313|Proteomes:UP000283509};
RN [1] {ECO:0000313|EMBL:ROT62303.1, ECO:0000313|Proteomes:UP000283509}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Muscle {ECO:0000313|EMBL:ROT62303.1};
RA Zhang X., Yuan J., Li F., Xiang J.;
RL Submitted (APR-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ROT62303.1, ECO:0000313|Proteomes:UP000283509}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Muscle {ECO:0000313|EMBL:ROT62303.1};
RA Sun Y., Gao Y., Yu Y.;
RT "The decoding of complex shrimp genome reveals the adaptation for benthos
RT swimmer, frequently molting mechanism and breeding impact on genome.";
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the small heat shock protein (HSP20) family.
CC {ECO:0000256|PROSITE-ProRule:PRU00285, ECO:0000256|RuleBase:RU003616}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ROT62303.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QCYY01003730; ROT62303.1; -; Genomic_DNA.
DR STRING; 6689.A0A3R7NMZ6; -.
DR Proteomes; UP000283509; Unassembled WGS sequence.
DR CDD; cd06526; metazoan_ACD; 5.
DR Gene3D; 2.60.40.790; -; 7.
DR InterPro; IPR002068; A-crystallin/Hsp20_dom.
DR InterPro; IPR001436; Alpha-crystallin/sHSP_animal.
DR InterPro; IPR008978; HSP20-like_chaperone.
DR PANTHER; PTHR45640:SF13; HEAT SHOCK PROTEIN 22-RELATED; 1.
DR PANTHER; PTHR45640; HEAT SHOCK PROTEIN HSP-12.2-RELATED; 1.
DR Pfam; PF00011; HSP20; 6.
DR SUPFAM; SSF49764; HSP20-like chaperones; 6.
DR PROSITE; PS01031; SHSP; 6.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000283509};
KW Stress response {ECO:0000256|ARBA:ARBA00023016}.
FT DOMAIN 88..196
FT /note="SHSP"
FT /evidence="ECO:0000259|PROSITE:PS01031"
FT DOMAIN 254..362
FT /note="SHSP"
FT /evidence="ECO:0000259|PROSITE:PS01031"
FT DOMAIN 426..534
FT /note="SHSP"
FT /evidence="ECO:0000259|PROSITE:PS01031"
FT DOMAIN 587..695
FT /note="SHSP"
FT /evidence="ECO:0000259|PROSITE:PS01031"
FT DOMAIN 762..871
FT /note="SHSP"
FT /evidence="ECO:0000259|PROSITE:PS01031"
FT DOMAIN 952..1058
FT /note="SHSP"
FT /evidence="ECO:0000259|PROSITE:PS01031"
FT REGION 1..34
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 369..397
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1491..1515
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1583..1606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1797..1817
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1916..1936
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1970..1993
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2039..2061
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2136..2235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1802..1817
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2308 AA; 248301 MW; 3F3AC0E6E281D062 CRC64;
MSSVQKQTTT RTVKQQSSET SGVTQTSTTT SRQTNRVQIT QRGLFFDDAC FEDTRNDFRN
AISDVVSRFG DKSSNIDELT QYRSLRSREI RDENQAITSS QDDSKYKIVV DVQDFVNGGE
INVKVVEERE VVVEGRVQKV EGNRTSSKSF QRSFFLPEDV DVDTLASVMS ADGVLTITAI
RRTVTQQLTQ VPVSIQRVEN RQVTTTTSTT GVQEVPVPVQ PVQDTNQTVS VQKKMNIQTA
NTDEISTYRN LRQRELREET QAATVNEDQT THKVVVDVQD FINGGEVNVK VVNEREIVIE
GHVEKKEGNT TSTKRFRKRY VLPEDIEVEK VTSVMSSDGV LTITTPKKPS ALPLSEVPVS
IQKIENKKTT TTTTTTGVQE TPAPVQPAQP IQPSQDTDQT VSVQKKMNIQ TANTDEISTY
RNLRQRELRE ETQAATVNED QTTHKVVVDV QDFINGGEVN VKVVNEREIV IEGHVEKKEG
NTTSTKRFRK RYVLPEDIEV EKVTSVMSSD GVLTIKTPKK HLSHFRSRPL IAKQRGRERS
VFRRPGDRCR QADHQTHTRP HHEVLQKMNI QTANTDEIST YRNLRQRELR EETQAATVNE
DQTTHKVVVD VQDFINGGEV NVKVVNEREI VIEGHVEKKE GNTTSTKRFR KRYVLPEDIE
VEKVTSVMSS DGVLTIKTPK KPSSQPLPEV PVSIQKIENK KTTTTTTTTG VQEIPAPVQP
AQPAQPIQPS QDTDQTVSVQ KKMNIQTANT DEISTYRNLR QRELREETQA ATVNEDQTTH
KIVVDVQDFV NGGEVIVKIV NEREVIIEGH VEKMEGNKTS IKRFRKRYVL PENIEVEKVT
SVMSSDGVLT ITTPKKPSSQ PLTQVTPVSI QKIENKKQET ITQETKTTTD SSVTDVPSRL
LTIIKKGNFF SDTFFEDCRQ NFQTAVIQIL KKLNIETSTV DVMTTYRTVR QRELREETQA
ITEKESEQVR KFVVDVQDFA NGGEVTVKVV QEREVVVEGR VDRQEGGVTS TKRCKKTFIL
SENIQVNSVT AVMSADGILT ITAPKKVTQQ QQVTVEKTTD TKKEVTKEAV VQKPGEDRVL
PVNKKGSFLS DPYFQDSRQD FQSSVTEVVM KSTEKDSKED HMTIYRRLRQ KTIKLENQAV
SVKEDAQFQK IVLDVLDFMN GEVKVMITSE NILVVEGRGT RQEGVKTSSH SFLRRFIVPD
NFQGPGKRRR AKIPCDMNVS AAMRVSPRQL CEDALTPGYY GLAVYKQPDS LLHVGSPRWH
FAIPCPPPTP PGSPVAPPGS PCRPPWLPLS PSWLRPVALL AHPAALWPPS VQPLSGSSPS
SAALPAASWL LPPSAALLAP PPLCRALLVP PSPPPTAFLA PPPPTAFLAP PAAPPGPLSS
VVTFLASPPA ALLATPAPVR PPGSPCPPSW LPLSPLTGSP CRPPASPVAL LASPAALLAP
SPASSSPSLC CRPPWLLPPA ASLASSPLCP PGSSPLPPSW LLPPLCRPPG SSPPNRLPGS
PPYRLPGSPP CRPPGSPCRP PGSLVTFLAP PCRPPGYPRP CRLLAPPAAL LAPPVALLAP
PAALLGLWSP SWLPPAALLT PPAPPPSGSP RPCLLAPRRP SGSSPPAPWL PPCRPSWLPC
RPPAPPVALL APLSPSWLPL SPSWLPLPPS WLPLPPPALA PSWLPPAALP GSPCRLWLPC
RPPGSPCRPP GPLPPSCLCS PSWLPLPPSW LPLSPSWPPC PPSWLPPAAI LAPPRLLGLC
HLSWLPPCRP PGSPPAALLA PLSALLAPLS PSLAPPCRPP GPPVALLAPP AALLVSRSSG
SPCPPPGSPP APAAPPPAAL LAPPAALLAP PPLPPSWLPP CRPPGSPPLL PPSWLLPPPC
CRPPGSPLRP PAPPLSPSWL PPCHPPGSPV ALLAPPLPPS WLPHAALLAP PPPRPPSWLP
CRPSGPPPCR SPGSPPAALL APPPLCPLLA PPPPLPPSWL PPSLPLPPPP PVALLAPPLP
PSGSPPPRPP SWLPLPPSWP PAALWLSPPA ALLAPPVRPP GSPCPALLAP PLPPSWLPPC
RPPGSPPARP PGSPPAALLA PPCALLAPPP PSWLPPLPPS WLPPAALLLP LPTSWFPPLP
PSWLLPCRPP APPPAPSWPP PAASWLPPAA LLPCRPPGSP LPPSLAPCRP PGSPRPPAPP
LPPPAPPLPP SCPPPPPSAS PPPFLAPPPA PPLPPSGSPP CRPLPPPPAA LLAPPLPPSL
LPPPLPPPAP PPPAALLAPP AASCLPCRPP APPPLPPSWL PPCRPPWLPP PLPPSCPPPP
RPCPPLLPSC PPPAALLAPP LPREPSVR
//