ID A0A3R7NP48_PENVA Unreviewed; 1031 AA.
AC A0A3R7NP48;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ROT63000.1};
GN ORFNames=C7M84_019121 {ECO:0000313|EMBL:ROT63000.1};
OS Penaeus vannamei (Whiteleg shrimp) (Litopenaeus vannamei).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Multicrustacea;
OC Malacostraca; Eumalacostraca; Eucarida; Decapoda; Dendrobranchiata;
OC Penaeoidea; Penaeidae; Penaeus.
OX NCBI_TaxID=6689 {ECO:0000313|EMBL:ROT63000.1, ECO:0000313|Proteomes:UP000283509};
RN [1] {ECO:0000313|EMBL:ROT63000.1, ECO:0000313|Proteomes:UP000283509}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Muscle {ECO:0000313|EMBL:ROT63000.1};
RA Zhang X., Yuan J., Li F., Xiang J.;
RL Submitted (APR-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ROT63000.1, ECO:0000313|Proteomes:UP000283509}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Muscle {ECO:0000313|EMBL:ROT63000.1};
RA Sun Y., Gao Y., Yu Y.;
RT "The decoding of complex shrimp genome reveals the adaptation for benthos
RT swimmer, frequently molting mechanism and breeding impact on genome.";
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ROT63000.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QCYY01003506; ROT63000.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3R7NP48; -.
DR STRING; 6689.A0A3R7NP48; -.
DR Proteomes; UP000283509; Unassembled WGS sequence.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF373; HEMOLECTIN, ISOFORM A; 1.
DR Pfam; PF08742; C8; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00832; C8; 1.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50923; SUSHI; 1.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000283509};
KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}.
FT DOMAIN 460..491
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 488..561
FT /note="Sushi"
FT /evidence="ECO:0000259|PROSITE:PS50923"
FT DOMAIN 590..622
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 733..927
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT REGION 62..93
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 119..179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 62..87
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 463..473
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 481..490
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 594..604
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 612..621
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1031 AA; 110054 MW; 98F1CCBEA293E592 CRC64;
MSLAVEESHS PIQLRVDTEG NIATTAPLGD VKPFACLVEA GVPRGSLTTA VSTQYPFDRN
PMFRASSSYN SRKGGPRTLS RSTDRSFSRF GGGPEAFLEA ATTERIEAEE DVMGFKFSGS
ANFGASRPSG GGRKGGPSKG GSKGGNPFAT FDRESAPGFP PIRNIGGAPG FSGASGPGFS
GASGPGFSGA SGPGFSGASG PGFSGANGPG FSGASGPGFS ASSSSATFGG SLGAPFNGPA
APAYSSPPGT FSGSLTQGGA KFGASFNPGF SGSTNYFGGN AGGAFPAPKT PFSAAFPGVS
GNINFDPNGG NIQFPPPSGQ FPPSSIFPSM TEEEGGPIVF PQGPGEGAAG LNLPFTIDDE
LAALLFGMQE ATTANGRQDE ADEEVEPPIL VRRPEVKQGC NSKPSAPANG RTDCSTYAGC
RSVCNPNYQF PSGERRLYLM CDNGEWKVTG SVWETLPDCE PRCSPPCENN GICLAPSVCQ
CPDNWEGDQC QIPKAPPAKK CSNKPPTPKN SRIFCSKDEC SARCHEGYHF AEGTARLSFQ
CEDGEWVIQD PRWTETPDCE RKSGLLWLGC GWVDEVVAGC VVAGVWMRGW RAACDPACEN
GGRCIAPDMC QCTVDFRGEL CQYRKYRLPL GCKSVTTSNC DPKKLGFKGG YNCSGEGMDF
GCAFWCPPEA DFEFPAEDFY KCDYATGLWS PSPIPNCDYG FLVVTPGAQD ISTGTYPPGF
VFTTTEYIRK SPGVCFTWSG SHYKSFDGRV YSFESSCPFT LLQDSTHGTF SVNLKSKDDC
TGSSCSKAIH LFLDDEEYVL QNTGEGEGRG SFFLECYLAL SESGQPSLEY HETNLAIPGQ
MNGVVSERVA HYVLLKVDAI GLTLKWDMKN TVITEVNELL WNRTSGLCGR RDGNKENDWG
YSDGRTEDNL SSFLQAWQSK TIGDQCLGQP MIKNPCGRRP VSSQATDFCS RLRDDPKFEP
CRQVVDVDPF VSACKWDYCG CESDDREGCA CESFSAFFRE CTSQGVDLPE GWRAPGLCRK
TRGWTLRVGW V
//