ID A0A158PPP0_ANISI Unreviewed; 1099 AA.
AC A0A158PPP0;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Neurexin-1a {ECO:0000313|WBParaSite:ASIM_0001517901-mRNA-1};
GN ORFNames=ASIM_LOCUS14589 {ECO:0000313|EMBL:VDK52841.1};
OS Anisakis simplex (Herring worm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Spirurina; Ascaridomorpha; Ascaridoidea; Anisakidae; Anisakis;
OC Anisakis simplex complex.
OX NCBI_TaxID=6269 {ECO:0000313|Proteomes:UP000036680, ECO:0000313|WBParaSite:ASIM_0001517901-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:ASIM_0001517901-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (APR-2016) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDK52841.1, ECO:0000313|Proteomes:UP000267096}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UYRR01031827; VDK52841.1; -; Genomic_DNA.
DR WBParaSite; ASIM_0001517901-mRNA-1; ASIM_0001517901-mRNA-1; ASIM_0001517901.
DR Proteomes; UP000036680; Unplaced.
DR Proteomes; UP000267096; Unassembled WGS sequence.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00110; LamG; 5.
DR Gene3D; 2.60.120.200; -; 5.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR15036:SF89; NEUREXIN 1, ISOFORM F; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF02210; Laminin_G_2; 5.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00282; LamG; 5.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 5.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50025; LAM_G_DOMAIN; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00122}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000267096}.
FT DOMAIN 26..221
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 217..262
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 289..485
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 493..686
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 691..728
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 930..1096
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DISULFID 1069..1096
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00122"
SQ SEQUENCE 1099 AA; 123454 MW; F8AAF3B17431D097 CRC64;
MIGYCVRPEN VQIFRKFFLL TRRLCQVPLN TLHRFAFRYP KWSHTFENQL SLEFRTRQSD
ALLLYTDDGG VQGNFYALTI ADGRLQLDFR LGDESNDLAS QRAVVTMRVD DIPVNDNRWH
QLTLFQAWEN VKLQLDDTVL FKILSQQSFV FGNLKTCSDV FVGGVPKDIH MLAAMSSPLK
RHTKTFAGTI KNLVYRLYPQ GVTSPQLLES VGMRQSDDDY CKPSAVGGNK EQYCKNDGVC
YSTNDGPKCD CSLSDFDGRR CEQGIEHMFT SSYNDHICMA DFSVRLDAEL SFFGNEWLGY
DVSNNSAATI RSRFENISFA FKTIQGRQTL FFSGDQLVVS FFRVQNYVYV TIDDGSLVAT
SKFDDTEKRL IRIFNEYPSG RYDDDQWHMV TVTRTLTLMT LIVDGRKDEI RQYAPEIDWL
KNSYAFVGGI PLEKQYEDLD KPNFRGCMKK VKFEADAHLI NFISLADQGY GQSVIRSAGD
LAFSCRKPTV PPDILSFNSG QHYITLPKWN SLGSGSIGFQ LRTYELDGLI LYHGSKSITN
DSSDYIAFEL IDGHLFLIIN LGSGHVRLQT TASKITEGTV WHSVTLERMG RTGTVIVDNI
RTDFSTPGVS ANLIIDEPIY VGAVPWPAND STPSSFRFPS TVWTANLRQG YVGCLKNVRL
NGISANIANV YEEQKTLVEA GISQGCPNTL NNDYCASSPC KNYGRCEIGY ATFRCDCSNS
HMEGPTCNIE PEVVELTEKG GELPPFHLPN PLHSESETIE CKFRTDDDKG IIFDSKSTAS
PSHRILIAII RGELELHLNF GVTQHTFNWG SGLNDDRFHS IRVKRRGEKL LLFLDGKWEH
SYFLPSSNIV LQIDQIAAGH SLHPTGLSDF VPPRNETNDE NFSGQMIKLT FNGYDVLKKA
KRRNGNFAAS SKSSEVRDGQ KSRNRKAKYS AVSFETNKGR VVFADSRIST IEGPYRISLK
FRTLAPSCII LVVTSNSTYS FGFGSRFESV LSPVLRRKQT LSDMRWHSLL IYQNAINGEH
HMIIDNTSTV MDTVGGHMAK LEGQLYLGGV PPLTPLSPRL TNVVGFRGCI SSLRIGDEHL
DAFEDAYEIV GVTKGCTGQ
//