ID U3I864_ANAPP Unreviewed; 1497 AA.
AC U3I864;
DT 13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 2.
DT 27-MAR-2024, entry version 65.
DE RecName: Full=Neurexin-1 {ECO:0000256|ARBA:ARBA00044151};
DE AltName: Full=Neurexin I-alpha {ECO:0000256|ARBA:ARBA00044347};
DE AltName: Full=Neurexin-1-alpha {ECO:0000256|ARBA:ARBA00044281};
GN Name=NRXN1 {ECO:0000313|Ensembl:ENSAPLP00000003435.2};
OS Anas platyrhynchos platyrhynchos (Northern mallard).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC Anatinae; Anas.
OX NCBI_TaxID=8840 {ECO:0000313|Ensembl:ENSAPLP00000003435.2, ECO:0000313|Proteomes:UP000016666};
RN [1] {ECO:0000313|Ensembl:ENSAPLP00000003435.2, ECO:0000313|Proteomes:UP000016666}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Hou Z.-C., Zhou Z.-K., Zhu F., Hou S.-S.;
RT "A new Pekin duck reference genome.";
RL Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSAPLP00000003435.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004479}; Single-
CC pass type I membrane protein {ECO:0000256|ARBA:ARBA00004479}.
CC -!- SIMILARITY: Belongs to the neurexin family.
CC {ECO:0000256|ARBA:ARBA00010241}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSAPLT00000004036.2; ENSAPLP00000003435.2; ENSAPLG00000003929.2.
DR GeneTree; ENSGT00940000154292; -.
DR HOGENOM; CLU_001710_0_1_1; -.
DR Proteomes; UP000016666; Chromosome 3.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00110; LamG; 6.
DR Gene3D; 2.60.120.200; -; 6.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR003585; Neurexin-like.
DR InterPro; IPR027789; Syndecan/Neurexin_dom.
DR PANTHER; PTHR15036:SF51; NEUREXIN-1; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF02210; Laminin_G_2; 6.
DR Pfam; PF01034; Syndecan; 1.
DR SMART; SM00294; 4.1m; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00282; LamG; 6.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 6.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS50025; LAM_G_DOMAIN; 6.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000016666};
KW Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1497
FT /note="Neurexin-1"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019778327"
FT TRANSMEM 1423..1443
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 29..223
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 207..245
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 266..463
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 470..662
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 666..703
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 708..881
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 895..1070
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1073..1110
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1116..1314
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REGION 1378..1414
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1464..1497
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1464..1480
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1481..1497
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1497 AA; 165018 MW; 11C61D1E22B21DCE CRC64;
MGAALVRRGG CLLLWLALLL GCWAELGSGL EFPGAEGQWT RFPKWNACCE SEMSFNMKTR
SSSGLVLYFD DEGFCDFLEL ILTQGGRLQL SFSIFCAEPA TLLSDTAVND NLWHAVVIRR
HFKNTTLIID RAEAKWVEVK SKRRDMTVFS GLFLGGLPPE LRSATLKLTL SSVKDREPFK
GWITDVRVNY TQTSPVESQE VRLDDEQSRL CAREDVCLNG GVCSVLNDQA VCDCSQTGFR
GKDCSEGLAH LMMGDQGKSK GKEEYIATFK GSEYFCYDLS QNPIQSSSDE ITLSFKTLQR
NGLMLHTGKS ADYVNLALKN GAVSLVINLG SGAFEALVEP VNGKFNDNAW HDVKVTRNLR
QHSGIGHAMV NKLHCSVTIS VDGILTTTGY TQEDYTMLGS DDFFYVGGSP STADLPGSPV
SNNFMGCLKE VVYKNNDVRL ELSRLAKQGD PKMKIHGVVA FKCENVATLD PITFETPESF
ISLPKWNAKK TGSISFDFRT TEPNGLILFS HGKPRHQKDA KHPQMVKVDF FAIEMLDGHL
YLLLDMGSGT IKIKALQKKV NDGEWYHVDF QRDGRSGTIS VNTLRTPYTA PGESEILDLD
DDLYLGGLPE NKAGLVFPTE VWTALLNYGY VGCIRDLFID GQSKDIRQMA EIQSTAGVKP
SCSRETAKPC LSNPCKNNGV CRDGWNRYVC DCSGTGYLGR SCEREATILS YDGSMFMKIQ
LPVVMHTEAE DVSLRFRSQR AYGILMATTS RESADTLRLE LDAGRVKLTV NLDCIRINCN
SSKGPETLFA GYNLNDNEWH TVRVVRRGKS LKLMVDDQQA MTGQMAGDHT RLEFHNIETG
IITERRYLSS VPSNFIGHLQ SLTFNGMAYI DLCKNGDIDY CELNARFGFR NIIADPVTFK
TKASYVALAT LQAYTSMHLF FQFKTTSLDG LILYNSGDGN DFIVVELVKG YLHYVFDLGN
GANLIKGSSN KPLNDNQWHN VMISRDTNNL HTVKIDTKIT TQSTAGARNL DLKSDLYIGG
VAKEMYKSLP KLVHAKEGFQ GCLASVDLNG RLPDLISDAL FCNGQIERGC EGPSTTCQED
SCANQGVCLQ QWDGFSCDCS MTSFSGPLCN DPGTTYIFSK GGGQITYTWP PNDRPSTRAD
RLAIGFSTVQ KEAVLVRVDS STGLGDYLEL HIHQGKIGVK FNVGTDDISI EEINAIINDG
KYHVVRFTRS GGNATLQVDN WPVIERYPAG NNDNERLAIA RQRIPYRLGR VVDEWLLDKG
RQLTIFNSQA TIKIGGKERG HPFQGQLSGL YYNGLKVLNM AAENDANIVI EGNVRLVGEV
PSSMTTESTA TAMQSEMSTS VMETTTTLAT STARRGKAPT KEPIGQTTDD ILVASAECPS
DDEDIDPCEP SSGGLANPTR AGGGREYPGS SEVIRESSST TGMVVGIVAA AALCILILLY
AMYKYRNRDE GSYHVDESRN YISNSAQSNG AVIKEKQPNS AKSSNKNKKN KDKEYYV
//