ID H9G1Z9_MACMU Unreviewed; 1504 AA.
AC H9G1Z9;
DT 16-MAY-2012, integrated into UniProtKB/TrEMBL.
DT 16-MAY-2012, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE RecName: Full=Neurexin-1 {ECO:0000256|ARBA:ARBA00044151};
DE AltName: Full=Neurexin I-alpha {ECO:0000256|ARBA:ARBA00044347};
DE AltName: Full=Neurexin-1-alpha {ECO:0000256|ARBA:ARBA00044281};
GN Name=NRXN1 {ECO:0000313|EMBL:AFE80908.1};
OS Macaca mulatta (Rhesus macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFE80908.1};
RN [1] {ECO:0000313|EMBL:AFE80908.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Caudate {ECO:0000313|EMBL:AFE80908.1};
RX PubMed=25319552; DOI=10.1186/1745-6150-9-20;
RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., Pandey S.,
RA Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., Tharp G.K.,
RA Marcais G., Roberts M., Ferguson B., Fox H.S., Treangen T., Salzberg S.L.,
RA Yorke J.A., Norgren R.B.Jr.;
RT "A new rhesus macaque assembly and annotation for next-generation
RT sequencing analyses.";
RL Biol. Direct 9:20-20(2014).
CC -!- SUBCELLULAR LOCATION: Presynaptic cell membrane
CC {ECO:0000256|ARBA:ARBA00035005}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00035005}.
CC -!- SIMILARITY: Belongs to the neurexin family.
CC {ECO:0000256|ARBA:ARBA00010241}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JU337155; AFE80908.1; -; mRNA.
DR GO; GO:0042734; C:presynaptic membrane; IEA:UniProtKB-SubCell.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00110; LamG; 6.
DR Gene3D; 2.60.120.200; -; 6.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR003585; Neurexin-like.
DR PANTHER; PTHR15036:SF51; NEUREXIN-1; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF02210; Laminin_G_2; 6.
DR SMART; SM00294; 4.1m; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00282; LamG; 6.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 6.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS50025; LAM_G_DOMAIN; 6.
PE 2: Evidence at transcript level;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Heparan sulfate {ECO:0000256|ARBA:ARBA00023207};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..1504
FT /note="Neurexin-1"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003619598"
FT TRANSMEM 1430..1450
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 30..217
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 219..256
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 283..473
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 480..672
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 676..713
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 718..891
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 905..1080
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1083..1120
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1126..1324
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REGION 198..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1358..1417
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1471..1504
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1471..1487
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1488..1504
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1504 AA; 165231 MW; 3F2425D2E9171450 CRC64;
MGTALLQRGG CFLLCLSLLL LGCWAELGSG LEFPGAEGQW TRFPKWNACC ESEMSFQLKT
RSARGLVLYF DDEGFCDFLE LILTRGGRLQ LSFSIFCAEP ATLLADTPVN DGAWHSVRIR
RQFRNTTLFI DQVEAKWVEV KSKRRDMTVF SGLFVGGLPP ELRAAALKLT LASVREREPF
KGWIRDVRVN SSQVLPVDSG EVKLDDEPPN SGGGSPCEAG EEGEGGVCLN GGVCSVVDDQ
AVCDCSRTGF RGKDCSQEDN NVEGLAHLMM GDQGKSKGKE EYIATFKGSE YFCYDLSQNP
IQSSSDEITL SFKTLQRNGL MLHTGKSADY VNLALKNGAV SLVINLGSGA FEALVEPVNG
KFNDNAWHDV KVTRNLRQHS GIGHAMVTIS VDGILTTTGY TQEDYTMLGS DDFFYVGGSP
STADLPGSPV SNNFMGCLKE VVYKNNDVRL ELSRLAKQGD PKMKIHGVVA FKCENVATLD
PITFETPESF ISLPKWNAKK TGSISFDFRT TEPNGLILFS HGKPRHQKDA KHPQMIKVDF
FAIEMLDGHL YLLLDMGSGT IKIKALLKKV NDGEWYHVDF QRDGRSGTIS VNTLRTPYTA
PGESEILDLD DELYLGGLPE NKAGLVFPTE VWTALLNYGY VGCIRDLFID GQSKDIRQMA
EVQSTAGVKP SCSRETAKPC LSNPCKNNGM CRDGWNRYVC DCSGTGYLGR SCEREATVLS
YDGSMFMKIQ LPVVMHTEAE DVSLRFRSQR AYGILMATTS RDSADTLRLE LDAGRVKLTV
NLDCIRINCN SSKGPETLFA GYNLNDNEWH TVRVVRRGKS LKLTVDDQQA MTGQMAGDHT
RLEFHNIETG IITERRYLSS VPSNFIGHLQ SLTFNGMAYI DLCKNGDIDY CELNARFGFR
NIIADPVTFK TKSSYVALAT LQAYTSMHLF FQFKTTSLDG LILYNSGDGN DFIVVELVKG
YLHYVFDLGN GANLIKGSSN KPLNDNQWHN VMISRDTSNL HTVKIDTKIT TQITAGARNL
DLKSDLYIGG VAKETYKSLP KLVHAKEGFQ GCLASVDLNG RLPDLISDAL FCNGQIERGC
EGPSTTCQED SCSNQGVCLQ QWDGFSCDCS MTSFSGPLCN DPGTTYIFSK GGGQITYKWP
PNDRPSTRAD RLAIGFSTVQ KEAVLVRVDS SSGLGDYLEL HIHQGKIGVK FNVGTDDIAI
EESNAIINDG KYHVVRFTRS GGNATLQVDS WPVIERYPAG NNDNERLAIA RQRIPYRLGR
VVDEWLLDKG RQLTIFNSQA TIIIGGKEQG QPFQGQLSGL YYNGLKVLNM AAENDANIAI
VGNVRLVGEV PSSMTTESTA TAMQSEMSTS IMETTTTLAT STARRGKPPT KEPISQTTDD
ILVASAECPS DDEDIDPCEP SSANPTRAGG REPYPGSAEV IRESSSTTGM VVGIVAAAAL
CILILLYAMY KYRNRDEGSY HVDESRNYIS NSAQSNGAVV KEKQPSSAKS ANKNKKNKDK
EYYV
//