ID F6YED9_HORSE Unreviewed; 1575 AA.
AC F6YED9;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 4.
DT 27-MAR-2024, entry version 80.
DE SubName: Full=Neurexin 2 {ECO:0000313|Ensembl:ENSECAP00000019043.4};
GN Name=NRXN2 {ECO:0000313|Ensembl:ENSECAP00000019043.4,
GN ECO:0000313|VGNC:VGNC:20896};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000019043.4, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000019043.4, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000019043.4,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000019043.4}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000019043.4};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSECAT00000023015.4; ENSECAP00000019043.4; ENSECAG00000021096.4.
DR VGNC; VGNC:20896; NRXN2.
DR GeneTree; ENSGT00940000155978; -.
DR HOGENOM; CLU_001710_0_1_1; -.
DR TreeFam; TF321302; -.
DR Proteomes; UP000002281; Chromosome 12.
DR Bgee; ENSECAG00000021096; Expressed in prefrontal cortex and 8 other cell types or tissues.
DR ExpressionAtlas; F6YED9; baseline.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00110; LamG; 5.
DR Gene3D; 2.60.120.200; -; 6.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR003585; Neurexin-like.
DR PANTHER; PTHR15036:SF49; AXOTACTIN; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF02210; Laminin_G_2; 5.
DR SMART; SM00294; 4.1m; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00282; LamG; 5.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 6.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS50025; LAM_G_DOMAIN; 6.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1501..1521
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1..93
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 89..129
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 176..358
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 365..558
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 562..599
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 604..767
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 781..956
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 959..996
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1000..1208
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REGION 1325..1347
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1388..1489
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1542..1575
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1438..1464
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1559..1575
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1575 AA; 170047 MW; 20F45B3D2116824A CRC64;
MVLLTRDARR TALAVDGEAR AAEVRSKRRE MQVASDLFVG GIPPDVRLSA LTLSTVKYEP
PFRGLLANLK LGERPPALLG SQGLRGAAAD PLCAPARNPC ANGGLCTVLA PGEVGCDCSH
TGFGGKFCSE EEHPMEGPAH LTLNSEVGSL LFSEGGAGRG GAGDVHQPTK GKEEFVATFK
GNEFFCYDLS HNPIQSSTDE ITLAFRTLQR NGLMLHTGKS ADYVNLSLKS GAVWLVINLG
SGAFEALVEP VNGKFNDNAW HDVRVTRNLR QVTISVDGIL TTTGYTQEDY TMLGSDDFFY
IGGSPNTADL PGSPVSNNFM GCLKDVVYKN NDFKLELSRL AKEGDPKMKL QGDLSFRCED
VAALDPVTFE SPEAFVALPR WSAKRTGSIS LDFRTTEPNG LLLFSQGRRA GAGAGSHSSA
QRADYFAMEL LDGYLYLLLD MGSGGIKLRA SSRKVNDGEW CHVDFQRDGR KGSISVNSRS
TPFLATGESE ILDLESELYL GGLPEGGRVD LPLPPEVWTA ALRAGYVGCV RDLFIDGRSR
DLRGLAEAQG AVGVAPFCSR ETLKQCASAP CRNGGICREG WNRFVCDCIG TGFLGRVCER
EATVLSYDGS MYMKIMLPNA MHTEAEDVSL RFMSQRAYGL MMATTSRESA DTLRLELDGG
QMKLTVNLGK GPETLFAGHK LNDNEWHTVR VVRRGKSLQL SVDNVTVEGQ MAGAHTRLEF
HNIETGIMTE RRFISVVPSN FIGHLSGLMF NGQPYMDQCK DGDITYCELN ARFGLRAIVA
DPVTFKSRSS YLALATLQAY ASMHLFFQFK TTAPDGLLLF NSGNGNDFIV IELVKGYIHY
VFDLGNGPSL MKGNSDKPVN DNQWHNVVVS RDPGNVHTLK IDSRTVTQHS NGARNLDLKG
ELYIGGLSKN MFSNLPKLVA SRDGFQGCLA SVDLNGRLPD LLADALHRIG QVERGCDGPS
TTCTEESCAN QGVCLQQWDG FTCDCTMTSY GGPICNDPGT TYIFGKGGAL ITYTWPPNDR
PSTRMDRLAV GFSTHQRSAV LVRVDSASGL GDYLQLHIDQ GTVGVIFNVG TDDITIDEPN
AIVSDGKYHV VRFTRSGGNA TLQVDSWPVN ERYPAGNFDN ERLAIARQRI PYRLGRVVDE
WLLDKGRQLT IFNSQAAIKI GGRDQGRPFQ GQVSGLYYNG LKVLALAAES DPNVRTEGHL
RLVGEGPSVL LSAETTATTL LADMATTIME TTTTMATTTT RRGRSPTLRD STTQNTDDLL
VASAECPSDD EDLEECEPST GGELILPIIT EDSLDPPPVA TRSPFVPPPP TFYPFLTGVG
ATQDTLPPPA ARRPPAGGPC QAEQDDSDCE EPIEASGFAS GEVFDSSLPP TDDEDFYTTF
PLVTDRTTLL SPRKPAPRPN LRTDGATGAP GVLLAPSAPA PNLPAGKMNH RDPLQPLLEN
PPLGPGAPTS FEPRRPPPLR PGVTAAPGFP HLPTANPTGP GERGPPGAVE VIRESSSTTG
MVVGIVAAAA LCILILLYAM YKYRNRDEGS YQVDQSRNYI SNSAQSNGAV VKEKAPAAPK
TPSKGKKNKD KEYYV
//