ID G1KZC1_ANOCA Unreviewed; 853 AA.
AC G1KZC1;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 3.
DT 27-MAR-2024, entry version 92.
DE SubName: Full=Neurexin 3 {ECO:0000313|Ensembl:ENSACAP00000021589.3};
GN Name=NRXN3 {ECO:0000313|Ensembl:ENSACAP00000021589.3};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000021589.3, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000021589.3, ECO:0000313|Proteomes:UP000001646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000021589.3,
RC ECO:0000313|Proteomes:UP000001646};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000021589.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G1KZC1; -.
DR STRING; 28377.ENSACAP00000021589; -.
DR Ensembl; ENSACAT00000017402.4; ENSACAP00000021589.3; ENSACAG00000017328.4.
DR eggNOG; KOG3514; Eukaryota.
DR GeneTree; ENSGT00940000154618; -.
DR HOGENOM; CLU_001710_0_1_1; -.
DR TreeFam; TF321302; -.
DR Proteomes; UP000001646; Chromosome 1.
DR Bgee; ENSACAG00000017328; Expressed in testis and 3 other cell types or tissues.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00110; LamG; 4.
DR Gene3D; 2.60.120.200; -; 4.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR PANTHER; PTHR15036:SF85; SP2353, ISOFORM A; 1.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF02210; Laminin_G_2; 4.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00282; LamG; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 4.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50025; LAM_G_DOMAIN; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000001646}.
FT DOMAIN 6..203
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 210..402
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 406..443
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 448..620
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 634..809
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 812..849
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
SQ SEQUENCE 853 AA; 94916 MW; 806F14B31915FA16 CRC64;
AREENVATFR GSEYLCYDLS QNPIQSSSDE ITLSFKTWQR NGLILHTGKS ADYVNLALKD
GAVSLIINLG SGAFEAIVEP VSGKFNDNQW HDVKVTRNLR QHSGIGHAMV NKLHCLVTIS
VDGILTTTGY TQEDYTMLGS DDFFYVGGSP STADLPGSPI SNNFMGCLKE VVYKNNDIRL
ELSRLARIGD TKMKIYGEVK FTCENVATLD PINFETPEAY ISLPKWNTKR MGSISFDFRT
TEPNGLILFT HGKPQERKDS RSQKNTKVDF FAVELLDGNL YLLLDMGSGT VKVKATQRKA
NDGEWYHVDI QRDGRSGTIS VNSRRTPFTA SGESEILDLE GDMYLGGLPE NRAGLVLPTE
LWTAMLNYGY VGCIRDLFID GRSKNIRQLA EQQNAAGVKS SCIRVNSKQC DSYPCKNNAV
CKDGWNRFIC DCTGTGYWGR TCEREASILS YDGSMYMKVV MPMVMHTEAE DVSFRFMSQR
AYGLLMATTS RESADTLRLE LDGGRVKLMV NLDCIRINCN ASKGPETLYA GQKLNDNEWH
TVRVVRRGKS LKLTVDDDVA EGTMVGDHTR LEFHNIETGI MTEKRYISVI PSSFIGHLQS
LMFNGLLYID LCKNGDIDYC ELKARFGLRN IIADPVTFKT KSSYLSLATL QAYTSMHLFF
QFKTTSADGF ILFNSGDGND FIAVELVKGY IHYVFDLGNG PNVIKGNSDR PLNDNQWHNV
VITRDNSNTH SLKVDTRIVT QVINGAKNLD LKGDLYIAGL AQGMYSNLPK LVASRDGFQG
CLASVDLNGR LPDLINDALH RSGQIERGCE GPSTTCQEDS CANQGICIQQ WEGFTCDCSM
TSYSGNQCND RES
//