ID I3KQD8_ORENI Unreviewed; 461 AA.
AC I3KQD8;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 11-JUL-2012, sequence version 1.
DT 08-NOV-2023, entry version 67.
DE SubName: Full=Netrin-G1 {ECO:0000313|Ensembl:ENSONIP00000023333.1};
GN Name=LOC100691642 {ECO:0000313|Ensembl:ENSONIP00000023333.1};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000023333.1, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000023333.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_005476685.1; XM_005476628.3.
DR RefSeq; XP_005476686.1; XM_005476629.3.
DR AlphaFoldDB; I3KQD8; -.
DR STRING; 8128.ENSONIP00000023333; -.
DR Ensembl; ENSONIT00000023353.2; ENSONIP00000023333.1; ENSONIG00000018534.2.
DR GeneID; 100691642; -.
DR KEGG; onl:100691642; -.
DR eggNOG; KOG3512; Eukaryota.
DR GeneTree; ENSGT00940000153601; -.
DR HOGENOM; CLU_039838_1_1_1; -.
DR InParanoid; I3KQD8; -.
DR OMA; SHAMQHY; -.
DR OrthoDB; 2916807at2759; -.
DR TreeFam; TF333945; -.
DR Proteomes; UP000005207; Linkage group LG18.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00055; EGF_Lam; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR008211; Laminin_N.
DR InterPro; IPR002049; LE_dom.
DR PANTHER; PTHR10574:SF28; NETRIN-G1; 1.
DR PANTHER; PTHR10574; NETRIN/LAMININ-RELATED; 1.
DR Pfam; PF00053; Laminin_EGF; 1.
DR Pfam; PF00055; Laminin_N; 1.
DR SMART; SM00180; EGF_Lam; 1.
DR SMART; SM00136; LamNT; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS51117; LAMININ_NTER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..461
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003674831"
FT DOMAIN 50..301
FT /note="Laminin N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51117"
FT DOMAIN 382..417
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 407..416
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 461 AA; 52038 MW; 07358F8B56C87875 CRC64;
MRSAMLLPVL FALQALCTSV GHAMQHYPAV WGHYDVCKSQ IYTEEGLAWD YMACQPEATD
MTEYLRVTLD PPNITCGDPP ETYCALENPY MCNNECDAAT EELAHPPELM FDIEGRNPTT
FWQSTSWKKY PKPLQVNITL SWNKTIELTD DIVLTFESGR PEQMVLEKSL DYGRTWTPYQ
FYATDCLDAF TMEPKTANDL TQQTLLDIIC TEDYSRGYVW KNDKTVRFEI KDRFALFAGP
RLHNMASLYG QLDTTKNLRD FFTITDLRIR LLKPATGATM VDENNLSRYF YAISDIKVQG
RCKCNLHANS CVFDKGKLGC ECEHNTTGPD CSRCKKHYHG RAWSVGSYLP IPKGTANICI
PSNHGPVHRA NASSLGVANR NQARVCDNAM LRCQNGGTCH HHQRCHCSPG FTGILCERAR
CQGPGDCDDQ LSGQASLHHR PIGHQRTLTL VVFPLLFVSL C
//