ID G1SE08_RABIT Unreviewed; 763 AA.
AC G1SE08;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2019, sequence version 2.
DT 27-MAR-2024, entry version 61.
DE SubName: Full=Neural EGFL like 1 {ECO:0000313|Ensembl:ENSOCUP00000000663.3};
GN Name=NELL1 {ECO:0000313|Ensembl:ENSOCUP00000000663.3};
OS Oryctolagus cuniculus (Rabbit).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; Oryctolagus.
OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000000663.3, ECO:0000313|Proteomes:UP000001811};
RN [1] {ECO:0000313|Ensembl:ENSOCUP00000000663.3, ECO:0000313|Proteomes:UP000001811}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thorbecke inbred {ECO:0000313|Ensembl:ENSOCUP00000000663.3,
RC ECO:0000313|Proteomes:UP000001811};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSOCUP00000000663.3}
RP IDENTIFICATION.
RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000000663.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAGW02037210; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037211; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037212; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037213; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037214; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037215; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037216; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037217; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037218; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAGW02037219; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_002709032.1; XM_002708986.3.
DR AlphaFoldDB; G1SE08; -.
DR PaxDb; 9986-ENSOCUP00000000663; -.
DR Ensembl; ENSOCUT00000000764.3; ENSOCUP00000000663.3; ENSOCUG00000000761.3.
DR GeneID; 100342862; -.
DR CTD; 4745; -.
DR eggNOG; KOG1217; Eukaryota.
DR GeneTree; ENSGT00810000125439; -.
DR HOGENOM; CLU_006887_1_0_1; -.
DR OrthoDB; 5487at2759; -.
DR TreeFam; TF323325; -.
DR Proteomes; UP000001811; Chromosome 1.
DR Bgee; ENSOCUG00000000761; Expressed in brain and 7 other cell types or tissues.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 3.
DR CDD; cd00110; LamG; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 6.20.200.20; -; 2.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR Gene3D; 2.10.25.10; Laminin; 5.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24042; NEL HOMOLOG; 1.
DR PANTHER; PTHR24042:SF2; PROTEIN KINASE C-BINDING PROTEIN NELL1; 1.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF02210; Laminin_G_2; 1.
DR Pfam; PF00093; VWC; 2.
DR SMART; SM00181; EGF; 5.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00214; VWC; 4.
DR SMART; SM00215; VWC_out; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 4.
DR SUPFAM; SSF57603; FnI-like domain; 3.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50025; LAM_G_DOMAIN; 1.
DR PROSITE; PS01208; VWFC_1; 2.
DR PROSITE; PS50184; VWFC_2; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000001811};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..763
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5023803103"
FT DOMAIN 57..227
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 271..332
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 434..475
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 476..516
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 517..547
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 549..584
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 585..640
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 645..703
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DISULFID 519..529
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 537..546
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 763 AA; 84522 MW; 3DA050D31D5559A6 CRC64;
MPTDVILVVW FCVCSARTVL GFGMDPDLQM DIITELDLVN TTLGVTQVSG LHNTSKAFLF
QDTEREIHAA PHVSEKLIQL FRNKSEFTFL ATVQQKPSTS GVILSIRELE HSYFELESSG
LRDEIRYHYI HNEKPRTEAL PYRMADGQWH KIALSVSASH LLLHVDCNKI YERVIDPPET
NLPPGSNLWL GQRNQKHGFF KGIIQDGKII FMPNGYITQC PNLNRTCPTC SDFLSLVQGI
MDLQELLAKM TAKLNYAETR LSQLENCHCE KTCQVSGLLY RDQDSWVDGD HCRNCTCRSG
AVECRRMFCP PLNCSPDSLP VHITGQCCKV CRPKCIYGGK VLAEGQRILT KSCRECRGGV
LVKITETCPP LNCSEKDHIL PENQCCSVCR GHNFCAEGPK CGENSECKNW NTKATCECKN
GYISIQGDPA YCEDIDECAA KMHYCHANTV CINLPGLYRC DCVPGYIRVD DFSCTEHDEC
GSGQHNCDEN AICTNTVQGH SCTCKPGYVG NGTICRAFCE EGCRYGGTCV APNKCVCPSG
YTGSHCEKDI DECALKTHTC WNDSACINLA GGFDCLCPSG PSCSGDCPHE GGLKRNGQVW
TLKEDRCSVC SCKDGKIFCR RTACDCQNPS VDLFCCPECD TRVTSQCLDQ NGHKLYRSGD
NWTHSCQQCR CLDGEVDCWP LSCPNLSCEY TAVLEGECCP RCVSDPCLAD NITYDIRKTC
LDSYGVSRLS GSVWTMAGSP CTTCKCKNGR VCCSVDLECL PNN
//