GenomeNet

Database: UniProt
Entry: G1SE08_RABIT
LinkDB: G1SE08_RABIT
Original site: G1SE08_RABIT 
ID   G1SE08_RABIT            Unreviewed;       763 AA.
AC   G1SE08;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   11-DEC-2019, sequence version 2.
DT   27-MAR-2024, entry version 61.
DE   SubName: Full=Neural EGFL like 1 {ECO:0000313|Ensembl:ENSOCUP00000000663.3};
GN   Name=NELL1 {ECO:0000313|Ensembl:ENSOCUP00000000663.3};
OS   Oryctolagus cuniculus (Rabbit).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; Oryctolagus.
OX   NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000000663.3, ECO:0000313|Proteomes:UP000001811};
RN   [1] {ECO:0000313|Ensembl:ENSOCUP00000000663.3, ECO:0000313|Proteomes:UP000001811}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Thorbecke inbred {ECO:0000313|Ensembl:ENSOCUP00000000663.3,
RC   ECO:0000313|Proteomes:UP000001811};
RX   PubMed=21993624; DOI=10.1038/nature10530;
RA   Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA   Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA   Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA   Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA   Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA   Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA   Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA   Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA   Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA   Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA   Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA   Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA   Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA   Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA   Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT   "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL   Nature 478:476-482(2011).
RN   [2] {ECO:0000313|Ensembl:ENSOCUP00000000663.3}
RP   IDENTIFICATION.
RC   STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000000663.3};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAGW02037210; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037211; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037212; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037213; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037214; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037215; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037216; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037217; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037218; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAGW02037219; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_002709032.1; XM_002708986.3.
DR   AlphaFoldDB; G1SE08; -.
DR   PaxDb; 9986-ENSOCUP00000000663; -.
DR   Ensembl; ENSOCUT00000000764.3; ENSOCUP00000000663.3; ENSOCUG00000000761.3.
DR   GeneID; 100342862; -.
DR   CTD; 4745; -.
DR   eggNOG; KOG1217; Eukaryota.
DR   GeneTree; ENSGT00810000125439; -.
DR   HOGENOM; CLU_006887_1_0_1; -.
DR   OrthoDB; 5487at2759; -.
DR   TreeFam; TF323325; -.
DR   Proteomes; UP000001811; Chromosome 1.
DR   Bgee; ENSOCUG00000000761; Expressed in brain and 7 other cell types or tissues.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   CDD; cd00054; EGF_CA; 3.
DR   CDD; cd00110; LamG; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 6.20.200.20; -; 2.
DR   Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR   Gene3D; 2.10.25.10; Laminin; 5.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR   InterPro; IPR018097; EGF_Ca-bd_CS.
DR   InterPro; IPR024731; EGF_dom.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR001007; VWF_dom.
DR   PANTHER; PTHR24042; NEL HOMOLOG; 1.
DR   PANTHER; PTHR24042:SF2; PROTEIN KINASE C-BINDING PROTEIN NELL1; 1.
DR   Pfam; PF12947; EGF_3; 1.
DR   Pfam; PF07645; EGF_CA; 2.
DR   Pfam; PF02210; Laminin_G_2; 1.
DR   Pfam; PF00093; VWC; 2.
DR   SMART; SM00181; EGF; 5.
DR   SMART; SM00179; EGF_CA; 3.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00214; VWC; 4.
DR   SMART; SM00215; VWC_out; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 4.
DR   SUPFAM; SSF57603; FnI-like domain; 3.
DR   PROSITE; PS00010; ASX_HYDROXYL; 2.
DR   PROSITE; PS00022; EGF_1; 1.
DR   PROSITE; PS01186; EGF_2; 3.
DR   PROSITE; PS50026; EGF_3; 4.
DR   PROSITE; PS01187; EGF_CA; 1.
DR   PROSITE; PS50025; LAM_G_DOMAIN; 1.
DR   PROSITE; PS01208; VWFC_1; 2.
DR   PROSITE; PS50184; VWFC_2; 3.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001811};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..763
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5023803103"
FT   DOMAIN          57..227
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DOMAIN          271..332
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          434..475
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          476..516
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          517..547
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          549..584
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          585..640
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          645..703
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DISULFID        519..529
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        537..546
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   763 AA;  84522 MW;  3DA050D31D5559A6 CRC64;
     MPTDVILVVW FCVCSARTVL GFGMDPDLQM DIITELDLVN TTLGVTQVSG LHNTSKAFLF
     QDTEREIHAA PHVSEKLIQL FRNKSEFTFL ATVQQKPSTS GVILSIRELE HSYFELESSG
     LRDEIRYHYI HNEKPRTEAL PYRMADGQWH KIALSVSASH LLLHVDCNKI YERVIDPPET
     NLPPGSNLWL GQRNQKHGFF KGIIQDGKII FMPNGYITQC PNLNRTCPTC SDFLSLVQGI
     MDLQELLAKM TAKLNYAETR LSQLENCHCE KTCQVSGLLY RDQDSWVDGD HCRNCTCRSG
     AVECRRMFCP PLNCSPDSLP VHITGQCCKV CRPKCIYGGK VLAEGQRILT KSCRECRGGV
     LVKITETCPP LNCSEKDHIL PENQCCSVCR GHNFCAEGPK CGENSECKNW NTKATCECKN
     GYISIQGDPA YCEDIDECAA KMHYCHANTV CINLPGLYRC DCVPGYIRVD DFSCTEHDEC
     GSGQHNCDEN AICTNTVQGH SCTCKPGYVG NGTICRAFCE EGCRYGGTCV APNKCVCPSG
     YTGSHCEKDI DECALKTHTC WNDSACINLA GGFDCLCPSG PSCSGDCPHE GGLKRNGQVW
     TLKEDRCSVC SCKDGKIFCR RTACDCQNPS VDLFCCPECD TRVTSQCLDQ NGHKLYRSGD
     NWTHSCQQCR CLDGEVDCWP LSCPNLSCEY TAVLEGECCP RCVSDPCLAD NITYDIRKTC
     LDSYGVSRLS GSVWTMAGSP CTTCKCKNGR VCCSVDLECL PNN
//
DBGET integrated database retrieval system