ID W5QE04_SHEEP Unreviewed; 2704 AA.
AC W5QE04;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 66.
DE RecName: Full=Cadherin EGF LAG seven-pass G-type receptor 1 {ECO:0008006|Google:ProtNLM};
GN Name=CELSR1 {ECO:0000313|Ensembl:ENSOARP00000020952.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000020952.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000020952.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000020952.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000020952.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Receptor that may have an important role in cell/cell
CC signaling during nervous system formation.
CC {ECO:0000256|ARBA:ARBA00002066}.
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004651};
CC Multi-pass membrane protein {ECO:0000256|ARBA:ARBA00004651}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01084378; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084379; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084380; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084381; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084382; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084383; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084384; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084385; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084386; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01084387; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR SMR; W5QE04; -.
DR STRING; 9940.ENSOARP00000020952; -.
DR PaxDb; 9940-ENSOARP00000020952; -.
DR Ensembl; ENSOART00000021244.1; ENSOARP00000020952.1; ENSOARG00000019502.1.
DR eggNOG; KOG4289; Eukaryota.
DR HOGENOM; CLU_000158_1_0_1; -.
DR OMA; YTFLRGN; -.
DR Proteomes; UP000002356; Chromosome 3.
DR Bgee; ENSOARG00000019502; Expressed in fallopian tube and 50 other cell types or tissues.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004930; F:G protein-coupled receptor activity; IEA:UniProtKB-KW.
DR GO; GO:0007166; P:cell surface receptor signaling pathway; IEA:InterPro.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR CDD; cd11304; Cadherin_repeat; 8.
DR CDD; cd00054; EGF_CA; 3.
DR CDD; cd00055; EGF_Lam; 1.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.60.220.50; -; 1.
DR Gene3D; 2.60.40.60; Cadherins; 10.
DR Gene3D; 4.10.1240.10; GPCR, family 2, extracellular hormone receptor domain; 1.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR Gene3D; 1.20.1070.10; Rhodopsin 7-helix transmembrane proteins; 1.
DR Gene3D; 2.170.300.10; Tie2 ligand-binding domain superfamily; 1.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR020894; Cadherin_CS.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR032471; GAIN_dom_N.
DR InterPro; IPR046338; GAIN_dom_sf.
DR InterPro; IPR017981; GPCR_2-like_7TM.
DR InterPro; IPR036445; GPCR_2_extracell_dom_sf.
DR InterPro; IPR000832; GPCR_2_secretin-like.
DR InterPro; IPR000203; GPS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR002049; LE_dom.
DR PANTHER; PTHR24026:SF36; CADHERIN EGF LAG SEVEN-PASS G-TYPE RECEPTOR 1; 1.
DR PANTHER; PTHR24026; FAT ATYPICAL CADHERIN-RELATED; 1.
DR Pfam; PF00002; 7tm_2; 1.
DR Pfam; PF00028; Cadherin; 6.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF16489; GAIN; 1.
DR Pfam; PF01825; GPS; 1.
DR Pfam; PF00053; Laminin_EGF; 1.
DR Pfam; PF02210; Laminin_G_2; 2.
DR PRINTS; PR00205; CADHERIN.
DR PRINTS; PR00249; GPCRSECRETIN.
DR SMART; SM00112; CA; 9.
DR SMART; SM00181; EGF; 4.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00180; EGF_Lam; 1.
DR SMART; SM00303; GPS; 1.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49313; Cadherin-like; 9.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF81321; Family A G protein-coupled receptor-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00232; CADHERIN_1; 6.
DR PROSITE; PS50268; CADHERIN_2; 8.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 1.
DR PROSITE; PS50261; G_PROTEIN_RECEP_F2_4; 1.
DR PROSITE; PS50221; GPS; 1.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00043}; Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW G-protein coupled receptor {ECO:0000256|ARBA:ARBA00023040};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Transducer {ECO:0000256|ARBA:ARBA00023224};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2156..2179
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2191..2209
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2215..2237
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2258..2278
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2298..2318
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2338..2361
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2367..2388
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1..95
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 96..201
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 202..308
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 309..430
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 431..534
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 535..710
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 711..812
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 835..935
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1014..1072
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1074..1110
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1114..1152
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1153..1357
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1400..1581
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1583..1620
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1692..1739
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 2154..2389
FT /note="G-protein coupled receptors family 2 profile 2"
FT /evidence="ECO:0000259|PROSITE:PS50261"
FT REGION 558..582
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1759..1804
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2032..2051
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2446..2678
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 566..582
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2033..2047
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2476..2494
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2496..2510
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2511..2525
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2558..2587
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2643..2657
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1062..1071
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1692..1704
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1694..1711
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1713..1722
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ SEQUENCE 2704 AA; 293153 MW; 6063688A5B8DD462 CRC64;
PAGTAVIELR AHDPDEGEAG RLGYQMEALF DERSNGYFLI DADTGLVSTA RPLDRETKDT
HVLKVSAVDH GSPRRSAATY LTVTVSDTND HSPVFEQSEY RERVRENLEV GYEVLTIRAT
DGAAPSNANM RYRLLEGARG IFEIDERSGV VRTRAVVDRE EEASYQLLVE ANDQGRNPGP
LSATATVHIV VEDENDNYPQ FSEKRYVVQV PEDVAVNTPV LRVQATDRDQ GQNAAIHYSI
IVSGNLKGQF YLHSLSGSLD VINPLDFETI REYTLRIKAQ DGGRPPLINS SGLVSVQVLD
VNDNAPIFVS SPFQAAVLEN VPLGHSVLHI QAVDADAGEN ARLRYRLVDT ASASVGGGGS
GPAAPPLAAD FPFQIHNSSG WITVCAELDR EEVEHYSFGV EAVDHGSPPM SSSASVSITV
LDVNDNDPVF TQPVYELRLN EDAAVGSSVL TLRALDRDAN SVITYQLTGG NTRNRFALSS
QSGGGLITLA LPPDYKQERQ EVVGGTVTAS DGMRSHTAQV FINVTDANTH RPVFQSSHYT
VSISEDRPVG TSIATISATD EDTGENAPPP PRPAGPPPWR PQLCPINLAT YPMAFAGRDN
GIPQKSDTTS LEILILDAND NAPRFLRDFY QGSVFEDAPP STSVLQVSAT DRDSGPNGRL
LYTFQGGDDG DGAVYNLRAL AVDRGSPVSL SASVEIQVTV LDINDNPPVF EKDELELFVE
ENSPVGSVVA RIRASDPDEG PNAQIMYQIV EGNVPEVFQL DLLSGDLRAL MELDFEVRRE
YVLVVQATSA PLVSRATVHI RLLDQNDNPP VLPDFQILFN NYVTNKSNSF PSGVIGRIPA
HDPDLSDSLN YTFLQGNELQ LLLLDPATGE LQLSRDLDNN RPLEALMEVS VSDGIHSVVA
LCTLRVTVIT DDMLTNSITV RLEDMSQERF LSPLLSRFVE GVAAVLSTTK DAVFVFNIQN
DTDVSANILN VTFSALLPGG VRDKFFPSED LQEQIYLNRT LLTAVSTQRV LPFDDNICLR
EPCENYMKCV SVLRFDSSAP FISSPTVLFR PIHPVNGLRC RCPPGFTGDY CETEIDLCYS
SPCGAHGRCR SREGGYTCEC QEDFTGKFSE VSARSGRCAH GVCKNGGTCV NLLIGGFHCV
CPPGAFERPY CEVTTRSFPP QSFVTFRGLR QRFHFTVALA FATQERNALL LYNGRFNEKH
DFIALEIVDE QVQLTFSAGE TTTTVAPQVP GGVSDGRWHA VQVQYYNKPN IGRLGLPHGP
SGEKVAVVTV DDCDTAVAVR FGSFVGNYSC AAQGTQSGSK KSLDLTGPLL LGGVPNLPED
FPVRNRQFVG CMRNLSIDGR HVDMASFIAN NGTPAGCAAQ RNFCDGTWCQ NGGTPGSWWR
AGLCQSKQPP GPGFLRTVMP HPQRFSGDSV VFWSDLDITI SVPWYLGLMF RTRKEDGVLM
EATAGGSSRL HLQILNNHVQ FEVSHGSSDV VSMQLSRSRV TDGEWHHLLI ELKSAKEGKD
IKYLAVMTLD YGRDQDTVQI GNQLPGLRMR SLVVGGVSED KVSVRRGFRG CMQGVRMGET
ATNIATLNMN DALKVRVKDG CEVEDPCSSS PCPPHSRCRN TWDGYACVCD RGGHLPALPA
PARAGGRRHM SFYLPSQIDL PCPRGWWGNP VCGPCHCAVS KGFDADCNKT SGQCQCKENY
YRPPGQDACL PCDCFPHGSH SRVCDMDSGQ CSCKPGVIGR QCNRCDNPFA EVTVLGCEVI
YNGCPRAFEA GIWWPQTKFG QPPAAPRPRG GAGGSWEEVG GGWDPPPGGP ESSGPWRALS
PAFQSRPEGL GQGSFVPISD CLQTEVSHPA CSLDCRPRVP VTTRILTQAP SRQSGSCGLA
ARESEQDVVR AGSALLAPAT RAAWEQIQRS EPGTAQLLRR FEAYFSNVAR NLRRTYLRPF
VIVTANMGLV PRGLNTANFL QTGMCQIELE QKRMSSKGDR GFFFFPSPPR APAAQPPGPP
IPSPWPTASC QRPQPAVEPD TCSLITSLHV EFPESASVPE GRVLASPWLN LRQTRSPPPP
PSPSHALNRL PNRPVINTPV VSTVVYSEGA LLPSPLERPG LVELLETEER TKPVCVSWNH
SITTGGTGGW SAKGCELLSR NRTHVACRCS HAASSAGAAQ VSVAFSGGRE VLPLKIVTYA
AVSLSLAALL LAFVLLALVR TLRSNLNGIH KNLIAALFSS QLVFVIGIAQ TENPFLCTVI
AILLHYVYMS TFAWTFVESL HVYRMLTEVR NIDAGPMRFY YVVGWGIPAI VTGLAVGLDP
QGYGNPDFCW LSLRDTLIWS FAGPIGTVIV VNTVIFVLSA KVSCQRKRHY YERKRVVALL
RTAFLLLLLV SATWLLGLLA VNGDALAFHY LFAVFSCLQA VLLLHCLFNR EVRKHLRGAL
AGKKPYADDS ATTRATLLTR SLNCNNTYGE EPDMFRTALG ESTASLDSTA RDEGGQKLSV
SSGPARGGHG EPDASFVPRS AKKPHGHDSD SDSELSLDEQ SSSYASSRSS DSEDDGGEAE
DKWDPAQGPV HSTPKGERAP GHARAGGRAG GRAGSDSEEA GEPPRLKVET KVSVELHLDE
QGNHCGERLP SRDSGGPRPA AVPPSQPPEQ RKGILKNKVT YPPPLTEKTL KSRLREKLAE
CEQSPASSRS SSVGSSDGLR APDGAITVKT PCREPGREHL NGVAMSMNVR AGSAQAHGSG
SERP
//