ID K7F7L3_PELSI Unreviewed; 944 AA.
AC K7F7L3;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 76.
DE SubName: Full=Protocadherin alpha-8 {ECO:0000313|Ensembl:ENSPSIP00000004023.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000004023.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000004023.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004251};
CC Single-pass type I membrane protein {ECO:0000256|ARBA:ARBA00004251}.
CC Membrane {ECO:0000256|ARBA:ARBA00004479}; Single-pass type I membrane
CC protein {ECO:0000256|ARBA:ARBA00004479}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01166617; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01166618; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01166619; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01166620; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01166621; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01166622; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01166623; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_006135158.1; XM_006135096.2.
DR AlphaFoldDB; K7F7L3; -.
DR Ensembl; ENSPSIT00000004046.1; ENSPSIP00000004023.1; ENSPSIG00000003792.1.
DR GeneID; 102444260; -.
DR eggNOG; KOG3594; Eukaryota.
DR GeneTree; ENSGT00940000163564; -.
DR HOGENOM; CLU_006480_3_0_1; -.
DR OMA; YAMAGHC; -.
DR OrthoDB; 4259465at2759; -.
DR TreeFam; TF332299; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR CDD; cd11304; Cadherin_repeat; 6.
DR Gene3D; 2.60.40.60; Cadherins; 6.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR031904; Cadherin_CBD.
DR InterPro; IPR020894; Cadherin_CS.
DR InterPro; IPR013164; Cadherin_N.
DR PANTHER; PTHR24028; CADHERIN-87A; 1.
DR PANTHER; PTHR24028:SF133; PROTOCADHERIN ALPHA-9; 1.
DR Pfam; PF00028; Cadherin; 5.
DR Pfam; PF08266; Cadherin_2; 1.
DR Pfam; PF15974; Cadherin_tail; 1.
DR PRINTS; PR00205; CADHERIN.
DR SMART; SM00112; CA; 6.
DR SUPFAM; SSF49313; Cadherin-like; 6.
DR PROSITE; PS00232; CADHERIN_1; 4.
DR PROSITE; PS50268; CADHERIN_2; 6.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00043}; Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..944
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003903839"
FT TRANSMEM 693..714
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 34..133
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 134..242
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 243..350
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 351..455
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 456..565
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 580..682
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT REGION 824..850
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 864..944
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 929..944
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 944 AA; 100565 MW; F996A7786E9A5C72 CRC64;
MAGPRREALG AGQLLRWVLL QAAWAAGGGQ LHYSVPEEAE PGTLVGRLAQ DVGLEVAELV
SRRLRAVAKG GRDCFEVNAQ SGALLVKSRL DREELCGPSP RCALDLEVLV DKPLRLFPVE
VEIRDINDNA PSFPLAEKNL FILESRLPGS RFPLEGASDA DIGTNSLLSY TLSPNEHFSL
DLQRNDGHTE SLVLVLKKPL DREQTPEHRL LLTATDGGKP ELMGTAQLVI SVRDVNDNAP
VFNQPVYKIQ LLENAANGTL VIKLNATDID EGSNKEIIYS IRSLGTPRER AAFRLSPETG
EISVNRELDF EDINLYEIWV QATDKGNPPL AGHCKVLVEV LDVNDHAPEL AVTSLSLPVP
EDAPPGTVVA LLSVSDRDSG DNGKVSCSIG PGLPFRLVST FKNYHSLVVA EALDRERVAE
YEVVVRARDQ GAPPLWASRR LVVPLSDVND NAPAFPQAVY TVFVRENNPA GAPVLSVSAW
DPDLGANGRV SYWAVERRVG ERPLSSYISV HSESGQVSAL QPLDYEELQV LQFEVSARDA
GLPALCGNVS VQLFVLDAND NAPVVSPPGS GRGALGPELV PLSAGAGHVV GKVRAVDADS
GYNAWLRYEL QEPRAAGPFR VGVYSGEIST ARALEEADGP SRSLVIVVRD HGEPALSATA
TVSLSLSERA QAVAWESGPP RAGSAGPAGA MNVSLMIAIC SVSGLFVLVI VLYVGLRCPG
APEVTCGPGK GPGASASEAG SWSYSQRQSR LVCVGEGTVR SDLMVFSPVC PQSSENGEAG
NGKVLTTNAS GMPKQPNPDW RYSASLRAGI QSAVHMEEAG VLRGGPGGPE QLWPTVSSAT
PEPEAGEVSP PVGAGVNSNS WTFKYGPGNP KQAGPGELPD KFIIPGSPAI ISIRQEPPNS
QIDKSDFITF GKKEETKKKK KKKKGNKTQE KKEKGNSTTD NSDQ
//