ID G3VZM2_SARHA Unreviewed; 3049 AA.
AC G3VZM2;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 56.
DE RecName: Full=Calx-beta domain-containing protein {ECO:0000259|SMART:SM00237};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000008627.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000008627.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000008627.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the FRAS1 family.
CC {ECO:0000256|ARBA:ARBA00005529}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9305.ENSSHAP00000008627; -.
DR Ensembl; ENSSHAT00000008698.2; ENSSHAP00000008627.2; ENSSHAG00000007475.2.
DR eggNOG; KOG1306; Eukaryota.
DR eggNOG; KOG3597; Eukaryota.
DR GeneTree; ENSGT00940000162501; -.
DR TreeFam; TF316876; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0007154; P:cell communication; IEA:InterPro.
DR Gene3D; 2.60.40.2030; -; 5.
DR InterPro; IPR038081; CalX-like_sf.
DR InterPro; IPR003644; Calx_beta.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR045658; FRAS1-rel_N.
DR PANTHER; PTHR45739:SF5; FRAS1-RELATED EXTRACELLULAR MATRIX PROTEIN 3; 1.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF16184; Cadherin_3; 12.
DR Pfam; PF03160; Calx-beta; 5.
DR Pfam; PF19309; Frem_N; 1.
DR SMART; SM00237; Calx_beta; 5.
DR SUPFAM; SSF141072; CalX-like; 5.
DR PROSITE; PS51854; CSPG; 12.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..3049
FT /note="Calx-beta domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5029775996"
FT REPEAT 310..413
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 438..528
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 549..664
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 689..795
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 816..907
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 931..1023
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1052..1154
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1175..1268
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1289..1387
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1408..1500
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1520..1609
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1643..1740
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT DOMAIN 1747..1846
FT /note="Calx-beta"
FT /evidence="ECO:0000259|SMART:SM00237"
FT DOMAIN 1859..1969
FT /note="Calx-beta"
FT /evidence="ECO:0000259|SMART:SM00237"
FT DOMAIN 1984..2090
FT /note="Calx-beta"
FT /evidence="ECO:0000259|SMART:SM00237"
FT DOMAIN 2103..2207
FT /note="Calx-beta"
FT /evidence="ECO:0000259|SMART:SM00237"
FT DOMAIN 2225..2329
FT /note="Calx-beta"
FT /evidence="ECO:0000259|SMART:SM00237"
SQ SEQUENCE 3049 AA; 336494 MW; 165086183628ADB2 CRC64;
MTGAWHRPPG TARRLLTALA ALLLGCPALR AQTPEPELSA RALVPATRGS EGAGILAAPP
GLRVPLGRSV WLDPGRDLVL RVRPGDRCAV TVVGEDGGGG AAGSPAPGAL TPASFPCAFG
PEQVLYTHFG ARSPSRHRLR LQLRCDSSSR TLVLPFTLPV DVDSTRLEVV TRNLPLTVVR
LRGFSNAVDH RVLNFSTASL AASPRDGAPL ACRLTPLPRE SGPLPRHGKL VSPSGAPLSP
GREMDCQAFL NAGVRYRHLA STPSPDRDRV PFLVEILGPE KEEAVGLPRI LAREHFHMLV
RIQEGRENTP PRLSFGAMMM MEVGQFVLTP LTLESLAAED VETSSDDLVF NILNSPTVLP
GSHRQRGYFI NTDDPFGGTV TSFTQQDLRE LKIAYQPPTG DSNRERLFQL EMEVMDADGA
ASAPFAFMVV VKPMNTLAPV ASRNRGLLLF EGQSRPLSDA QGLEISDEDN LEEVRVTVAK
GLSHGQLVVL GAPPGRKFFT PAELTAGRVI YQHDGSDSYS DNVIFRLSDG HHDVEFLFPI
TIVPEDDEPP IVSTNTGLSL AEGEVVRISH LVLSATDIDS EDSTIRFVLK TSRLGEVLLR
QSEPPSSLMA EGGQGWTYVE KEDLYEKVVT EWLQQDILEG RLFYRHLGPH KPSSMIDQLV
FHVQDDHDPP NQSGQHFFTI KVQPVDVLSP ELHPGVSLEM TVKAYQLTPF QKQFLCYTDA
DSDDWHLQYM LLTPPMDMDN NHPVPAGEIV LTDEPTRSVT QFTQAQVNHH KVAYQPPQQE
VGIAPRVVQF TYRVEDAAGN SVPGTFRLFL QPVDDQPPEV TNRGFAVQKG ESFILTSREL
DVTDPDTEAD NIVFTLAQGP QHGQLLYLEG VMTSGTYFMK DDIIKGHVSY QHDGSETTKD
AFQLLVSDRI HQVPIIVRII IKTIDDRTPV PGGGWVGTAI DVLENGATEM TTGIIQGTEK
NTDDLMLTFM VEKSPQLGTI LLNGLPTEQF TQGDLINGAV TYAHTGGEIG MQKQHDAFNL
TVSNPSNIWM VAGKVVEGVQ VQVTVLPLDS VPPEVKGGEL LSVPEGGKGT LTLRHLDAKD
VDTPHDDILC TIIGQPSFGY LENLSPAPGS EKSQVGNPIS AFTIKDICLG HINYVQSIHQ
GIEPREDQVT FYCSDGVNVS PNYIFPIIIL PTNDEQPELF IHDFVVLEGM SLVIDTPLLN
SADSDLPPDE LRFQVTVLPQ NGRIVQQLAT GSRPVHSFTL EEIQEASTIV YEHDDSETTM
DSFEVWVTDG KHAIHKKIPI TVILVDDETP RLSINDGLEV DIGQTKVITN QVLKATDLDS
NDKDLAYILR SGPGQGLLQK LLEPGGKVWS NLTLGMNFTQ SEVDQGLICY SHKGLGGVQD
LIKFDVTDGI NPLIDRYFYV TISGPDKVFP EVINKGVTLK EGDRVTLTTD LLSTSDINSP
DEQLCFSITR APNRGHLESS DFPGKPIDSF TQLQLAGNKI SYVHTAKDEV KMDSFEFEVT
DGHNTIFQTF HVFIIDVDNK KPVLTVHPLL VQEGERKLIT PFELSAEDQD SPDGSLLFTI
THIPAHGQIL YNGSQPITSF TKQDLNENLI SYWHDGTETT EDSFSFVVTD GTHTDFFIFP
DTVLATHKPQ VMRIRINSLD NGIPRVVVNK GAPDLRILRT GQLGFLITSK SLKAEDQDSP
HNLLKYTVTS GPEHGHLIHL DLGNENIKVF TQDDIDKMKL CYVLKVGSNA TSDIFYFSVE
DSGGNKLRSQ SFHLNWAWIS LEKEHYIIDE DSKFLEVTLK RRGYLKETSF ISIGTKDGTA
QKDKDFKGKP QKQVQFNPGQ STAMWRVRII SDGKYEASET FQIILSDPVM AALEFPKMAT
VEIIDPSDES TVYIPKPEYR IEEDIGELLI PIRRSGDVSQ ELMVLCSTHQ GTASGTIPST
VLSYSDYISR PEDHTSILRF DKDETEKRCR VIIIDDSLYE EETFNVTLSM AMGGQVAVRY
PSTKVVILED ADDEPALYFE DAEYHVDESA GYVEVCVRRK GTDLSKPAVI TIQSRKTEPV
SAEAGMDYVG VSQNLYFAPG ENMKTFRVTI VDDLGQPVLE GPEKFELVLC MPVGSALGKP
NTTTVIINDS FTDLPKVQFK KPSCTGNERD GLVRVIIHRD GDISLSSTVR CYTRQGSAQV
AADYKERPNT DDSTVIFLPG EREKACVVIL EDDSIYEEDE EFRLLLGTPK SSSAFGASVG
EQKEILIRIK DEEDKPVIKF SKTRYSIQEP QHPRETAVLR ISVVRLGDTS KVSVVGTHTK
DGSASSGEDY NPMSKDVEFK KGETEHFVEI EVLYDGIREI RETLTVHLKP DENMVAETQM
NKAIIYIEET DSVADVTFPA IPQVMSLFAY GDTYATQEKA SPPTGYPVVC VTACNSKYSD
YDKTESICTE ENINDTLTLY RWLVGAPASS NGVTSPMQEI DANTFFTDTK SITLDSIYFQ
AGSRIQCAAR AVRTTGDIGL ELLSSIITVS KEHGLCQPRK PEMVGAEPFS ARIRYTGPDE
PDYPNLIKLT VIMSHLDGML PVISTQPLSN LELTLSPDGS RVSNHKCSNL LDYSEIQTKH
GFIKEPARSP DIISETLPYQ YSSSLRSART LRFYHNLDLD SCLWEFSNYY DMSELLTACG
GSVSTDGQVL NLVQSHVTLR VPLFVSYVFH SPAAIGGWKH FDLRSELRLT FVYDTAILWK
EGIGSPPGSE LQGTLHPTSM HINEEGQLVV DFQTEIRFQG QFVMSHPGTS LTSVVMSVDH
PGLTFTLTLL SSEPTFHRPE QRWSFVSDFA IRDYSGTYTI KLVPCVAAPQ QEHTLPMTCH
PREPIAFDLD IRFQQVSDPV AAEFSLNTRL LLFSKKELWE SDERVGSGAG SDVAFPEGST
IYGRVSVEPA QNLGAAFSCS VEQVFLCTGV DGHVPKYNPE NEDFGCLADS PSLLHRFKIL
DRARPATQAR SFHNVSFEAH LAAEPEAALP SVRQPGSDGF SLSSAALFQV GPGREWYLHA
VYTVRSRREG PAGAPSFRRS RRADAVDTGT DLRRVALLRS APAALGPEG
//