ID H2ZSC5_LATCH Unreviewed; 2091 AA.
AC H2ZSC5;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=FRAS1 related extracellular matrix 1 {ECO:0000313|Ensembl:ENSLACP00000000296.1};
GN Name=FREM1 {ECO:0000313|Ensembl:ENSLACP00000000296.1};
OS Latimeria chalumnae (Coelacanth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Coelacanthiformes; Coelacanthidae; Latimeria.
OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000000296.1, ECO:0000313|Proteomes:UP000008672};
RN [1] {ECO:0000313|Proteomes:UP000008672}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT "The draft genome of Latimeria chalumnae.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLACP00000000296.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFYH01185387; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01185388; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01185389; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01185390; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01185391; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01185392; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01185393; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01185394; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 7897.ENSLACP00000000296; -.
DR Ensembl; ENSLACT00000000298.1; ENSLACP00000000296.1; ENSLACG00000000266.1.
DR eggNOG; KOG3597; Eukaryota.
DR GeneTree; ENSGT00940000156990; -.
DR HOGENOM; CLU_001041_0_0_1; -.
DR InParanoid; H2ZSC5; -.
DR OMA; MAMFTLE; -.
DR TreeFam; TF316876; -.
DR Proteomes; UP000008672; Unassembled WGS sequence.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045658; FRAS1-rel_N.
DR PANTHER; PTHR45739:SF7; FRAS1-RELATED EXTRACELLULAR MATRIX PROTEIN 1; 1.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF16184; Cadherin_3; 12.
DR Pfam; PF19309; Frem_N; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR PROSITE; PS51854; CSPG; 12.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000008672};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..2091
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003579473"
FT DOMAIN 24..187
FT /note="FRAS1-related extracellular matrix protein N-terminal"
FT /evidence="ECO:0000259|Pfam:PF19309"
FT REPEAT 297..389
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 414..501
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 522..616
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 643..755
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 777..868
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 888..983
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1025..1127
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1148..1257
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1279..1376
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1397..1489
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1510..1600
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1632..1728
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
SQ SEQUENCE 2091 AA; 236458 MW; 36E55C9025753D76 CRC64;
SLNWKCHLFV FLVLLLKNTD STFIKANHGI KATKGQSAFL SEKHLQFDIP KEKDACKIEV
VMNEPITQRV GKLIPQVFDC HFLPNEVKYI HGGCPILDED QVMLRLYRFT ETETFTETFL
LRVTLIDADC NIIKLGSKPL EVPEFYGLSN VIDRNVLMFD YDRKLNLECT VRVATLESLL
PAHGQLVIGE LLKEEPHGDQ PQSFFPLRPQ SKHKGGLRCR AGNCQKGLKQ IKTTKVSCEE
FLVMGLKYQH LDPPSPEIDY IAIRLDLTDS RSRTVYKSEH AWIPVRIRNA IPNQIPKAAF
MSMFILEVDQ FILTPMTIGA LDAEDSETPQ NLLVFNITTP PPQGYITHLS DHTKPITSFT
WQDLNEMLIA YQPPNSSHTE RRNYEVEFEV HDFFFKKSAP IMVHISIRTA DTNAPRVSWN
MGLNLLEGQS RPITWEQFQI VDNDDISAVR LITVDGLQHG RLTVRGGKGF MFTVSDIKAG
VVRYHHDDSD TTKDFVVFRI FDGRHSIRHK FPINILPKDD SPPFLITNVV FELCEGQNIL
IQCSMLQASD MDSSDDYILF NITKPPQAGE IVKQPGPELI GYPVTSFLQR DLFNAIIYYH
HLGGEVFEDS FEFVLSDSHD PPNLSEPQAV IIHIAPVDDQ LPKEVPGVVR QLVVKETEIV
HLTKRQLHFM DTESPDRQLT YTITTTPFFT SVYGRPDAGR LFLVDSVPKL VKDPTALMLR
SFTQHAVNYM KVAYMPPIQD IGPDPQHVQF IFSVSNQHGG TLIGICFNIT VLPVDNQAPE
VFTNQLKVEE GGVGRITVDH LLVNDADTKP EDLRVWLRRK PLYGQLQLDG FSMKEGDSFT
LEDLKTFKAR YQHDGSEVLR DEIFLTATDG INSEECVLQI KVLPVNDEPP VIKNGLSPMM
QCLEGEEVVI TSEYLYAIDA DSDDMKLSYI IVRQPFYGVV RKSGIIVDGF SQADIVSELV
TYKHTGREIG LTPCFDTITV VISDDEAEAG KSCCYDGLHH PQVPFHGSFP VYDLNITVFP
VDNQPPSIVI GDMFVVDEGS SAVITVNHLC ATDPDTPVDE LQFVLVAPPQ FGYIENTLPS
PGFEKSNTGI SIASFRLKHL KDLHINYVQS RHQRIEPTAD QFMVYVTDGK HQSVETPLYI
IIRPTNDEVP EFLARNITVH EGQMKELDPS VINAVDMDIP RDHLVFTITK QPRHGMFMEG
LYGNDLTRYK RLIHSHQNHA LLLHDFTMDH LKNGRMKLMY MHDDSENMAD SFTVQLSDGK
HKVRKAVSVK IIPLNDEKPI LLKKKSELTV YMGETRTISS LVLSAEDKDT PREGVYYMFD
SAPKHGLLQL KEGRDWATLS AGMNCTQDNI DMNLLRFVHT GAMGSKGQDD FKFHLWDKEN
RSPLQTFYIS IKDMEKGEIV TFIKLLRVSK GDRVLLTTNF LLAVDGSDRP EELLYVITSP
PMYGQIEYVN YPGSVVTNFS QMDVAAQTVC YVHKSKAHAT KDSFRFIVSN GLSTKNGTFE
IIIENVDCAL PTVSKNKGMR LVEGTMMIIS PEILQLSDPD SPPQNLTYLI AQFPQYGQLY
RRKAVLNQHN FTQQDIDNMD IAYRHGGGAS QIDRFTFIAS DKSNHGFLVN GRMHTEPVVF
TIEVDRLNKT TPRIIHLQCA SKVEYFKTGR YGIYITSRDL KASDPDSKDD EISFKILRGP
QYGYLENVTT GGFIHEGYTQ KDLNSKQILY VINPALEVTS DSLEFQVSDL TGNTALPQMF
CVSQMSWQFI SWIQQAYRNS CVNLCQVPLE YIQGGVAPDV AVVGLSVLSL SCGREFICNP
SRFLFFDVMY KTRDQNSALK AHKVSAVYID GPRSIDTPPP VFVQILSLNY NDTKVSYSTT
HNYQVPWMRG KIVPFSADIS PPEHGAVQRE ESAVTPLKAE RVQTRGDTPQ TFGHSNPPKN
RLRAVGSGKT VSVFILGSPH TYTYNHYHGL VSLRVEDDTS PEKFGKKAEV LVMNQGQQKL
LAGSHRKVEI LQADRFNSTW CHCFLGCFHD IPPSFPKSCT PDLKGLLHFD QAIQKMFKCD
GISWKPWQPE FEDLSAKKCP TGWTNHDSHC YLLSSEHKVT WNTAARACKE R
//