GenomeNet

Database: UniProt
Entry: H2ZSC5_LATCH
ID   H2ZSC5_LATCH            Unreviewed;      2091 AA.
AC   H2ZSC5;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 67.
DE   SubName: Full=FRAS1 related extracellular matrix 1 {ECO:0000313|Ensembl:ENSLACP00000000296.1};
GN   Name=FREM1 {ECO:0000313|Ensembl:ENSLACP00000000296.1};
OS   Latimeria chalumnae (Coelacanth).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Coelacanthiformes; Coelacanthidae; Latimeria.
OX   NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000000296.1, ECO:0000313|Proteomes:UP000008672};
RN   [1] {ECO:0000313|Proteomes:UP000008672}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT   "The draft genome of Latimeria chalumnae.";
RL   Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLACP00000000296.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AFYH01185387; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01185388; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01185389; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01185390; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01185391; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01185392; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01185393; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01185394; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 7897.ENSLACP00000000296; -.
DR   Ensembl; ENSLACT00000000298.1; ENSLACP00000000296.1; ENSLACG00000000266.1.
DR   eggNOG; KOG3597; Eukaryota.
DR   GeneTree; ENSGT00940000156990; -.
DR   HOGENOM; CLU_001041_0_0_1; -.
DR   InParanoid; H2ZSC5; -.
DR   OMA; MAMFTLE; -.
DR   TreeFam; TF316876; -.
DR   Proteomes; UP000008672; Unassembled WGS sequence.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR039005; CSPG_rpt.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045658; FRAS1-rel_N.
DR   PANTHER; PTHR45739:SF7; FRAS1-RELATED EXTRACELLULAR MATRIX PROTEIN 1; 1.
DR   PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR   Pfam; PF16184; Cadherin_3; 12.
DR   Pfam; PF19309; Frem_N; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   PROSITE; PS51854; CSPG; 12.
PE   4: Predicted;
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008672};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..2091
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003579473"
FT   DOMAIN          24..187
FT                   /note="FRAS1-related extracellular matrix protein N-
FT                   terminal"
FT                   /evidence="ECO:0000259|Pfam:PF19309"
FT   REPEAT          297..389
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          414..501
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          522..616
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          643..755
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          777..868
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          888..983
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1025..1127
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1148..1257
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1279..1376
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1397..1489
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1510..1600
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1632..1728
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
SQ   SEQUENCE   2091 AA;  236458 MW;  36E55C9025753D76 CRC64;
     SLNWKCHLFV FLVLLLKNTD STFIKANHGI KATKGQSAFL SEKHLQFDIP KEKDACKIEV
     VMNEPITQRV GKLIPQVFDC HFLPNEVKYI HGGCPILDED QVMLRLYRFT ETETFTETFL
     LRVTLIDADC NIIKLGSKPL EVPEFYGLSN VIDRNVLMFD YDRKLNLECT VRVATLESLL
     PAHGQLVIGE LLKEEPHGDQ PQSFFPLRPQ SKHKGGLRCR AGNCQKGLKQ IKTTKVSCEE
     FLVMGLKYQH LDPPSPEIDY IAIRLDLTDS RSRTVYKSEH AWIPVRIRNA IPNQIPKAAF
     MSMFILEVDQ FILTPMTIGA LDAEDSETPQ NLLVFNITTP PPQGYITHLS DHTKPITSFT
     WQDLNEMLIA YQPPNSSHTE RRNYEVEFEV HDFFFKKSAP IMVHISIRTA DTNAPRVSWN
     MGLNLLEGQS RPITWEQFQI VDNDDISAVR LITVDGLQHG RLTVRGGKGF MFTVSDIKAG
     VVRYHHDDSD TTKDFVVFRI FDGRHSIRHK FPINILPKDD SPPFLITNVV FELCEGQNIL
     IQCSMLQASD MDSSDDYILF NITKPPQAGE IVKQPGPELI GYPVTSFLQR DLFNAIIYYH
     HLGGEVFEDS FEFVLSDSHD PPNLSEPQAV IIHIAPVDDQ LPKEVPGVVR QLVVKETEIV
     HLTKRQLHFM DTESPDRQLT YTITTTPFFT SVYGRPDAGR LFLVDSVPKL VKDPTALMLR
     SFTQHAVNYM KVAYMPPIQD IGPDPQHVQF IFSVSNQHGG TLIGICFNIT VLPVDNQAPE
     VFTNQLKVEE GGVGRITVDH LLVNDADTKP EDLRVWLRRK PLYGQLQLDG FSMKEGDSFT
     LEDLKTFKAR YQHDGSEVLR DEIFLTATDG INSEECVLQI KVLPVNDEPP VIKNGLSPMM
     QCLEGEEVVI TSEYLYAIDA DSDDMKLSYI IVRQPFYGVV RKSGIIVDGF SQADIVSELV
     TYKHTGREIG LTPCFDTITV VISDDEAEAG KSCCYDGLHH PQVPFHGSFP VYDLNITVFP
     VDNQPPSIVI GDMFVVDEGS SAVITVNHLC ATDPDTPVDE LQFVLVAPPQ FGYIENTLPS
     PGFEKSNTGI SIASFRLKHL KDLHINYVQS RHQRIEPTAD QFMVYVTDGK HQSVETPLYI
     IIRPTNDEVP EFLARNITVH EGQMKELDPS VINAVDMDIP RDHLVFTITK QPRHGMFMEG
     LYGNDLTRYK RLIHSHQNHA LLLHDFTMDH LKNGRMKLMY MHDDSENMAD SFTVQLSDGK
     HKVRKAVSVK IIPLNDEKPI LLKKKSELTV YMGETRTISS LVLSAEDKDT PREGVYYMFD
     SAPKHGLLQL KEGRDWATLS AGMNCTQDNI DMNLLRFVHT GAMGSKGQDD FKFHLWDKEN
     RSPLQTFYIS IKDMEKGEIV TFIKLLRVSK GDRVLLTTNF LLAVDGSDRP EELLYVITSP
     PMYGQIEYVN YPGSVVTNFS QMDVAAQTVC YVHKSKAHAT KDSFRFIVSN GLSTKNGTFE
     IIIENVDCAL PTVSKNKGMR LVEGTMMIIS PEILQLSDPD SPPQNLTYLI AQFPQYGQLY
     RRKAVLNQHN FTQQDIDNMD IAYRHGGGAS QIDRFTFIAS DKSNHGFLVN GRMHTEPVVF
     TIEVDRLNKT TPRIIHLQCA SKVEYFKTGR YGIYITSRDL KASDPDSKDD EISFKILRGP
     QYGYLENVTT GGFIHEGYTQ KDLNSKQILY VINPALEVTS DSLEFQVSDL TGNTALPQMF
     CVSQMSWQFI SWIQQAYRNS CVNLCQVPLE YIQGGVAPDV AVVGLSVLSL SCGREFICNP
     SRFLFFDVMY KTRDQNSALK AHKVSAVYID GPRSIDTPPP VFVQILSLNY NDTKVSYSTT
     HNYQVPWMRG KIVPFSADIS PPEHGAVQRE ESAVTPLKAE RVQTRGDTPQ TFGHSNPPKN
     RLRAVGSGKT VSVFILGSPH TYTYNHYHGL VSLRVEDDTS PEKFGKKAEV LVMNQGQQKL
     LAGSHRKVEI LQADRFNSTW CHCFLGCFHD IPPSFPKSCT PDLKGLLHFD QAIQKMFKCD
     GISWKPWQPE FEDLSAKKCP TGWTNHDSHC YLLSSEHKVT WNTAARACKE R
//