ID F1R2T2_DANRE Unreviewed; 2137 AA.
AC F1R2T2; A0A8M3AMZ9;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 20-JUN-2018, sequence version 3.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=Fras-related extracellular matrix protein 1b isoform X1 {ECO:0000313|RefSeq:XP_009294520.1};
DE SubName: Full=Fras1-related extracellular matrix 1b {ECO:0000313|Ensembl:ENSDARP00000084650};
GN Name=frem1b {ECO:0000313|Ensembl:ENSDARP00000084650,
GN ECO:0000313|RefSeq:XP_009294520.1,
GN ECO:0000313|ZFIN:ZDB-GENE-050208-783};
GN Synonyms=si:ch211-219p7.1 {ECO:0000313|RefSeq:XP_009294520.1};
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000084650};
RN [1] {ECO:0000313|Ensembl:ENSDARP00000084650}
RP IDENTIFICATION.
RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000084650};
RG Ensembl;
RL Submitted (JUL-2011) to UniProtKB.
RN [2] {ECO:0000313|Ensembl:ENSDARP00000084650, ECO:0000313|Proteomes:UP000000437}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000084650};
RX PubMed=23594743; DOI=10.1038/nature12111;
RG Genome Reference Consortium Zebrafish;
RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., Muffato M.,
RA Collins J.E., Humphray S., McLaren K., Matthews L., McLaren S., Sealy I.,
RA Caccamo M., Churcher C., Scott C., Barrett J.C., Koch R., Rauch G.J.,
RA White S., Chow W., Kilian B., Quintais L.T., Guerra-Assuncao J.A., Zhou Y.,
RA Gu Y., Yen J., Vogel J.H., Eyre T., Redmond S., Banerjee R., Chi J., Fu B.,
RA Langley E., Maguire S.F., Laird G.K., Lloyd D., Kenyon E., Donaldson S.,
RA Sehra H., Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M.,
RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J.,
RA Clee C., Oliver K., Clark R., Riddle C., Elliot D., Eliott D.,
RA Threadgold G., Harden G., Ware D., Begum S., Mortimore B., Mortimer B.,
RA Kerry G., Heath P., Phillimore B., Tracey A., Corby N., Dunn M.,
RA Johnson C., Wood J., Clark S., Pelan S., Griffiths G., Smith M.,
RA Glithero R., Howden P., Barker N., Lloyd C., Stevens C., Harley J.,
RA Holt K., Panagiotidis G., Lovell J., Beasley H., Henderson C., Gordon D.,
RA Auger K., Wright D., Collins J., Raisen C., Dyer L., Leung K.,
RA Robertson L., Ambridge K., Leongamornlert D., McGuire S., Gilderthorp R.,
RA Griffiths C., Manthravadi D., Nichol S., Barker G., Whitehead S., Kay M.,
RA Brown J., Murnane C., Gray E., Humphries M., Sycamore N., Barker D.,
RA Saunders D., Wallis J., Babbage A., Hammond S., Mashreghi-Mohammadi M.,
RA Barr L., Martin S., Wray P., Ellington A., Matthews N., Ellwood M.,
RA Woodmansey R., Clark G., Cooper J., Cooper J., Tromans A., Grafham D.,
RA Skuce C., Pandian R., Andrews R., Harrison E., Kimberley A., Garnett J.,
RA Fosker N., Hall R., Garner P., Kelly D., Bird C., Palmer S., Gehring I.,
RA Berger A., Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M.,
RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M.,
RA Rudolph-Geiger S., Teucke M., Lanz C., Raddatz G., Osoegawa K., Zhu B.,
RA Rapp A., Widaa S., Langford C., Yang F., Schuster S.C., Carter N.P.,
RA Harrow J., Ning Z., Herrero J., Searle S.M., Enright A., Geisler R.,
RA Plasterk R.H., Lee C., Westerfield M., de Jong P.J., Zon L.I.,
RA Postlethwait J.H., Nusslein-Volhard C., Hubbard T.J., Roest Crollius H.,
RA Rogers J., Stemple D.L.;
RT "The zebrafish reference genome sequence and its relationship to the human
RT genome.";
RL Nature 496:498-503(2013).
RN [3] {ECO:0000313|RefSeq:XP_009294520.1}
RP IDENTIFICATION.
RC STRAIN=Tuebingen {ECO:0000313|RefSeq:XP_009294520.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the FRAS1 family.
CC {ECO:0000256|ARBA:ARBA00005529}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CR356237; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CR361565; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CR381682; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_009294520.1; XM_009296245.3.
DR PaxDb; 7955-ENSDARP00000084650; -.
DR Ensembl; ENSDART00000090217; ENSDARP00000084650; ENSDARG00000062402.
DR Ensembl; ENSDART00000090217.6; ENSDARP00000084650.6; ENSDARG00000062402.8.
DR GeneID; 557221; -.
DR AGR; ZFIN:ZDB-GENE-050208-783; -.
DR CTD; 557221; -.
DR ZFIN; ZDB-GENE-050208-783; frem1b.
DR eggNOG; KOG3597; Eukaryota.
DR eggNOG; KOG4297; Eukaryota.
DR OrthoDB; 5470912at2759; -.
DR TreeFam; TF316876; -.
DR Proteomes; UP000000437; Chromosome 22.
DR Bgee; ENSDARG00000062402; Expressed in testis and 13 other cell types or tissues.
DR GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IBA:GO_Central.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0007154; P:cell communication; IEA:InterPro.
DR CDD; cd00037; CLECT; 1.
DR Gene3D; 2.60.40.2030; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR038081; CalX-like_sf.
DR InterPro; IPR003644; Calx_beta.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045658; FRAS1-rel_N.
DR PANTHER; PTHR45739:SF3; FRAS-RELATED EXTRACELLULAR MATRIX PROTEIN 1B PRECURSOR; 1.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF16184; Cadherin_3; 12.
DR Pfam; PF03160; Calx-beta; 1.
DR Pfam; PF19309; Frem_N; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR SMART; SM00237; Calx_beta; 1.
DR SMART; SM00034; CLECT; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF141072; CalX-like; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS51854; CSPG; 12.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000000437};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..2137
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5041117176"
FT REPEAT 288..382
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 407..494
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 515..609
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 636..748
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 770..861
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 882..976
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1018..1120
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1141..1249
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1270..1367
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1388..1480
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1501..1591
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1621..1719
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT DOMAIN 2020..2120
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT REGION 1983..2007
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1989..2007
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2137 AA; 238087 MW; 6A106AE63476AF4B CRC64;
MGVFFLALTV LTSSHLWSHV QASSLVQLNR VIRVARGQSV FITANELQFH MDHKSEACKV
EVVLNEPITQ RVGKLTPQVF DCQFFPDEVK YVHNGSPLLE EDTVMLRVYR FTDTETFVES
MLLKVRVFEP QRSLVELGNV PLVVPEFYGL SNAINASVLT FKTQPDVICT LRLLSSEISV
PALGQLVVEE SGTMVDTEAG PRKGRHTAIA CPGNKACDLK TRQVEFLKTD CSEFLSSGLK
YQHLSPPSPE IDYIPVRLEF RDQTSRALLE TESVWLPVLI QGAMQNQPPH AAFMSTFILE
VDQFILTPLS TAALDAKDDE TPQDQLIFSV TKPPAEGYIT HLDDHTKMAS SFSWQDLNEM
KIAYQPPNSS HTARRNYEVE FQAIDGSFAP SLPILMHISI RTAETNAPRV SWNMGLDLLE
GQSRPITWEH LQIVDKDNID AVTLIAMDGP LNGHLSVRGG KGFMFKVKDL REGVVVYHHS
DSDTTRDYVV FRITDGRHSI RHKFPIYILP KDDTPPFLIN NVAFEVQEGG TVRVEEYMLM
ASDLDSSDDY IQYQLITFPR AGQLIKKSSP HEPGLPVKSF LQRDLFQGLI YYKHSGEEVF
EDSFDFILSD VHQPPNLSDR HTVVIHVFPV KDQLPVEVSG TVRSITVKET EVVYITQSLL
HFRDTEHPDT DLMYVITHPC FSPGNTRLSD AGRLFYTDST NAMKKDAMVP VLKSFTQHAV
NHHKVAYMPP IEDIGPEPLF VQFVFSVSDH QGGALTGLTF NITVTPVDNQ APEMFTNLLR
VEEGGGSFLM EEHLLVQDVD SSEDQLRIHM KTKPQHGRLE LQGTAMLEGD SFTLRDLKAL
RVRYIHDDSE TQKDSIGLTI TDGINSAHGA LLIQILPVND EPPQLGSDLK AKLSCKEGGQ
VQITVEYLSA TDVDSDDTRL TYMLARTPGR GVIQRNGVTV DKFSQLDVLN GLIFYLHTGG
EIGPDPVSDT VTLIVSDGEA GTMDGCCQED ALPPPVPLHG TLPVYDLNIT VMPVNNQVPT
ITLGGMIVVD EGARACLCGG VLEASDPDSR PEELMFHLVT PPQYGFLENT LPSPGFEKSN
AGLRVVFFSQ LHLSSGFINY VQSVHQGVEP TADFFTISVS DGMQRSAPLH VYIIINPTND
EVPSLLLSNF TVMEGGMKDL SPDILNAVDV DIPAESLTLT ILDPPAHGTL INGIYGLEMN
RYKSMNPEVL QQTLAIQSFT LEELQQGMKI MYMHDDTETL KDAFTIQLTD GGHTVQGTAC
VRIIPVNDEK PWLLKNAGVE VEALEKRVIS SVVLEAGDQD TPSDHLLYIL NAGPRFGLLQ
LRTEAGWVDL SPGQNFTQED VEMNRLWYSH TTVSGFKGHD RFHFVLTDGE NTTPPQSFFI
SVRTVQKGDI ALITKPVTLL EGERVILTTD ILTAADSGGR PDELIYTVAV PPEHGHLHMV
QTPGVPVFSF SQMDVAANRV CYTHDNSRFA DRDSFSFAIS NGVASRSGTV HFTIEHSDRI
PPTLSTNKGL ELTEGSMKTI STEELKLTDP DTALENLTYV VTQSPQYGKL LFKGLPLSKP
RFTQLDINNM DLAYQHLNGR ATIDRFAFQP TDGTNKGYLE YGQLKSDPAV FTIQIEILDK
TPPSIINKGI PSTVENLPDG KHGIYITSKE LQASDPDSPD DTLEFTITRP PHFGFLENTL
TGAFIKGRFT QKDVDQKAVR YVIPVDVEVT ADSFEFQVTD PAGNTILPEV LELRWSRVDL
SASCYRVCET AGTLAVQVLR SGNSKDPAYI GIQVEEGTAK VGKDFTHSSA SLIQFDPGVN
VKMWNIYLKD DGLEENHERF DIILKAPKNA VLGQRNKASV EIVDPRNGRC NPDDLIVEED
ENQHTYQIQP HIPEPETPVL EEYTPDPRSG ILWENYPPRG DVPYRTDFNH YSLTGQHMEQ
EAFHSPGRRQ LRVLEGNRQR AGLSEVDSRN QERVWRFHGV IPVSQQEAAP RVHSELEITP
IWSWPGDSAE TPQRDSASDI QQHKSQTSVS IDCPNGWTHY RRNCYILGPG TASWSSAQHA
CTLLEGQLTG IHSKNDMKWI WKFAGKQPFW IGLVGGPEHG WIWTDGKVLS FSRLRKDEED
QTRSVCVLAQ SQKMWIPKGC IDGSEHRYVC SAPAQIS
//