From 15cce89a45d7032abc1e9e622ce0ce2b200e5273 Mon Sep 17 00:00:00 2001
From: tamo
Date: Tue, 6 Apr 2021 16:06:49 +0200
Subject: [PATCH] update the README with instructions to get the download the dataset

---
 milli/benches/README.md | 11 +++++++++--
 1 file changed, 9 insertions(+), 2 deletions(-)

diff --git a/milli/benches/README.md b/milli/benches/README.md
index c02af0084..9b53fc0d1 100644
--- a/milli/benches/README.md
+++ b/milli/benches/README.md
@@ -3,6 +3,13 @@ Benchmarks
 
 For our benchmark we are using a small subset of the dataset songs.csv. It was generated with this command:
 ```
-xsv sample --seed 42 song.csv -o smol_songs.csv
+xsv sample --seed 42 song.csv -o smol-songs.csv
+```
+You can download it [here](https://meili-datasets.s3.fr-par.scw.cloud/benchmarks/smol-songs.csv.gz)
+And the original `songs.csv` dataset is available [here](https://meili-datasets.s3.fr-par.scw.cloud/songs.csv.gz).
+
+You need to put this file in the current directory: `milli/milli/benches/smol-songs.csv.gz`
+You can run the following command from the root of this git repository
+```
+wget https://meili-datasets.s3.fr-par.scw.cloud/benchmarks/smol-songs.csv.gz -O milli/benches/smol-songs.csv.gz
 ```
-The original songs.csv datasets is available [here](https://meili-datasets.s3.fr-par.scw.cloud/songs.csv.gz)
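
Note on the download step: the `wget` command in the patch re-downloads the file on every run. A minimal sketch of a variant that skips the download when a valid copy is already in place might look like the following. It assumes it is run from the root of the git repository and that `wget` and `gzip` are installed. The URL and destination path are the ones given in the README text; the helper name `fetch_dataset` and the `DEST` override are hypothetical and not part of the repository.

```shell
#!/bin/sh
# Sketch only: fetch the benchmark dataset unless a valid copy already exists.
# DATASET_URL and the default DEST come from the README text in the patch;
# fetch_dataset is a hypothetical helper name, not something the repo provides.
DATASET_URL="https://meili-datasets.s3.fr-par.scw.cloud/benchmarks/smol-songs.csv.gz"
DEST="${DEST:-milli/benches/smol-songs.csv.gz}"

fetch_dataset() {
    # Skip the download when the file exists and passes gzip's integrity test.
    if [ -f "$DEST" ] && gzip -t "$DEST" 2>/dev/null; then
        echo "dataset already present: $DEST"
        return 0
    fi
    # Create the target directory if needed, download, then verify the archive.
    mkdir -p "$(dirname "$DEST")"
    wget "$DATASET_URL" -O "$DEST" && gzip -t "$DEST"
}
```

Calling `fetch_dataset` from the repository root then either reports the existing file or downloads it to `milli/benches/smol-songs.csv.gz`, the same path used by the `wget` command in the README.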