meilisearch

mirror of https://github.com/meilisearch/meilisearch.git synced 2024-12-17 00:46:05 +08:00

Author	SHA1	Message	Date
tamo	4536dfccd0	add a way to provide primary_key or autogenerate documents ids	2021-06-02 11:13:20 +02:00
tamo	06c414a753	move the benchmarks to another crate so we can download the datasets automatically without adding overhead to the build of milli	2021-06-02 11:11:50 +02:00
tamo	3c84075d2d	uses an env variable to find the datasets	2021-06-02 11:05:07 +02:00
tamo	4969abeaab	update the facets for the benchmarks	2021-06-02 11:05:07 +02:00
tamo	e5dfde88fd	fix the facets conditions	2021-06-02 11:05:07 +02:00
tamo	7c7fba4e57	remove the time limitation to let criterion do what it wants	2021-06-02 11:05:07 +02:00
tamo	5d5d115608	reformat all the files	2021-06-02 11:05:07 +02:00
tamo	7086009f93	improve the base search	2021-06-02 11:05:07 +02:00
tamo	d0b44c380f	add benchmarks on a wiki dataset	2021-06-02 11:05:07 +02:00
tamo	beae843766	add a missing space	2021-06-02 11:05:07 +02:00
tamo	5132a106a1	refactorize everything related to the songs dataset in a songs benchmark file	2021-06-02 11:05:07 +02:00
tamo	136efd6b53	fix the benches	2021-06-02 11:05:07 +02:00
tamo	4b78ef31b6	add the configuration of the searchable fields and displayed fields and a default configuration for the songs	2021-06-02 11:05:07 +02:00
tamo	ea0c6d8c40	add a bunch of queries and start the introduction of the filters and the new dataset	2021-06-02 11:05:07 +02:00
tamo	3def42abd8	merge all the criterion only benchmarks in one file	2021-06-02 11:05:07 +02:00
tamo	a2bff68c1a	remove the optional words for the typo criterion	2021-06-02 11:05:07 +02:00
tamo	aee49bb3cd	add the proximity criterion	2021-06-02 11:05:07 +02:00
tamo	49e4cc3daf	add the words criterion to the bench	2021-06-02 11:05:07 +02:00
tamo	15cce89a45	update the README with instructions to get the download the dataset	2021-06-02 11:05:07 +02:00
tamo	e425f70ef9	let criterion decide how much iteration it wants to do in 10s	2021-06-02 11:05:07 +02:00
tamo	4fdbfd6048	push a first version of the benchmark for the typo	2021-06-02 11:05:07 +02:00
Tamo	2d7785ae0c	remove the dump_batch_size option from the CLI	2021-06-01 20:42:06 +02:00
Tamo	d0552e765e	forbid deserialization of Setting<Checked>	2021-06-01 20:41:45 +02:00
bors[bot]	270da98c46	Merge #202 202: Add field id word count docids database r=Kerollmops a=LegendreM This PR introduces a new database, `field_id_word_count_docids`, that maps the number of words in an attribute with a list of document ids. This relation is limited to attributes that contain less than 11 words. This database is used by the exactness criterion to know if a document has an attribute that contains exactly the query without any additional word. Fix #165 Fix #196 Related to [specifications:#36](https://github.com/meilisearch/specifications/pull/36) Co-authored-by: many <maxime@meilisearch.com> Co-authored-by: Many <legendre.maxime.isn@gmail.com>	2021-06-01 16:09:48 +00:00
many	e857ca4d7d	Fix PR comments	2021-06-01 18:06:46 +02:00
Many	ab2cf69e8d	Update milli/src/update/delete_documents.rs Co-authored-by: Clément Renault <clement@meilisearch.com>	2021-06-01 17:04:10 +02:00
Many	8e6d1ff0dc	Update milli/src/update/index_documents/store.rs Co-authored-by: Clément Renault <clement@meilisearch.com>	2021-06-01 17:04:02 +02:00
bors[bot]	168fe0aa28	Merge #206 206: Fix http-ui r=Kerollmops a=irevoire I just noticed that `http-ui` was not compiling on `main`. I'm not sure this is the best fix, but it works 👀 Co-authored-by: Tamo <irevoire@hotmail.fr>	2021-06-01 14:31:32 +00:00
Tamo	608c5bad24	fix http-ui	2021-06-01 16:24:46 +02:00
bors[bot]	7d36d664a7	Merge #203 203: Make the MatchingWords return the number of matching bytes r=Kerollmops a=LegendreM Make the MatchingWords return the number of matching bytes using a custom Levenshtein algorithm. Fix #138 Co-authored-by: many <maxime@meilisearch.com>	2021-06-01 12:00:33 +00:00
many	225ae6fd25	Resolve PR comments	2021-06-01 11:53:09 +02:00
bors[bot]	3a7c1f2469	Merge #191 191: dumps v2 r=irevoire a=MarinPostma Co-authored-by: Marin Postma <postma.marin@protonmail.com> Co-authored-by: marin <postma.marin@protonmail.com>	2021-06-01 09:46:31 +00:00
marin	df6ba0e824	Apply suggestions from code review Co-authored-by: Irevoire <tamo@meilisearch.com>	2021-06-01 11:18:37 +02:00
bors[bot]	2f9f6a1f21	Merge #169 169: Optimize roaring codec r=Kerollmops a=MarinPostma Optimize the `BoRoaringBitmapCodec` by preventing it from emiting useless error that caused allocation. On my flamegraph, the byte_decode function went from 4.13% to 1.70% (of transplant graph). This may not be the greatest optimization ever, but hey, this was a low hanging fruit. before: ![image](https://user-images.githubusercontent.com/28804882/116241125-17018880-a754-11eb-9f9d-a67418d100e1.png) after: ![image](https://user-images.githubusercontent.com/28804882/116241167-21bc1d80-a754-11eb-9afc-d9d72727477c.png) Co-authored-by: Marin Postma <postma.marin@protonmail.com>	2021-06-01 06:30:25 +00:00
Marin Postma	984dc7c1ed	rewrite roaring codec without byteorder.	2021-05-31 22:15:39 +02:00
Marin Postma	1373637da1	optimize roaring codec	2021-05-31 22:15:35 +02:00
Marin Postma	6609f9e3be	review edits	2021-05-31 18:41:37 +02:00
many	1df68d342a	Make the MatchingWords return the number of matching bytes	2021-05-31 18:22:29 +02:00
many	b8e6db0feb	Add database in infos crate	2021-05-31 16:29:27 +02:00
many	c701f8bf36	Use field id word count database in exactness criterion	2021-05-31 16:27:28 +02:00
many	4ddf008be2	add field id word count database	2021-05-31 16:27:28 +02:00
Marin Postma	1c4f0b2ccf	clippy, fmt & tests	2021-05-31 16:03:39 +02:00
Marin Postma	10fc870684	improve dump info reports	2021-05-31 15:49:04 +02:00
bors[bot]	2f5e61bacb	Merge #184 184: Transfer numbers and strings facets into the appropriate facet databases r=Kerollmops a=Kerollmops This pull request is related to https://github.com/meilisearch/milli/issues/152 and changes the layout of the facets values, numbers and strings are now in dedicated databases and the user no more needs to define the type of the fields. No more conversion between the two types is done, numbers (floats and integers converted to f64) go to the facet float database and strings go to the strings facet database. There is one related issue that I found regarding CSVs, the values in a CSV are always considered to be strings, [meilisearch/specifications#28](`d916b57d74/text/0028-indexing-csv.md`) fixes this issue by allowing the user to define the fields types using `:` in the "CSV Formatting Rules" section. All previous tests on facets have been modified to pass again and I have also done hand-driven tests with the 115m songs dataset. Everything seems to be good! Fixes #192. Co-authored-by: Clément Renault <clement@meilisearch.com> Co-authored-by: Kerollmops <clement@meilisearch.com>	2021-05-31 13:32:58 +00:00
Kerollmops	1c0a5cd136	Resolve code modification suggestions	2021-05-31 15:22:50 +02:00
tamo	dffbaca63b	bump sentry version	2021-05-31 13:59:31 +02:00
Marin Postma	b3c8f0e1f6	fix empty index error	2021-05-31 10:58:51 +02:00
Marin Postma	bc5a5e37ea	fix dump v1	2021-05-31 10:42:31 +02:00
Marin Postma	33c6c4f0ee	add timestamos to dump info	2021-05-30 15:55:17 +02:00
Marin Postma	39c16c0fe4	fix dump import	2021-05-30 12:35:17 +02:00

... 110 111 112 113 114 ...

8897 Commits