meilisearch

mirror of https://github.com/meilisearch/meilisearch.git synced 2025-01-18 08:48:32 +08:00

Author	SHA1	Message	Date
bors[bot]	65e6aa0de2	Merge #523 523: Improve geosearch error messages r=irevoire a=irevoire Improve the geosearch error messages (#488). And try to parse the string as specified in https://github.com/meilisearch/meilisearch/issues/2354 Co-authored-by: Tamo <tamo@meilisearch.com>	2022-05-04 13:36:11 +00:00
bors[bot]	f3b9f7b867	Merge #527 527: Remove the wip section part of the contributing file r=curquiza a=Kerollmops Everything was good in the _Development Workflow_ section so I removed the _WIP Section_ part, now this PR fixes https://github.com/meilisearch/milli/issues/513. Co-authored-by: Kerollmops <clement@meilisearch.com>	2022-05-04 13:11:30 +00:00
Kerollmops	48cdfddebf	Remove the wip section part of the contributing file	2022-05-04 14:44:51 +02:00
Tamo	c55368ddd4	apply code suggestion Co-authored-by: Kerollmops <kero@meilisearch.com>	2022-05-04 14:11:03 +02:00
bors[bot]	60ccb3fa4c	Merge #524 524: Add benchmark on nested fields r=irevoire a=irevoire fixes #500 Co-authored-by: Tamo <tamo@meilisearch.com>	2022-05-04 11:56:18 +00:00
bors[bot]	0c2c8af44e	Merge #520 520: fix mistake in Settings initialization r=irevoire a=MarinPostma fix settings not being correctly initialized and add a test to make sure that they are in the future. fix https://github.com/meilisearch/meilisearch/issues/2358 Co-authored-by: ad hoc <postma.marin@protonmail.com>	2022-05-03 15:32:18 +00:00
bors[bot]	2fe9a02b1c	Merge #522 522: Do not generate keys that are too long for LMDB r=Kerollmops a=Kerollmops This PR fixes https://github.com/meilisearch/meilisearch/issues/2338 by making sure that we do not generate keys that are too long for LMDB especially when we are creating our prefix and proximity pairs keys. Co-authored-by: Kerollmops <clement@meilisearch.com>	2022-05-03 11:54:10 +00:00
Kerollmops	211c8763b9	Make sure that we do not generate too long keys	2022-05-03 10:03:15 +02:00
Kerollmops	7e47031bdc	Add a test for long keys in LMDB	2022-05-03 10:03:13 +02:00
Tamo	f820c9804d	add one nested benchmark	2022-05-02 19:35:57 +02:00
Tamo	3cb1f6d0a1	improve geosearch error messages	2022-05-02 19:20:47 +02:00
ad hoc	1ee3d6ae33	fix mistake in Settings initialization	2022-04-29 16:24:25 +02:00
bors[bot]	312515dd6b	Merge #507 507: deny warnings in CI r=Kerollmops a=MarinPostma Add `RUSTFLAGS= -D warnings` to the CI so all warnings are treated as hard errors. Co-authored-by: ad hoc <postma.marin@protonmail.com>	2022-04-28 15:16:35 +00:00
ad hoc	3eb3f0269e	deny warnings in CI	2022-04-28 15:35:12 +02:00
bors[bot]	9db86aac51	Merge #518 518: Return facets even when there is no value associated to it r=Kerollmops a=Kerollmops This PR is related to https://github.com/meilisearch/meilisearch/issues/2352 and should fix the issue when Meilisearch is up-to-date with this PR. Co-authored-by: Kerollmops <clement@meilisearch.com>	2022-04-28 09:04:36 +00:00
bors[bot]	2aae19dc52	Merge #517 517: Make nightly CI run every week r=Kerollmops a=curquiza Co-authored-by: Clémentine Urquizar <clementine@meilisearch.com>	2022-04-26 16:22:25 +00:00
Kerollmops	a4d343aade	Add a test to check for the returned facet distribution	2022-04-26 18:12:58 +02:00
bors[bot]	c2bd94c871	Merge #511 511: Update version in every workspace r=curquiza a=curquiza Checked with `@Kerollmops` - Update the version into every workspace (the current version is v0.27.0, but I forgot to update it for the previous release) - add `publish = false` except in `milli` workspace. Co-authored-by: Clémentine Urquizar <clementine@meilisearch.com>	2022-04-26 16:06:47 +00:00
Kerollmops	7d1c2d97bf	Return facets even when there is no values associated to it	2022-04-26 17:59:53 +02:00
bors[bot]	d388ea0f9d	Merge #506 506: fix cargo warnings r=Kerollmops a=MarinPostma fix cargo warnings Co-authored-by: ad hoc <postma.marin@protonmail.com>	2022-04-26 15:45:20 +00:00
Clémentine Urquizar	ec89030483	Update bors toml	2022-04-26 17:36:04 +02:00
ad hoc	5c29258e8e	fix cargo warnings	2022-04-26 17:33:11 +02:00
bors[bot]	2fdf520271	Merge #514 514: Stop flattening every field r=Kerollmops a=irevoire When we need to flatten a document: * The primary key contains a `.`. * Some fields need to be flattened Instead of flattening the whole object and thus creating a lot of allocations with the `serde_json_flatten_crate`, we instead generate a minimal sub-object containing only the fields that need to be flattened. That should create fewer allocations and thus index faster. --------- ``` group indexing_main_e1e362fa indexing_stop-flattening-every-field_40d1bd6b ----- ---------------------- --------------------------------------------- indexing/Indexing geo_point 1.99 23.7±0.23s ? ?/sec 1.00 11.9±0.21s ? ?/sec indexing/Indexing movies in three batches 1.00 18.2±0.24s ? ?/sec 1.01 18.3±0.29s ? ?/sec indexing/Indexing movies with default settings 1.00 17.5±0.09s ? ?/sec 1.01 17.7±0.26s ? ?/sec indexing/Indexing songs in three batches with default settings 1.00 64.8±0.47s ? ?/sec 1.00 65.1±0.49s ? ?/sec indexing/Indexing songs with default settings 1.00 54.9±0.99s ? ?/sec 1.01 55.7±1.34s ? ?/sec indexing/Indexing songs without any facets 1.00 50.6±0.62s ? ?/sec 1.01 50.9±1.05s ? ?/sec indexing/Indexing songs without faceted numbers 1.00 54.0±1.14s ? ?/sec 1.01 54.7±1.13s ? ?/sec indexing/Indexing wiki 1.00 996.2±8.54s ? ?/sec 1.02 1021.1±30.63s ? ?/sec indexing/Indexing wiki in three batches 1.00 1136.8±9.72s ? ?/sec 1.00 1138.6±6.59s ? ?/sec ``` So basically everything slowed down a liiiiiittle bit except the dataset with a nested field which got twice faster Co-authored-by: Tamo <tamo@meilisearch.com>	2022-04-26 11:50:33 +00:00
Tamo	f19d2dc548	Only flatten the required fields apply review comments Co-authored-by: Kerollmops <kero@meilisearch.com>	2022-04-26 12:33:46 +02:00
bors[bot]	5adeac8047	Merge #516 516: Fix the indexing fuzzer r=irevoire a=irevoire Co-authored-by: Tamo <tamo@meilisearch.com>	2022-04-26 08:35:03 +00:00
Clémentine Urquizar	7cb7643565	Make nightly CI run every week Update CI Fix CI	2022-04-25 18:52:27 +02:00
Clémentine Urquizar	d138b3c704	Update version	2022-04-25 18:43:46 +02:00
Tamo	fa6f495662	fix the indexing fuzzer	2022-04-25 18:32:06 +02:00
bors[bot]	8cc86d5a8d	Merge #515 515: Improve the README r=curquiza a=Kerollmops This PR closes #512 by adding more content to the README. We listed all of the subcrates of the repository, changed the descriptions of the subcrates, and added a simple example usage in the README. Co-authored-by: Kerollmops <clement@meilisearch.com> Co-authored-by: Clémentine Urquizar - curqui <clementine@meilisearch.com>	2022-04-25 16:15:12 +00:00
Clémentine Urquizar - curqui	5e562ffecf	Update README.md	2022-04-25 18:14:43 +02:00
Clémentine Urquizar - curqui	2277172f9c	Update README.md	2022-04-25 18:14:39 +02:00
Clémentine Urquizar - curqui	2db3d60259	Update README.md	2022-04-25 18:14:35 +02:00
Kerollmops	7e19bf1c0e	Add an example usage of the library in the README	2022-04-25 17:25:46 +02:00
Kerollmops	fb192aaa9f	Update the list of milli's subcrates	2022-04-25 15:55:38 +02:00
bors[bot]	e1e362fa43	Merge #509 509: Remove pr_status from bors settings r=Kerollmops a=curquiza Because of multiple issue we had with bors. https://github.com/bors-ng/bors-ng/issues/1492 Co-authored-by: Clémentine Urquizar <clementine@meilisearch.com>	2022-04-25 11:45:37 +00:00
Clémentine Urquizar	08753d002a	Remove pr_status from bors settings	2022-04-25 13:39:45 +02:00
Clément Renault	8d15ae37a1	Merge pull request #503 from meilisearch/improve-flatten-fuzzer Improve the fuzzer of the flatten crate	2022-04-25 13:38:43 +02:00
Clément Renault	3e53791de3	Merge pull request #508 from meilisearch/contributing First version of new CONTRIBUTING.md	2022-04-25 13:36:41 +02:00
bors[bot]	8010eca9c7	Merge #505 505: normalize exact words r=curquiza a=MarinPostma Normalize the exact words, as specified in the specification. Co-authored-by: ad hoc <postma.marin@protonmail.com>	2022-04-25 09:35:32 +00:00
Clémentine Urquizar	dc0d4addd9	First version of new CONTRIBUTING.md	2022-04-21 19:02:22 +02:00
Clément Renault	71414630fc	Merge pull request #504 from meilisearch/test-long-words Add a test to make sure that long words are handled	2022-04-21 16:06:13 +02:00
ad hoc	2e0089d5ff	normalize exact words	2022-04-21 15:38:40 +02:00
ad hoc	3a2451fcba	add test normalize exact words	2022-04-21 13:52:09 +02:00
Clément Renault	eb5830aa40	Add a test to make sure that long words are handled	2022-04-21 13:45:28 +02:00
Tamo	d81a3f4a74	improve the fuzzer of the flatten crate	2022-04-20 16:11:23 +02:00
bors[bot]	c7d0097c97	Merge #498 498: Get rid of the threshold when comparing benchmarks r=curquiza a=irevoire It just hides things Co-authored-by: Tamo <tamo@meilisearch.com>	2022-04-19 14:04:11 +00:00
Tamo	152a10344c	Get rid of the threshold when comparing benchmarks It just hide things	2022-04-19 15:39:58 +02:00
bors[bot]	04eb32e539	Merge #499 499: fix min-word-len-for-typo not reset properly r=Kerollmops a=MarinPostma fix min word len for typo not resettign properly, as reported in https://github.com/meilisearch/meilisearch/issues/2330 Co-authored-by: ad hoc <postma.marin@protonmail.com>	2022-04-19 13:22:19 +00:00
ad hoc	8b14090927	fix min-word-len-for-typo not reset properly	2022-04-19 15:20:16 +02:00
bors[bot]	ea4bb9402f	Merge #483 483: Enhance matching words r=Kerollmops a=ManyTheFish # Summary Enhance milli word-matcher making it handle match computing and cropping. # Implementation ## Computing best matches for cropping Before we were considering that the first match of the attribute was the best one, this was accurate when only one word was searched but was missing the target when more than one word was searched. Now we are searching for the best matches interval to crop around, the chosen interval is the one: 1) that have the highest count of unique matches > for example, if we have a query `split the world`, then the interval `the split the split the` has 5 matches but only 2 unique matches (1 for `split` and 1 for `the`) where the interval `split of the world` has 3 matches and 3 unique matches. So the interval `split of the world` is considered better. 2) that have the minimum distance between matches > for example, if we have a query `split the world`, then the interval `split of the world` has a distance of 3 (2 between `split` and `the`, and 1 between `the` and `world`) where the interval `split the world` has a distance of 2. So the interval `split the world` is considered better. 3) that have the highest count of ordered matches > for example, if we have a query `split the world`, then the interval `the world split` has 2 ordered words where the interval `split the world` has 3. So the interval `split the world` is considered better. ## Cropping around the best matches interval Before we were cropping around the interval without checking the context. Now we are cropping around words in the same context as matching words. This means that we will keep words that are farther from the matching words but are in the same phrase, than words that are nearer but separated by a dot. > For instance, for the matching word `Split` the text: `Natalie risk her future. Split The World is a book written by Emily Henry. I never read it.` will be cropped like: `…. Split The World is a book written by Emily Henry. …` and not like: `Natalie risk her future. Split The World is a book …` Co-authored-by: ManyTheFish <many@meilisearch.com>	2022-04-19 11:42:32 +00:00

1 2 3 4 5 ...

1737 Commits