meilisearch

mirror of https://github.com/meilisearch/meilisearch.git synced 2025-03-06 22:02:34 +08:00

Author	SHA1	Message	Date
ManyTheFish	967033579d	Refactor search and facet-search Changes: The search filters are now using the FilterableAttributesFeatures from the FilterableAttributesRules to know if a field is filterable. Moreover, the FilterableAttributesFeatures is more precise and an error will be returned if an operator is used on a field that doesn't have the related feature. The facet-search is now checking if the feature is allowed in the FilterableAttributesFeatures and an error will be returned if the field doesn't have the related feature. Impact: - facet-search is now relying on AttributePatterns to match the locales - search using filters is now relying on FilterableAttributesFeatures - distinct attribute is now relying on FilterableAttributesRules	2025-03-03 10:25:32 +01:00
ManyTheFish	0200c65ebf	Change the filterableAttributes setting API Changes: The filterableAttributes type has been changed from a `BTreeSet<String>` to a `Vec<FilterableAttributesRule>`, Which is a list of rules defining patterns to match the documents' fields and a set of feature to apply on the matching fields. The rule order given by the user is now an important information, the features applied on a filterable field will be chosen based on the rule order as we do for the LocalizedAttributesRules. This means that the list will not be reordered anymore and will keep the user defined order, moreover, if there are any duplicates, they will not be de-duplicated anymore. Impact: - Settings API - the database format of the filterable attributes changed - may impact the LocalizedAttributesRules due to the AttributePatterns factorization - OpenAPI generator	2025-03-03 10:22:02 +01:00
meili-bors[bot]	c63c25a9a2	Merge #5355 Some checks failed Look for flaky tests / flaky (push) Failing after 1s Details Indexing bench (push) / Run and upload benchmarks (push) Has been cancelled Details Benchmarks of indexing (push) / Run and upload benchmarks (push) Has been cancelled Details Benchmarks of search for geo (push) / Run and upload benchmarks (push) Has been cancelled Details Benchmarks of search for songs (push) / Run and upload benchmarks (push) Has been cancelled Details Benchmarks of search for Wikipedia articles (push) / Run and upload benchmarks (push) Has been cancelled Details Publish binaries to GitHub release / Check the version validity (push) Failing after 5s Details Test suite / Tests almost all features (push) Failing after 13s Details Test suite / Tests on ubuntu-22.04 (push) Failing after 19s Details Test suite / Test with Ollama (push) Failing after 7s Details Test suite / Test disabled tokenization (push) Failing after 10s Details Test suite / Run tests in debug (push) Failing after 15s Details Test suite / Run Rustfmt (push) Failing after 16s Details Test suite / Run Clippy (push) Successful in 9m39s Details SDKs tests / define-docker-image (push) Failing after 5s Details SDKs tests / .NET SDK tests (push) Has been skipped Details SDKs tests / Dart SDK tests (push) Has been skipped Details SDKs tests / Go SDK tests (push) Has been skipped Details SDKs tests / Java SDK tests (push) Has been skipped Details SDKs tests / JS SDK tests (push) Has been skipped Details SDKs tests / PHP SDK tests (push) Has been skipped Details SDKs tests / Python SDK tests (push) Has been skipped Details SDKs tests / Ruby SDK tests (push) Has been skipped Details SDKs tests / Rust SDK tests (push) Has been skipped Details SDKs tests / Swift SDK tests (push) Has been skipped Details SDKs tests / meilisearch-js-plugins tests (push) Has been skipped Details SDKs tests / meilisearch-rails tests (push) Has been skipped Details SDKs tests / meilisearch-symfony tests (push) Has been skipped Details Test suite / Tests on macos-13 (push) Has been cancelled Details Test suite / Tests on windows-2022 (push) Has been cancelled Details 5355: Support fetching the pooling method from the model configuration r=Kerollmops a=dureuill # Pull Request ## Related issue Fixes #5354 ## What does this PR do? - Fetches the pooling configuration from the model repository - Use a pooling method that depends on the pooling configuration of that model. - Allow overriding the pooling method with a new huggingFace embedder parameter `pooling` - for backward-compatibility with Meilisearch v1.13 - for compatibility with embedders that exhibit the same behavior as Meilisearch v1.13 - Handle the default value of that new parameter - for compatibility, when importing a db/a dump, it should be set to `forceMean` - when (re)set from the settings for an embedder, it should be set to `useModel` Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2025-02-27 14:55:13 +00:00
Kerollmops	dc78d8e9c4	Fix the dumpless upgrade log	2025-02-26 17:02:46 +01:00
ManyTheFish	d4063c9dcd	Fix fmt	2025-02-26 17:02:45 +01:00
Many the fish	abebc574f6	Update crates/milli/src/index.rs Co-authored-by: Tamo <tamo@meilisearch.com>	2025-02-26 17:02:45 +01:00
Many the fish	f32ab67819	Update crates/milli/src/index.rs Co-authored-by: Tamo <tamo@meilisearch.com>	2025-02-26 17:02:44 +01:00
ManyTheFish	d25953f322	fix clippy	2025-02-26 17:02:43 +01:00
ManyTheFish	405bbd04c1	Dumpless upgrade	2025-02-26 17:01:38 +01:00
ManyTheFish	5d421abdc4	Update Snapshots	2025-02-26 17:01:37 +01:00
ManyTheFish	9f3663e768	Implement Incremental document database stats computing	2025-02-26 17:01:35 +01:00
ManyTheFish	d9642ec916	Use checked_div in average computation	2025-02-26 17:01:34 +01:00
ManyTheFish	818e8b0237	Fix zero division	2025-02-26 17:01:31 +01:00
ManyTheFish	4f77a7fba5	fix clippy	2025-02-26 17:01:29 +01:00
ManyTheFish	058f08dff5	fix snapshots	2025-02-26 17:01:26 +01:00
ManyTheFish	9a6c1730aa	Add document database stats	2025-02-26 17:01:25 +01:00
Strift	91a8a97045	Bump	2025-02-26 17:01:24 +01:00
ManyTheFish	15788773af	Check the exact_word database when computing zero typo query	2025-02-26 17:01:22 +01:00
Kerollmops	025b9b79bb	Update the snapshots	2025-02-26 17:01:21 +01:00
Kerollmops	dfce20be21	Rename callTrace into progressTrace	2025-02-25 10:09:03 +01:00
Kerollmops	76fd5d92d7	Clarify the tail writing to database	2025-02-20 17:35:23 +01:00
Kerollmops	245a55722a	Remove commented code	2025-02-20 16:48:18 +01:00
Kerollmops	434fad5327	Fix insta tests again	2025-02-20 16:41:48 +01:00
Kerollmops	243a5fa6a8	Log the call trace and congestion	2025-02-20 14:17:34 +01:00
Kerollmops	9d314ace09	Fix the insta tests	2025-02-20 11:51:58 +01:00
Kerollmops	1b1172ad16	Fix dump tests	2025-02-20 10:44:53 +01:00
Kerollmops	1d99c8465c	Hide the batch stats to make insta pass	2025-02-20 10:16:54 +01:00
Kerollmops	05cc8c650c	Expose the write channel congestion in the batches	2025-02-19 15:47:54 +01:00
Louis Dureuil	589bf30ec6	make clippy happy	2025-02-19 11:38:07 +01:00
Louis Dureuil	b367c71ad2	fixup test	2025-02-19 11:31:17 +01:00
Kerollmops	3ff1de0a21	Expose the call trace in the batch stats	2025-02-19 11:24:11 +01:00
Louis Dureuil	1005a60fb8	Fixup dump settings	2025-02-19 11:03:48 +01:00
Kerollmops	e9add14189	Reorder steps	2025-02-18 19:26:41 +01:00
Kerollmops	4a058a080e	Simplify the name generation	2025-02-18 18:48:44 +01:00
Kerollmops	11a11fc870	Accumulate step durations from the progress system	2025-02-18 18:33:19 +01:00
Louis Dureuil	cd0dfa3f1b	Fix test cases	2025-02-18 17:21:52 +01:00
Louis Dureuil	7b4ce468a6	Allow overriding pooling method	2025-02-18 17:12:23 +01:00
Louis Dureuil	11759c4be4	Support pooling	2025-02-18 16:10:51 +01:00
meili-bors[bot]	0f1aeb8eaa	Merge #5351 Some checks failed Look for flaky tests / flaky (push) Failing after 19s Details SDKs tests / define-docker-image (push) Failing after 5s Details SDKs tests / .NET SDK tests (push) Has been skipped Details SDKs tests / Dart SDK tests (push) Has been skipped Details SDKs tests / Go SDK tests (push) Has been skipped Details SDKs tests / Java SDK tests (push) Has been skipped Details SDKs tests / JS SDK tests (push) Has been skipped Details SDKs tests / PHP SDK tests (push) Has been skipped Details SDKs tests / Python SDK tests (push) Has been skipped Details SDKs tests / Ruby SDK tests (push) Has been skipped Details SDKs tests / Rust SDK tests (push) Has been skipped Details SDKs tests / Swift SDK tests (push) Has been skipped Details SDKs tests / meilisearch-js-plugins tests (push) Has been skipped Details SDKs tests / meilisearch-rails tests (push) Has been skipped Details SDKs tests / meilisearch-symfony tests (push) Has been skipped Details Publish binaries to GitHub release / Check the version validity (push) Successful in 9s Details Publish binaries to GitHub release / Publish binary for aarch64 (meilisearch-linux-aarch64, aarch64-unknown-linux-gnu) (push) Failing after 2s Details Publish binaries to GitHub release / Publish binary for Linux (push) Failing after 12s Details Publish binaries to GitHub release / Publish binary for macos-13 (push) Has been cancelled Details Publish binaries to GitHub release / Publish binary for windows-2022 (push) Has been cancelled Details Publish binaries to GitHub release / Publish binary for macOS silicon (meilisearch-macos-apple-silicon, aarch64-apple-darwin) (push) Has been cancelled Details Test suite / Tests on ubuntu-20.04 (push) Failing after 12s Details Test suite / Test with Ollama (push) Failing after 7s Details Test suite / Test disabled tokenization (push) Failing after 11s Details Test suite / Run tests in debug (push) Failing after 11s Details Test suite / Run Clippy (push) Failing after 17s Details Test suite / Run Rustfmt (push) Successful in 1m51s Details Test suite / Tests almost all features (push) Failing after 7m7s Details Test suite / Tests on macos-13 (push) Has been cancelled Details Test suite / Tests on windows-2022 (push) Has been cancelled Details 5351: Bring back v1.13.0 changes into main r=irevoire a=Kerollmops This PR brings back the changes made in v1.13 into the main branch. Co-authored-by: ManyTheFish <many@meilisearch.com> Co-authored-by: Kerollmops <clement@meilisearch.com> Co-authored-by: Louis Dureuil <louis@meilisearch.com> Co-authored-by: Clémentine <clementine@meilisearch.com> Co-authored-by: meili-bors[bot] <89034592+meili-bors[bot]@users.noreply.github.com> Co-authored-by: Tamo <tamo@meilisearch.com> Co-authored-by: Clément Renault <clement@meilisearch.com>	2025-02-18 08:05:02 +00:00
meili-bors[bot]	885710a07b	Merge #5341 5341: Embeddings stats r=ManyTheFish a=ManyTheFish # Pull Request ## Related issue Fixes #5321 ## What does this PR do? - Add embedding stats - force dumpless upgrade to recompute stats - add tests Co-authored-by: ManyTheFish <many@meilisearch.com>	2025-02-12 15:46:37 +00:00
ManyTheFish	c55fdad2c3	Fix dumpless upgrade target version	2025-02-12 16:35:05 +01:00
ManyTheFish	1caad4c4b0	Add multiple embeddings for the same embedder in tests	2025-02-12 16:13:34 +01:00
ManyTheFish	8419ed52a1	fix clippy	2025-02-12 14:38:51 +01:00
ManyTheFish	a65c52cc97	Convert dump test into snapshots	2025-02-12 14:14:10 +01:00
ManyTheFish	49e9655c24	Update snapshots	2025-02-12 14:05:32 +01:00
meili-bors[bot]	fa763ca5dc	Merge #5339 5339: Add back timeout from v1.11.3 r=Kerollmops a=dureuill # Pull Request ## Related issue Fixes #5337 ## What does this PR do? - Fix regression compared with v1.11 by reintroducing the 30s timeout on all REST API calls. Thanks to `@migueltarga` for reporting the issue Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2025-02-12 12:50:27 +00:00
ManyTheFish	c7aeb554b2	Add tests	2025-02-12 13:37:41 +01:00
Louis Dureuil	8e0d8d31f9	Add back timeout from v1.11.3	2025-02-12 11:53:00 +01:00
meili-bors[bot]	81a38099ec	Merge #5336 5336: Meilitool Hair Dryer r=dureuill a=Kerollmops This pull request introduces a new subcommand to hair dry a specific part of specific indexes. It is useful when [the memory-mapped pages are not hot in the cache](https://arc.net/l/quote/ixhcdwcq) and must be. Hair drying those interesting pages makes the search requests using the vector store much faster. The previous technique used the "cat method," which consists of reading the whole LMDB data file and pipping it into the null file descriptor. By doing that, the whole LMDB data file becomes hot in the cache. However, when the database is large, at least 30% of it is free, and unused pages and many other pages don't need to be hot, e.g., raw JSON documents or uninteresting parts of the inverted index. This new subcommand reads all the Arroy pages of a given index to make them hot, and only those. More coming... The current algorithm is single-threaded and takes a lot of time. I am in the process of multithreading it. This is the time it takes to hair dry a 305GiB database with a single thread. ``` real 21m51.054s user 0m3.155s sys 0m19.393s ``` ## To Do - [ ] (optional) Do the reads in parallel. Co-authored-by: Kerollmops <clement@meilisearch.com>	2025-02-12 10:45:16 +00:00
ManyTheFish	bd27fe7d02	force dumpless upgrade to recompute stats	2025-02-12 11:45:02 +01:00

1 2 3 4 5 ...

749 Commits