meilisearch

mirror of https://github.com/meilisearch/meilisearch.git synced 2025-03-07 06:04:05 +08:00

Author	SHA1	Message	Date
ManyTheFish	967033579d	Refactor search and facet-search Changes: The search filters are now using the FilterableAttributesFeatures from the FilterableAttributesRules to know if a field is filterable. Moreover, the FilterableAttributesFeatures is more precise and an error will be returned if an operator is used on a field that doesn't have the related feature. The facet-search is now checking if the feature is allowed in the FilterableAttributesFeatures and an error will be returned if the field doesn't have the related feature. Impact: - facet-search is now relying on AttributePatterns to match the locales - search using filters is now relying on FilterableAttributesFeatures - distinct attribute is now relying on FilterableAttributesRules	2025-03-03 10:25:32 +01:00
ManyTheFish	0200c65ebf	Change the filterableAttributes setting API Changes: The filterableAttributes type has been changed from a `BTreeSet<String>` to a `Vec<FilterableAttributesRule>`, Which is a list of rules defining patterns to match the documents' fields and a set of feature to apply on the matching fields. The rule order given by the user is now an important information, the features applied on a filterable field will be chosen based on the rule order as we do for the LocalizedAttributesRules. This means that the list will not be reordered anymore and will keep the user defined order, moreover, if there are any duplicates, they will not be de-duplicated anymore. Impact: - Settings API - the database format of the filterable attributes changed - may impact the LocalizedAttributesRules due to the AttributePatterns factorization - OpenAPI generator	2025-03-03 10:22:02 +01:00
meili-bors[bot]	c63c25a9a2	Merge #5355 Some checks failed Look for flaky tests / flaky (push) Failing after 1s Details Indexing bench (push) / Run and upload benchmarks (push) Has been cancelled Details Benchmarks of indexing (push) / Run and upload benchmarks (push) Has been cancelled Details Benchmarks of search for geo (push) / Run and upload benchmarks (push) Has been cancelled Details Benchmarks of search for songs (push) / Run and upload benchmarks (push) Has been cancelled Details Benchmarks of search for Wikipedia articles (push) / Run and upload benchmarks (push) Has been cancelled Details Publish binaries to GitHub release / Check the version validity (push) Failing after 5s Details Test suite / Tests almost all features (push) Failing after 13s Details Test suite / Tests on ubuntu-22.04 (push) Failing after 19s Details Test suite / Test with Ollama (push) Failing after 7s Details Test suite / Test disabled tokenization (push) Failing after 10s Details Test suite / Run tests in debug (push) Failing after 15s Details Test suite / Run Rustfmt (push) Failing after 16s Details Test suite / Run Clippy (push) Successful in 9m39s Details SDKs tests / define-docker-image (push) Failing after 5s Details SDKs tests / .NET SDK tests (push) Has been skipped Details SDKs tests / Dart SDK tests (push) Has been skipped Details SDKs tests / Go SDK tests (push) Has been skipped Details SDKs tests / Java SDK tests (push) Has been skipped Details SDKs tests / JS SDK tests (push) Has been skipped Details SDKs tests / PHP SDK tests (push) Has been skipped Details SDKs tests / Python SDK tests (push) Has been skipped Details SDKs tests / Ruby SDK tests (push) Has been skipped Details SDKs tests / Rust SDK tests (push) Has been skipped Details SDKs tests / Swift SDK tests (push) Has been skipped Details SDKs tests / meilisearch-js-plugins tests (push) Has been skipped Details SDKs tests / meilisearch-rails tests (push) Has been skipped Details SDKs tests / meilisearch-symfony tests (push) Has been skipped Details Test suite / Tests on macos-13 (push) Has been cancelled Details Test suite / Tests on windows-2022 (push) Has been cancelled Details 5355: Support fetching the pooling method from the model configuration r=Kerollmops a=dureuill # Pull Request ## Related issue Fixes #5354 ## What does this PR do? - Fetches the pooling configuration from the model repository - Use a pooling method that depends on the pooling configuration of that model. - Allow overriding the pooling method with a new huggingFace embedder parameter `pooling` - for backward-compatibility with Meilisearch v1.13 - for compatibility with embedders that exhibit the same behavior as Meilisearch v1.13 - Handle the default value of that new parameter - for compatibility, when importing a db/a dump, it should be set to `forceMean` - when (re)set from the settings for an embedder, it should be set to `useModel` Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2025-02-27 14:55:13 +00:00
ManyTheFish	d4063c9dcd	Fix fmt	2025-02-26 17:02:45 +01:00
Many the fish	abebc574f6	Update crates/milli/src/index.rs Co-authored-by: Tamo <tamo@meilisearch.com>	2025-02-26 17:02:45 +01:00
Many the fish	f32ab67819	Update crates/milli/src/index.rs Co-authored-by: Tamo <tamo@meilisearch.com>	2025-02-26 17:02:44 +01:00
ManyTheFish	d25953f322	fix clippy	2025-02-26 17:02:43 +01:00
ManyTheFish	405bbd04c1	Dumpless upgrade	2025-02-26 17:01:38 +01:00
ManyTheFish	9f3663e768	Implement Incremental document database stats computing	2025-02-26 17:01:35 +01:00
ManyTheFish	d9642ec916	Use checked_div in average computation	2025-02-26 17:01:34 +01:00
ManyTheFish	818e8b0237	Fix zero division	2025-02-26 17:01:31 +01:00
ManyTheFish	4f77a7fba5	fix clippy	2025-02-26 17:01:29 +01:00
ManyTheFish	9a6c1730aa	Add document database stats	2025-02-26 17:01:25 +01:00
ManyTheFish	15788773af	Check the exact_word database when computing zero typo query	2025-02-26 17:01:22 +01:00
Kerollmops	76fd5d92d7	Clarify the tail writing to database	2025-02-20 17:35:23 +01:00
Kerollmops	245a55722a	Remove commented code	2025-02-20 16:48:18 +01:00
Kerollmops	05cc8c650c	Expose the write channel congestion in the batches	2025-02-19 15:47:54 +01:00
Kerollmops	e9add14189	Reorder steps	2025-02-18 19:26:41 +01:00
Kerollmops	4a058a080e	Simplify the name generation	2025-02-18 18:48:44 +01:00
Kerollmops	11a11fc870	Accumulate step durations from the progress system	2025-02-18 18:33:19 +01:00
Louis Dureuil	7b4ce468a6	Allow overriding pooling method	2025-02-18 17:12:23 +01:00
Louis Dureuil	11759c4be4	Support pooling	2025-02-18 16:10:51 +01:00
meili-bors[bot]	0f1aeb8eaa	Merge #5351 Some checks failed Look for flaky tests / flaky (push) Failing after 19s Details SDKs tests / define-docker-image (push) Failing after 5s Details SDKs tests / .NET SDK tests (push) Has been skipped Details SDKs tests / Dart SDK tests (push) Has been skipped Details SDKs tests / Go SDK tests (push) Has been skipped Details SDKs tests / Java SDK tests (push) Has been skipped Details SDKs tests / JS SDK tests (push) Has been skipped Details SDKs tests / PHP SDK tests (push) Has been skipped Details SDKs tests / Python SDK tests (push) Has been skipped Details SDKs tests / Ruby SDK tests (push) Has been skipped Details SDKs tests / Rust SDK tests (push) Has been skipped Details SDKs tests / Swift SDK tests (push) Has been skipped Details SDKs tests / meilisearch-js-plugins tests (push) Has been skipped Details SDKs tests / meilisearch-rails tests (push) Has been skipped Details SDKs tests / meilisearch-symfony tests (push) Has been skipped Details Publish binaries to GitHub release / Check the version validity (push) Successful in 9s Details Publish binaries to GitHub release / Publish binary for aarch64 (meilisearch-linux-aarch64, aarch64-unknown-linux-gnu) (push) Failing after 2s Details Publish binaries to GitHub release / Publish binary for Linux (push) Failing after 12s Details Publish binaries to GitHub release / Publish binary for macos-13 (push) Has been cancelled Details Publish binaries to GitHub release / Publish binary for windows-2022 (push) Has been cancelled Details Publish binaries to GitHub release / Publish binary for macOS silicon (meilisearch-macos-apple-silicon, aarch64-apple-darwin) (push) Has been cancelled Details Test suite / Tests on ubuntu-20.04 (push) Failing after 12s Details Test suite / Test with Ollama (push) Failing after 7s Details Test suite / Test disabled tokenization (push) Failing after 11s Details Test suite / Run tests in debug (push) Failing after 11s Details Test suite / Run Clippy (push) Failing after 17s Details Test suite / Run Rustfmt (push) Successful in 1m51s Details Test suite / Tests almost all features (push) Failing after 7m7s Details Test suite / Tests on macos-13 (push) Has been cancelled Details Test suite / Tests on windows-2022 (push) Has been cancelled Details 5351: Bring back v1.13.0 changes into main r=irevoire a=Kerollmops This PR brings back the changes made in v1.13 into the main branch. Co-authored-by: ManyTheFish <many@meilisearch.com> Co-authored-by: Kerollmops <clement@meilisearch.com> Co-authored-by: Louis Dureuil <louis@meilisearch.com> Co-authored-by: Clémentine <clementine@meilisearch.com> Co-authored-by: meili-bors[bot] <89034592+meili-bors[bot]@users.noreply.github.com> Co-authored-by: Tamo <tamo@meilisearch.com> Co-authored-by: Clément Renault <clement@meilisearch.com>	2025-02-18 08:05:02 +00:00
meili-bors[bot]	885710a07b	Merge #5341 5341: Embeddings stats r=ManyTheFish a=ManyTheFish # Pull Request ## Related issue Fixes #5321 ## What does this PR do? - Add embedding stats - force dumpless upgrade to recompute stats - add tests Co-authored-by: ManyTheFish <many@meilisearch.com>	2025-02-12 15:46:37 +00:00
ManyTheFish	c55fdad2c3	Fix dumpless upgrade target version	2025-02-12 16:35:05 +01:00
ManyTheFish	8419ed52a1	fix clippy	2025-02-12 14:38:51 +01:00
Louis Dureuil	8e0d8d31f9	Add back timeout from v1.11.3	2025-02-12 11:53:00 +01:00
ManyTheFish	bd27fe7d02	force dumpless upgrade to recompute stats	2025-02-12 11:45:02 +01:00
ManyTheFish	41203f0931	Add embedders stats	2025-02-12 11:37:47 +01:00
Louis Dureuil	b83275c9c5	Change the `updated*` functions to `only_new` functions, hopefully better communicating what they do	2025-02-11 15:27:10 +01:00
Louis Dureuil	d7f35ee3ba	Use merged document instead of updated	2025-02-11 15:27:10 +01:00
meili-bors[bot]	0c3e7fe963	Merge #5316 Some checks failed Test suite / Tests on ubuntu-20.04 (push) Failing after 2s Details Test suite / Tests almost all features (push) Has been skipped Details Test suite / Test disabled tokenization (push) Has been skipped Details Test suite / Run tests in debug (push) Failing after 16s Details Test suite / Run Clippy (push) Failing after 12s Details Test suite / Run Rustfmt (push) Failing after 32s Details Test suite / Tests on macos-13 (push) Has been cancelled Details Test suite / Tests on windows-2022 (push) Has been cancelled Details 5316: Fix the dumpless upgrade corruption r=dureuill a=irevoire # Pull Request ## Related issue Fixes https://github.com/meilisearch/meilisearch/issues/5280 ## What does this PR do? - Add a test that ensure we write the version in the index-scheduler even if we have a bug while writing the VERSION file - Do what was described in the issue Co-authored-by: Tamo <tamo@meilisearch.com>	2025-02-10 09:53:57 +00:00
Tamo	45f843ccb9	fmt	2025-02-10 10:46:42 +01:00
Kerollmops	2b0e17ede0	Make sure arroy is using the rayon thread-pool	2025-02-06 15:28:10 +01:00
Louis Dureuil	04ac0af54b	Add WeightedScoreValues to be able to compare remote scores	2025-02-05 15:03:16 +01:00
Louis Dureuil	9996533364	Make search types serialize and deserialize so that reading from a proxy is possible	2025-02-05 15:03:16 +01:00
meili-bors[bot]	796acd1aee	Merge #5288 Some checks failed Test suite / Tests almost all features (push) Has been skipped Details Test suite / Test disabled tokenization (push) Has been skipped Details Test suite / Tests on ubuntu-20.04 (push) Failing after 13s Details Test suite / Run tests in debug (push) Failing after 13s Details Test suite / Run Clippy (push) Failing after 19s Details Test suite / Tests on windows-2022 (push) Failing after 48s Details Test suite / Run Rustfmt (push) Successful in 1m28s Details Test suite / Tests on macos-13 (push) Has been cancelled Details 5288: Improve AI logging r=dureuill a=Kerollmops This PR fixes #5285 and brings the changes from #5233 to simplify debugging indexation and search performance issues related to AI. The following texts can be found in the logs to debug and understand performance issues: - `embed_one: search` represents the time we spent waiting for the embedding generation, i.e., OpenAI, local HuggingFace, Ollama. - `filtered_universe: search::universe` the time spent filtering the documents. - ~`next_bucket: search::vector_sort` is the time spent finding the nearest neighbors (ANNs) in the vector store (arroy), locally~ was being triggered too many times. - `indexing::vectors` is the time arroy spends indexing the new vectors for a batch. - `documents::extract vectors` and `documents::merge vectors` to see the time spent generating and writing the embeddings. Co-authored-by: Kerollmops <clement@meilisearch.com>	2025-02-04 10:20:45 +00:00
Kerollmops	cc8df5e11f	Move back the search-side logging to tracing	2025-02-04 11:16:17 +01:00
meili-bors[bot]	ede74ccc42	Merge #5306 Some checks failed Test suite / Tests on ubuntu-20.04 (push) Failing after 2s Details Test suite / Tests almost all features (push) Has been skipped Details Test suite / Test disabled tokenization (push) Has been skipped Details Test suite / Run tests in debug (push) Failing after 2s Details Test suite / Tests on windows-2022 (push) Failing after 24s Details Test suite / Run Rustfmt (push) Successful in 1m33s Details Test suite / Run Clippy (push) Successful in 6m20s Details Test suite / Tests on macos-13 (push) Has been cancelled Details 5306: Fix internal error when passing `documentTemplateMaxBytes` to a source that doesn't support it r=ManyTheFish a=dureuill # Pull Request ## Related issue Fixes #5305 ## What does this PR do? - add `DOCUMENT_TEMPLATE_MAX_BYTES` to `allowed_sources_for_field` and `allowed_fields_for_source` to prevent a panic Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2025-02-04 08:46:13 +00:00
Tamo	d34f0b606c	Update crates/milli/src/update/new/document_change.rs	2025-02-03 12:08:52 +01:00
Kerollmops	acc400face	Support merging update and replacement operations	2025-02-03 11:47:17 +01:00
Kerollmops	aa2327591e	Add more mixing updates and replacements tests	2025-02-03 10:34:07 +01:00
Kerollmops	60470bb647	Fix the tests to use the new replace/update documents	2025-02-03 10:34:07 +01:00
Kerollmops	8e6893ddbe	Make sure we correctly mix different document operations	2025-02-03 10:34:06 +01:00
Kerollmops	7a9382b115	Better document the rayon limitation condition	2025-02-03 10:24:53 +01:00
Kerollmops	62dabeba5f	Do not create too many rayon tasks when processing the settings	2025-02-03 10:24:52 +01:00
Kerollmops	48812229a9	Remove a log that would log too much	2025-02-03 10:24:52 +01:00
Louis Dureuil	96544bfa43	add `DOCUMENT_TEMPLATE_MAX_BYTES` to `allowed_sources_for_field` and `allowed_fields_for_source`	2025-02-03 09:59:17 +01:00
Kerollmops	aaefbfae1f	Do not create too many rayon tasks	2025-01-30 16:36:12 +01:00
Kerollmops	97e17f52a1	Add more logs to see calls to the embedders	2025-01-30 16:36:12 +01:00

1 2 3 4 5 ...

347 Commits