milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2025-12-28 22:45:26 +08:00

Author	SHA1	Message	Date
Buqian Zheng	85a7a7b1e3	fix: skip json path index if the query path includes number (#46200 ) issue: #45511 our tantivy inverted index currently does not include item index if the value is an array, thus we can't do `a[0] == 'b'` type of look up in the inverted index. for such, we need to skip the index and use brute force search. we may improve our index in the future, so this is a temp solution Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-12-10 13:59:13 +08:00
cai.zhang	bb486c0db3	fix: Fix path concatenation error when rootPath = "." in minio (#46220 ) issue: #46219 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-12-10 13:53:13 +08:00
liliu-z	3f063a29b0	feat: Support Search By PK (#45820 ) issue: #39157 Overview: Support search by PK by resolving IDs to vectors on Proxy side. Upgrade go-api to adapt to new proto definitions. Design: - Upgrade milvus-proto/go-api to latest master. - Implement handleIfSearchByPK in Proxy: resolve IDs to vectors via internal Query, then rewrite SearchRequest. - Adapt to 'SearchInput' oneof field in SearchRequest across client and handlers. - Fix binary vector stride calculation bug in placeholder utils. Compatibility: - Old Pymilvus can still work w/o this feature What is included: - Dense and Sparse - Multi vector fields - Rejection on BM25 What is not include: - Hybrid Search - EmbeddingList - Restful API Signed-off-by: Li Liu <li.liu@zilliz.com>	2025-12-10 10:59:14 +08:00
cai.zhang	b5e11f810d	fix: Fix panic when search empty result with output geometry field (#46230 ) issue: #46146 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-12-09 20:37:13 +08:00
zhagnlu	8f0b7983ec	enhance: add jemalloc cached monitor (#46041 ) #46133 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-12-09 19:53:13 +08:00
zhuwenxing	f9ff0e8402	test: add testcases for add/alter/drop text embedding function (#46229 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-12-09 19:23:14 +08:00
zhuwenxing	abe0318bec	test: use predefined fake_de instead of creating new Faker instances to reduce run time (#46194 ) related: https://github.com/milvus-io/milvus/issues/46014 /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-12-09 17:59:14 +08:00
wei liu	046693eaf7	test: [skip e2e] fix race condition in TestQueryNodePipeline/TestBasic (#46218 ) issue: #46217 The test was failing intermittently because it didn't wait for the pipeline to finish processing messages before exiting. The test sent a message to the pipeline and immediately returned, causing the deferred Close() to execute before ProcessInsert, ProcessDelete, and UpdateTSafe could be called. Fix by: - Moving message construction before mock expectations setup - Adding a done channel to synchronize on UpdateTSafe completion - Waiting for the signal before test exits Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-09 17:57:14 +08:00
zhenshan.cao	765768b0e4	fix: restfulv2 parsing fixes and schema defaults support with timestamptz (#46057 ) issue: https://github.com/milvus-io/milvus/issues/44585 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2025-12-09 17:53:17 +08:00
Feilong Hou	624147740b	test: fix timestamptz e2e case failure on Jenkins Weekly (#46210 ) Issue: #46188 Bug was caused by inconsistent version of tzdata as well as wrong month assignment in convert_timestamptz function. Also fix when debug_mode=True the compare function can correctly return True or False. --------- Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-12-09 16:09:15 +08:00
wei liu	d7050c417f	fix: Add field data alignment validation to prevent partial update panic (#46177 ) issue: #46176 - Add checkAligned validation before processing partial update field data to prevent index out of range panic when field data arrays have mismatched lengths - Fix GetNumRowOfFieldDataWithSchema to handle Timestamptz string format and Geometry WKT format properly - Add unit tests for empty data array scenarios in partial update --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-09 14:17:12 +08:00
congqixia	728cdc15b2	fix: fill partition_id in load index info and close RemoteOutputStream properly (#46203 ) This PR fixes two issues related to segment loading and index deserialization: 1. Fill partition_id in LoadIndexInfo when converting field index info, which is required by cardinal (DiskANN) index deserialization. 2. Close RemoteOutputStream in destructor to ensure buffer flushed and resources released properly. issue: #46141 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-12-09 13:27:13 +08:00
zhuwenxing	8fac376afd	test: upgrade minio sdk from 7.1.5 to 7.2.0 (#46186 ) /kind improvement fix minio sdk error ``` 2025-12-08T07:54:48Z {container="step-test"} def _put_object(self, bucket_name, object_name, data, headers, 2025-12-08T07:54:48Z {container="step-test"} query_params=None): 2025-12-08T07:54:48Z {container="step-test"} """Execute PutObject S3 API.""" 2025-12-08T07:54:48Z {container="step-test"} response = self._execute( 2025-12-08T07:54:48Z {container="step-test"} "PUT", 2025-12-08T07:54:48Z {container="step-test"} bucket_name, 2025-12-08T07:54:48Z {container="step-test"} object_name, 2025-12-08T07:54:48Z {container="step-test"} body=data, 2025-12-08T07:54:48Z {container="step-test"} headers=headers, 2025-12-08T07:54:48Z {container="step-test"} query_params=query_params, 2025-12-08T07:54:48Z {container="step-test"} no_body_trace=True, 2025-12-08T07:54:48Z {container="step-test"} ) 2025-12-08T07:54:48Z {container="step-test"} return ObjectWriteResult( 2025-12-08T07:54:48Z {container="step-test"} bucket_name, 2025-12-08T07:54:48Z {container="step-test"} object_name, 2025-12-08T07:54:48Z {container="step-test"} > response.getheader("x-amz-version-id"), 2025-12-08T07:54:48Z {container="step-test"} response.getheader("etag").replace('"', ""), 2025-12-08T07:54:48Z {container="step-test"} response.getheaders(), 2025-12-08T07:54:48Z {container="step-test"} ) 2025-12-08T07:54:48Z {container="step-test"} E AttributeError: 'HTTPResponse' object has no attribute 'getheader' 2025-12-08T07:54:48Z {container="step-test"} 2025-12-08T07:54:48Z {container="step-test"} /usr/local/lib/python3.10/site-packages/minio/api.py:1582: AttributeError ``` --------- Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-12-09 11:39:12 +08:00
Zhen Ye	b8086cb62b	fix: lost database in restful v2 (#46171 ) issue: #45812 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-12-09 10:59:13 +08:00
Zhen Ye	459425ac84	fix: wrong context using by session of grpc client (#46183 ) issue: #46182 Signed-off-by: chyezh <chyezh@outlook.com>	2025-12-08 21:47:12 +08:00
congqixia	a042a6e1e8	enhance: support pause GC at collection level (#45943 ) Add collection-level granularity to the garbage collector pause/resume mechanism. Previously, GC pause affected all collections globally. Now operators can pause GC for specific collections while allowing other collections to continue normal GC operations. Changes: - Add `pausedCollection` concurrent map to track per-collection pause state - Extend `Pause()` and `Resume()` methods with `collectionID` parameter - Add `collectionGCPaused()` helper to check collection pause status - Skip dropped segment recycling when collection GC is paused - Update management API to accept optional `collection_id` query parameter - Add `GetInt64Value()` utility function for parsing int64 from KV pairs - Maintain backward compatibility: collectionID <= 0 triggers global pause This provides DevOps with finer control over Milvus data lifecycle. issue: #45941 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-12-08 19:33:15 +08:00
zhuwenxing	4fe41ff14d	test: add emb list recall test (#46135 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-12-08 19:21:13 +08:00
Linkwei	a4c1c5a304	enhance:[skip e2e] Enhance monitoring select db related to Milvus 2.5 (#46178 ) enhance: Enhance monitoring select db related to Milvus 2.5 https://github.com/milvus-io/milvus/issues/46179 Signed-off-by: Linkwei <link.xie@zilliz.com>	2025-12-08 15:03:11 +08:00
Spade A	253449ea18	fix: add timeout for TestArrayStructDataNodeUtil to avoid blocking (#45867 ) issue: https://github.com/milvus-io/milvus/issues/45866 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-12-08 14:41:12 +08:00
wei liu	8accde97be	test: [skip e2e] unstable ut caused by duplicate pk in same batch (#46132 ) issue: #46105 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-08 14:13:16 +08:00
Zhen Ye	354927374f	fix: wrong auth of restfulv2 forward when upgrading (#46139 ) issue: #45812 Signed-off-by: chyezh <chyezh@outlook.com>	2025-12-08 12:45:12 +08:00
Buqian Zheng	95a535cb4d	fix: struct reduce incorrect (#46150 ) issue: https://github.com/milvus-io/milvus/issues/42148 --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-12-08 10:23:11 +08:00
tinswzy	1917bb720f	enhance: add fallback mechanism for WP when accessing object storage without Condition Write support (#45735 ) related issue: #45733 related [wp issue: #60](https://github.com/zilliztech/woodpecker/issues/60) Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2025-12-07 21:59:11 +08:00
yihao.dai	b69f4ab1cd	fix: Fix replicate lag metric calculation to prevent false-positive health (#46120 ) This change fixes the calculation by using timestamp subtraction (WAL confirmed time - Last replicate time). This ensures the lag metric immediately spikes when replication is blocked, providing reliable monitoring. issue: https://github.com/milvus-io/milvus/issues/46116 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-12-07 21:51:12 +08:00
foxspy	3a3163e613	fix: skip gpu init for streaming node (#45650 ) issue: #45597 The streaming node currently cannot use GPU resources and does not need to perform initialization. Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-12-07 13:59:11 +08:00
congqixia	d4450b2f57	enhance: [StorageV2] Integrate CMEK support into Loon FFI interface (#46123 ) This PR adds Customer Managed Encryption Keys (CMEK) support to the StorageV2 FFI layer, enabling data encryption/decryption through the cipher plugin system. Changes: - Add ffi_writer_c.cpp/h with GetEncParams() to retrieve encryption parameters (key and metadata) from cipher plugin for data encryption - Extend GetLoonReader() in ffi_reader_c.cpp to support CMEK decryption by configuring KeyRetriever when plugin context is provided - Add encryption property constants in ffi_common.go for writer config - Integrate CMEK encryption in NewFFIPackedWriter() to pass encryption parameters to the underlying storage writer issue: #44956 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-12-05 17:59:12 +08:00
congqixia	8e82631282	fix: correct index_has_raw_data logic for fielddata loading (#46117 ) Related to #46098 This fix addresses a bug where the segment loader incorrectly determined whether scalar fields have raw data in their indexes, leading to unnecessary field data loading or skipping indexed raw data retrieval. - Build `field_ids` vector that handles both single field and column group cases (when `child_fields_size() > 0`) - Move the mmap setting and index_has_raw_data checks before the skip decision, iterating over the correctly built `field_ids` - Fix the boolean AND logic in both `Load()` and `LoadColumnGroup()` to properly check if ALL fields in the group have raw data in their indexes This bug was hiding the root cause of issue #46098, where QueryNode panics when outputting timestamptz data from scalar index with raw data. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-12-05 17:47:12 +08:00
cai.zhang	141547d8a8	enhance: Add log with segment size for tasks (#46118 ) Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-12-05 16:45:11 +08:00
aoiasd	d8c9d15c07	fix: highlighter return error when search return empty result (#46107 ) relate: https://github.com/milvus-io/milvus/issues/42589 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-12-05 14:23:10 +08:00
wei liu	354fe9c9d2	fix: unstable test case TestTask_VarCharPrimaryKey (#46106 ) issue: #46105 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-05 14:01:12 +08:00
congqixia	3daff1ab2b	enhance: use specified manifest version in loon ffi reader (#46101 ) Related to #44956 Use the exact manifest version from the path parameter instead of always fetching the latest manifest. This ensures data consistency by reading from the specific version that was requested. Changes: - Update GetColumnGroups to use transaction.begin(version) with the specified version from the path JSON - Replace get_latest_manifest() with get_current_manifest() after beginning transaction at the target version - Update Go FFI binding to call get_column_groups_by_version instead of get_latest_column_groups - Remove unused GetManifest function from util.cpp/util.h - Bump milvus-storage version from 5fff4f5 to 33bf815 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-12-05 11:45:11 +08:00
Buqian Zheng	1372e84d7f	fix: move cursor after skip index skipped a chunk (#46054 ) issue: #46053 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-12-05 10:47:11 +08:00
Anurag Ojha	3d17ddb71b	doc: remove outdated archive links from table (#46062 ) This PR removes several archived project links from the CONTRIBUTING.md table. These links point to inactive repositories and no longer provide value to contributors. The update improves clarity and keeps the documentation current. Signed-off-by: intojhanurag <aojharaj2004@gmail.com>	2025-12-05 10:01:11 +08:00
congqixia	3bbc5d0825	fix: correct loop logic for timestamptz scalar index output (#46100 ) Related to #46098 Fix the ReverseDataFromIndex function where the assignment of raw_data to scalar_array and the break statement were incorrectly placed inside the for loop for TIMESTAMPTZ data type. This caused QueryNode to panic when outputting timestamptz fields from scalar index. Move the assignment and break statement outside the loop to match the pattern used by other data types like VARCHAR. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-12-05 00:09:11 +08:00
Yiqing Lu	89b4c58266	enhance: bump etcd version to 3.5.25 (#46093 ) As title. Signed-off-by: AlintaLu <yiqing.lu@zilliz.com>	2025-12-04 17:37:15 +08:00
Xinyi7	59752f216d	fix: add check in batch_score function to prevent query node seg fault (#46025 ) previously we saw that when doing reranker with phrase matching, the query node throws a segmentation fault error. github issue link: https://github.com/milvus-io/milvus/issues/45990 --------- Signed-off-by: Xinyi Jiang <xinyi.jiang@reddit.com> Co-authored-by: Xinyi Jiang <xinyi.jiang@reddit.com>	2025-12-04 17:35:17 +08:00
XuanYang-cn	2cb6073bdd	test: Increase PyMilvus version to 2.7.0rc83 for master branch (#46072 ) Automated daily bump from pymilvus master branch. Updates tests/python_client/requirements.txt. Signed-off-by: XuanYang-cn <xuan.yang@zilliz.com>	2025-12-04 17:33:20 +08:00
Zhen Ye	c22cdbbf9a	enhance: support proxy DQL forward (#46036 ) issue: #45812 Signed-off-by: chyezh <chyezh@outlook.com>	2025-12-04 17:11:12 +08:00
aoiasd	354ab2f55e	enhance: sync file resource to querynode and datanode (#44480 ) relate:https://github.com/milvus-io/milvus/issues/43687 Support use file resource with sync mode. Auto download or remove file resource to local when user add or remove file resource. Sync file resource to node when find new node session. --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-12-04 16:23:11 +08:00
wei liu	0c63ed95bb	test: [skip e2e] fix unstable assignment tests in balancer (#46042 ) issue: #46038 - Add assertSegmentPlanNumAndTargetNodeMatch and assertChannelPlanNumAndTargetNodeMatch helper functions to validate plan count and target node membership for unstable assignment tests - Mark "test assigning channels with resource exhausted nodes" as unstable since node 2 and 3 have equal priority after filtering - Replace simple length check with target node validation to ensure plans assign to expected node set even when order is non-deterministic Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-04 16:17:11 +08:00
XuanYang-cn	64d19fb4f3	fix: Set plugin context when cipher enabled (#46050 ) See also: #46008 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-12-04 15:17:15 +08:00
Zhen Ye	bf76a9e8e2	fix: milvus fast fail if any component is not ready (#46069 ) issue: #45243 Signed-off-by: chyezh <chyezh@outlook.com>	2025-12-04 15:15:10 +08:00
cai.zhang	cfd49b7680	enhance: Estimate the taskSlot based on whether scalar or vector index (#45850 ) issue: #45186 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-12-04 14:15:10 +08:00
sre-ci-robot	605816949f	[automated] Bump milvus version to v2.6.7 (#46068 ) Bump milvus version to v2.6.7 Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2025-12-04 11:09:12 +08:00
congqixia	a659506e82	enhance: Bump go version to 1.24.11 for builder image (#46049 ) Follow-up of #46034 Bump go version in builder image fixing CVE-2025-61729 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-12-04 11:07:11 +08:00
Ted Xu	20ce9fdc23	feat: bump loon version (#46029 ) See: #44956 This PR upgrades loon to the latest version and resolves building conflicts. --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com> Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> Co-authored-by: Congqi Xia <congqi.xia@zilliz.com>	2025-12-04 10:57:12 +08:00
aoiasd	8efe9ccac6	feat: Add support for using highlight without returning the field as the output field. (#45984 ) relate: https://github.com/milvus-io/milvus/issues/42589 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-12-04 10:35:11 +08:00
nico	43fe215787	test: update sdk version and skip some debug log (#46040 ) Signed-off-by: nico <cheng.yuan@zilliz.com>	2025-12-04 10:33:11 +08:00
wei liu	f85e86a6ec	fix: change upsert duplicate PK behavior from dedup to error (#45997 ) issue: #44320 Replace the DeduplicateFieldData function with CheckDuplicatePkExist that returns an error when duplicate primary keys are detected in the same batch, instead of silently deduplicating. Changes: - Replace DeduplicateFieldData with CheckDuplicatePkExist in util.go - Update upsertTask.PreExecute to return error on duplicate PKs - Simplify helper function from findLastOccurrenceIndices to hasDuplicates - Update unit tests to verify the new error behavior - Add Python integration tests for duplicate PK error cases Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-04 10:23:11 +08:00
wei liu	a308331b81	fix: Set replica field in balance plans to prevent panic (#45722 ) issue: #45598 The MultiTargetBalancer was missing replica field assignment in the generated segment and channel plans, which caused panic during balance operations. This change ensures that all balance plans have the replica field properly set to fix the panic issue. Also refactored the balance test to extract common test logic into a reusable helper function and added a new integration test specifically for MultipleTargetBalancer policy. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-04 10:19:11 +08:00

1 2 3 4 5 ...

23633 Commits