milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2025-12-07 01:28:27 +08:00

Author	SHA1	Message	Date
aoiasd	cfeb095ad7	enhance: forbid build analyzer at proxy (#44067 ) relate: https://github.com/milvus-io/milvus/issues/43687 We used to run the temporary analyzer and validate analyzer on the proxy, but the proxy should not be a computation-heavy node. This PR move all analyzer calculations to the streaming node. --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-10-23 10:58:12 +08:00
Bingyi Sun	0c0630cc38	feat: support dropping index without releasing collection (#42941 ) issue: #42942 This pr includes the following changes: 1. Added checks for index checker in querycoord to generate drop index tasks 2. Added drop index interface to querynode 3. To avoid search failure after dropping the index, the querynode allows the use of lazy mode (warmup=disable) to load raw data even when indexes contain raw data. 4. In segcore, loading the index no longer deletes raw data; instead, it evicts it. 5. In expr, the index is pinned to prevent concurrent errors. --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-09-02 16:17:52 +08:00
zhagnlu	fc876639cf	enhance: support json stats with shredding design (#42534 ) #42533 Co-authored-by: luzhang <luzhang@zilliz.com>	2025-09-01 10:49:52 +08:00
Tianx	c0d62268ac	feat: add timesatmptz data type (#44005 ) issue: https://github.com/milvus-io/milvus/issues/27467 > https://github.com/milvus-io/milvus/issues/27467#issuecomment-3092211420 > * [x] M1 Create collection with timestamptz field > * [x] M2 Insert timestamptz field data > * [x] M3 Retrieve timestamptz field data > * [x] M4 Implement handoff[ ] The second PR of issue: https://github.com/milvus-io/milvus/issues/27467, which completes M1-M4 described above. --------- Signed-off-by: xtx <xtianx@smail.nju.edu.cn>	2025-08-26 15:59:53 +08:00
Zhen Ye	5551d99425	enhance: remove old arch non-streaming arch code (#43651 ) issue: #41609 - remove all dml dead code at proxy - remove dead code at l0_write_buffer - remove msgstream dependency at proxy - remove timetick reporter from proxy - remove replicate stream implementation --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-08-06 14:41:40 +08:00
congqixia	5d90b65342	enhance: [StorageV2] Add storage version in Data/Query view resp (#43348 ) Related to #39173 Add `storage_version` in data/query view segment info response --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-16 15:52:51 +08:00
wei liu	bf5fde1431	fix: Prevent delegator unserviceable due to shard leader change (#42689 ) issue: #42098 #42404 Fix critical issue where concurrent balance segment and balance channel operations cause delegator view inconsistency. When shard leader switches between load and release phases of segment balance, it results in loading segments on old delegator but releasing on new delegator, making the new delegator unserviceable. The root cause is that balance segment modifies delegator views, and if these modifications happen on different delegators due to leader change, it corrupts the delegator state and affects query availability. Changes include: - Add shardLeaderID field to SegmentTask to track delegator for load - Record shard leader ID during segment loading in move operations - Skip release if shard leader changed from the one used for loading - Add comprehensive unit tests for leader change scenarios This ensures balance segment operations are atomic on single delegator, preventing view corruption and maintaining delegator serviceability. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-19 12:10:38 +08:00
wei liu	78c39edbce	fix: Fix potential panic when DeleteCheckpoint is nil (#42664 ) issue: #42663 Fix panic issue when processing VchannelInfo messages from older coordinator versions that don't have DeleteCheckpoint field. Changes: - Add null safety check for DeleteCheckpoint before accessing methods - Maintain backward compatibility with legacy message formats - Improve seek position selection logic for both old and new versions --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-13 14:26:36 +08:00
wei liu	e7c0a6ffbb	enhance: Refine QueryNode task parallelism based on CPU core count (#42166 ) issue: #42165 Implement dynamic task execution capacity calculation based on QueryNode CPU core count instead of static configuration for better resource utilization. Changes include: - Add CpuCoreNum() method and WithCpuCoreNum() option to NodeInfo - Implement GetTaskExecutionCap() for dynamic capacity calculation - Add QueryNodeTaskParallelismFactor parameter for tuning - Update proto definition to include cpu_core_num field - Add unit tests for new functionality This allows QueryCoord to automatically adjust task parallelism based on actual hardware resources. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-11 13:20:35 +08:00
wei liu	f3fe117840	fix: Use delete checkpoint to prevent delete record loss in L0 refactoring (#42628 ) issue: #39333 #41570 Fix delete record missing issue introduced in PR #39552 L0 refactoring: - Use delete checkpoint as consume start position when deleteCP < channelCP - Add logging when delete checkpoint is used instead of seek position - Prevent delete record loss when deleteCP is earlier than default channelCP Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-10 17:34:35 +08:00
aoiasd	2ae4d80120	enhance: support run analyzer by loaded collection field (#42113 ) relate: https://github.com/milvus-io/milvus/issues/42094 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-05-29 10:54:30 +08:00
wei liu	54619eaa2c	feat: Implement partial result support on node down (#42009 ) issue: https://github.com/milvus-io/milvus/issues/41690 This commit implements partial search result functionality when query nodes go down, improving system availability during node failures. The changes include: - Enhanced load balancing in proxy (lb_policy.go) to handle node failures with retry support - Added partial search result capability in querynode delegator and distribution logic - Implemented tests for various partial result scenarios when nodes go down - Added metrics to track partial search results in querynode_metrics.go - Updated parameter configuration to support partial result required data ratio - Replaced old partial_search_test.go with more comprehensive partial_result_on_node_down_test.go - Updated proto definitions and improved retry logic These changes improve query resilience by returning partial results to users when some query nodes are unavailable, ensuring that queries don't completely fail when a portion of data remains accessible. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-05-28 00:12:28 +08:00
Xianhui Lin	6a0e182e13	enhance: support TTL expiration with queries returning no results (#42086 ) support TTL expiration with queries returning no results issue:https://github.com/milvus-io/milvus/issues/41959 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-05-27 18:28:27 +08:00
wei liu	78010262f0	enhance: Optimize shard serviceable mechanism (#41937 ) issue: https://github.com/milvus-io/milvus/issues/41690 - Merge leader view and channel management into ChannelDistManager, allowing a channel to have multiple delegators. - Improve shard leader switching to ensure a single replica only has one shard leader per channel. The shard leader handles all resource loading and query requests. - Refine the serviceable mechanism: after QC completes loading, sync the query view to the delegator. The delegator then determines its serviceable status based on the query view. - When a delegator encounters forwarding query or deletion failures, mark the corresponding segment as offline and transition it to an unserviceable state. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-05-22 11:38:24 +08:00
congqixia	b5443ddbd0	enhance: [AddField] Reopen loaded segments after AddField (#41529 ) Related to #39718 This PR: - Add reopen logic for growing & sealed segments - Lazy reopen when schema version increases - Add FinishLoad api for loading progress --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-04-26 08:48:39 +08:00
congqixia	b36c88f3c8	enhance: [AddField] Broadcast schema change via WAL (#41373 ) Related to #39718 Add Broadcast logic for collection schema change and notifies: - Streamnode - Delegator - Streamnode - Flush component - QueryNodes via grpc --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-04-22 16:28:37 +08:00
Xianhui Lin	3bc24c264f	enhance: Add json key inverted index in stats for optimization (#38039 ) Add json key inverted index in stats for optimization https://github.com/milvus-io/milvus/issues/36995 --------- Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-04-10 15:20:28 +08:00
wei liu	06310a5994	fix: Fix L0 segment retention and improve delete buffer logging (#40884 ) issue：#40207 related to https://github.com/milvus-io/milvus/pull/39552 - Correct comparison operator in UnRegister from > to >= to prevent premature release of L0 segments with matching timestamps - Add detailed logging for segment retention decisions during unregistration - Enhance error logging for buffer cleanup operations - Add trace logs for segment registration/release lifecycle - Include timestamp comparisons in debug logs for future troubleshooting Signed-off-by: Wei Liu <wei.liu@zilliz.com> Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-03-27 11:24:21 +08:00
congqixia	9824615efb	enhance: Close component in topological order when unsub channel (#40796 ) Related to #40795 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-03-21 10:22:11 +08:00
congqixia	94a859c028	enhance: Add buffer forwarder for stream delta loading (#40559 ) See also #40558 Related to #35303 & #38066 as well This PR: - Add `BufferedForward` to limit memory usage forwarding stream delete - Add `UseLoad` flag to determine `Delete` shall use `segment.Delete` or `segment.LoadDelta` - Fix delegator accidentally use always true candidate while load streaming delta --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-03-17 15:24:10 +08:00
wei liu	0420dc1eb1	fix: use correct delete checkpoint to prevent premature data cleanup (#40366 ) issue: #40292 related to #39552 - Fix incorrect delete checkpoint usage in SyncDistribution - Change checkpoint parameter from action.GetCheckpoint() to action.GetDeleteCP() in SyncTargetVersion call - This resolves the issue where delete buffer data was being cleaned prematurely due to wrong checkpoint reference Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-03-12 15:00:08 +08:00
Spade A	95e2680a36	fix: ref collection for search/query (#40549 ) ref https://github.com/milvus-io/milvus/issues/40473 Collection is got without ref which means the collection could be releases and the struct could be freed during the search which leads schema inconsistency. Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-03-12 11:30:07 +08:00
wei liu	69b8b89369	enhance: Remove QueryCoord's scheduling of L0 segments (#39552 ) issue: #39551 This PR remove querycoord's scheduling of l0 segments: - only load l0 segment when watch channel - only release l0 segment when release channel or sync data distribution --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-02-26 21:38:00 +08:00
congqixia	cb7f2fa6fd	enhance: Use v2 package name for pkg module (#39990 ) Related to #39095 https://go.dev/doc/modules/version-numbers Update pkg version according to golang dep version convention --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-22 23:15:58 +08:00
Bingyi Sun	b59555057d	feat: support json index (#36750 ) https://github.com/milvus-io/milvus/issues/35528 This PR adds json index support for json and dynamic fields. Now you can only do unary query like 'a["b"] > 1' using this index. We will support more filter type later. basic usage: ``` collection.create_index("json_field", {"index_type": "INVERTED", "params": {"json_cast_type": DataType.STRING, "json_path": 'json_field["a"]["b"]'}}) ``` There are some limits to use this index: 1. If a record does not have the json path you specify, it will be ignored and there will not be an error. 2. If a value of the json path fails to be cast to the type you specify, it will be ignored and there will not be an error. 3. A specific json path can have only one json index. 4. If you try to create more than one json indexes for one json field, sdk(pymilvus<=2.4.7) may return immediately because of internal implementation. This will be fixed in a later version. --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-02-15 14:06:15 +08:00
congqixia	bca2a62b78	enhance: Handle PutOrRef collection schema failure error (#39310 ) Related to previous pr #39279 When NewCollection returns nil, the error shall be returned and handled by caller Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-01-16 10:13:06 +08:00
Zhen Ye	bb8d1ab3bf	enhance: make new go package to manage proto (#39114 ) issue: #39095 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-01-10 10:49:01 +08:00
cai.zhang	bb5f38e574	enhance: Return collection not loaded rather than not found on querynode (#38593 ) issue: #38586 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-12-23 14:36:48 +08:00
jaime	78438ef41e	fix: revert optimize CPU usage for CheckHealth requests (#35589 ) (#38555 ) issue: #35563 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-12-19 00:38:45 +08:00
jaime	28fdbc4e30	enhance: optimize CPU usage for CheckHealth requests (#35589 ) issue: #35563 1. Use an internal health checker to monitor the cluster's health state, storing the latest state on the coordinator node. The CheckHealth request retrieves the cluster's health from this latest state on the proxy sides, which enhances cluster stability. 2. Each health check will assess all collections and channels, with detailed failure messages temporarily saved in the latest state. 3. Use CheckHealth request instead of the heavy GetMetrics request on the querynode and datanode Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-12-17 11:02:45 +08:00
tinswzy	27229f7907	enhance: refine exists log print with ctx (#38080 ) issue: #35917 Refines exists log print with ctx Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2024-12-14 22:36:44 +08:00
congqixia	10460ed3f8	enhance: Simplify querynode tsafe & reduce goroutine number (#38416 ) Related to #37630 TSafe manager is too complex for current implementation and each delegator need one goroutine waiting for tsafe update event. Tsafe updating could be executed in pipeline. This PR remove tsafe manager and simplify the entire logic of tsafe updating. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-12-13 10:56:43 +08:00
Gao	8977454311	enhance: support recall estimation (#38017 ) issue: #37899 Only `search` api will be supported --------- Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-12-11 20:40:48 +08:00
congqixia	051bc280dd	enhance: Make dynamic load/release partition follow targets (#38059 ) Related to #37849 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-12-05 16:24:40 +08:00
wei liu	108434969f	enhance: Add collection id to search request count metrics (#38069 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-29 17:28:38 +08:00
congqixia	1ed686783f	enhance: Use `PrimaryKeys` to replace interface slice for segment delete (#37880 ) Related to #35303 Reduce temporary memory usage for PK interface for segment delete. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-22 11:52:33 +08:00
SimFG	54aaeda63f	fix: add the request ctx for stream pipeline interface (#37835 ) - issue: #37834 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-11-22 03:10:34 +08:00
wei liu	5f3601a6a5	fix: unstable integration test caused by paramtable.GetNodeID (#37909 ) issue: #37908 cause paramtable is global single instance, which cause paramtable.GetNodeID may return wrong server id in integration test. This PR use node.GetNodeID to replace paramtable.GetNodeID Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-21 22:16:33 +08:00
Zhen Ye	fca946dee1	enhance: move the search utilities from querynode into new module (#37531 ) issue: #33285 - The search utilities will be shared between query node and streaming node. Signed-off-by: chyezh <chyezh@outlook.com>	2024-11-11 11:10:27 +08:00
congqixia	224d797f94	fix: Use singleton delete pool and avoid goroutine leakage (#37220 ) Related to #36887 Previously using newly create pool per request shall cause goroutine leakage. This PR change this behavior by using singleton delete pool. This change could also provide better concurrency control over delete memory usage. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-29 10:02:24 +08:00
jaime	9d16b972ea	feat: add tasks page into management WebUI (#37002 ) issue: #36621 1. Add API to access task runtime metrics, including: - build index task - compaction task - import task - balance (including load/release of segments/channels and some leader tasks on querycoord) - sync task 2. Add a debug model to the webpage by using debug=true or debug=false in the URL query parameters to enable or disable debug mode. Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-28 10:13:29 +08:00
yihao.dai	ed37c27bda	fix: Fix collection leak in querynode (#37061 ) Unref the removed L0 segment count. issue: https://github.com/milvus-io/milvus/issues/36918 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-25 19:59:29 +08:00
congqixia	f43527ef6f	enhance: Batch forward delete when using DirectForward (#37076 ) Relatedt #36887 DirectFoward streaming delete will cause memory usage explode if the segments number was large. This PR add batching delete API and using it for direct forward implementation. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-24 10:39:28 +08:00
Gao	1d61b604e1	enhance: support retry search when topk is reduced and result not enough (#35645 ) issue: #35576 This pr is to cover those cases when queryHook optimize search params and make the result size insufficient, add retry search mechanism and add related metrics for alarming. --------- Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-10-23 19:19:30 +08:00
jaime	4746f47282	feat: management WebUI homepage (#36822 ) issue: #36784 1. Implement an embedded web server for WebUI access. 2. Complete the homepage development. Home page demo: <img width="2177" alt="iShot_2024-10-10_17 57 34" src="https://github.com/user-attachments/assets/38539917-ce09-4e54-a5b5-7f4f7eaac353"> Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-23 11:29:28 +08:00
yihao.dai	f3b6792a25	enhance: Enhance segment log (#36848 ) /kind improvement Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-15 20:43:30 +08:00
wei liu	470bb0cc3f	enhance: Enable balance on querynode with different mem capacity (#36466 ) issue: #36464 This PR enable balance on querynode with different mem capacity, for query node which has more mem capactity will be assigned more records, and query node with the largest difference between assignedScore and currentScore will have a higher priority to carry the new segment. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-30 16:15:17 +08:00
cai.zhang	ecb2b242e2	enhance: Add sorted for segment info (#36469 ) issue: #33744 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-09-30 10:01:16 +08:00
wei liu	329fb421cd	fix: fix search/query/count may access same growing and sealed segment (#36258 ) issue: #36257 during syncTargetVersion, sealed segment should be excluded, to avoid it's growing segment be conusmed from stream again. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-14 14:21:07 +08:00
Chun Han	e480b103bd	feat: supporing hybrid search group_by (#35982 ) related: #35096 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-09-08 17:09:04 +08:00

1 2 3 4

175 Commits