milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2025-12-06 17:18:35 +08:00

Author	SHA1	Message	Date
wei liu	a308331b81	fix: Set replica field in balance plans to prevent panic (#45722 ) issue: #45598 The MultiTargetBalancer was missing replica field assignment in the generated segment and channel plans, which caused panic during balance operations. This change ensures that all balance plans have the replica field properly set to fix the panic issue. Also refactored the balance test to extract common test logic into a reusable helper function and added a new integration test specifically for MultipleTargetBalancer policy. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-04 10:19:11 +08:00
Zhen Ye	df7e507c49	fix: balance may not trigger at balance checker when upgrading (#43462 ) issue: #43416 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-22 16:02:53 +08:00
Zhen Ye	25b76e1fde	fix: cannot auto balance the channel from old arch to streamingnode (#43424 ) issue: #43416, #43413 - also fix the panic on streamingnode when concurrent sync Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-20 23:00:52 +08:00
Zhen Ye	508264f953	fix: querynode upgrade from 2.5 get stucked (#42502 ) issue: #42492 - consider the old RO query node (not streaming node) when balancing channel. - querynode graceful stop can be done if there's only L0 segment exists. Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-04 11:20:30 +08:00
wei liu	2669d14ba0	refactor: Remove balance constraints between channel and segment tasks (#42177 ) issue: #42176 Remove the mutual exclusion constraints between channel and segment balance tasks to allow them to run concurrently. Changes include: - Remove permitBalanceChannel() and permitBalanceSegment() methods from RoundRobinBalancer - Update ChannelLevelScoreBalancer, MultiTargetBalancer, RowCountBasedBalancer, and ScoreBasedBalancer to remove constraint checks - Allow segment balance tasks to proceed even when channel balance tasks are running - Update test cases to reflect new behavior where balance tasks no longer block each other This change improves the efficiency of load balancing by removing unnecessary coordination overhead between different types of balance operations. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-05-30 18:14:25 +08:00
congqixia	cb7f2fa6fd	enhance: Use v2 package name for pkg module (#39990 ) Related to #39095 https://go.dev/doc/modules/version-numbers Update pkg version according to golang dep version convention --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-22 23:15:58 +08:00
Zhen Ye	c84a0748c4	enhance: add rw/ro streaming query node replica management (#38677 ) issue: #38399 - Embed the query node into streaming node to make delegator available at streaming node. - The embedded query node has a special server label `QUERYNODE_STREAMING-EMBEDDED`. - Change the balance strategy to make the channel assigned to streaming node as much as possible. Signed-off-by: chyezh <chyezh@outlook.com>	2025-01-24 16:55:07 +08:00
tinswzy	e76802f910	enhance: refine querycoord meta/catalog related interfaces to ensure that each method includes a ctx parameter (#37916 ) issue: #35917 This PR refine the querycoord meta related interfaces to ensure that each method includes a ctx parameter. Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2024-11-25 11:14:34 +08:00
wei liu	0a440e0d38	fix: Prevent simultaneous balance of segments and channels (#37850 ) issue: #33550 balance segment and balance segment execute at same time, which will cause bounch of corner case. This PR disable simultaneous balance of segments and channels Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-21 17:56:55 +08:00
congqixia	3fe0f82923	enhance: Add balance report log for qc balancer (#36747 ) Related to #36746 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-11 10:25:24 +08:00
Bingyi Sun	ae1b81ac1a	fix: fix panic when generating plans (#35309 ) issue: https://github.com/milvus-io/milvus/issues/35335 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-08-07 18:14:16 +08:00
jaime	fcec4c21b9	fix: check collection health(queryable) fail for releasing collection (#34947 ) issue: #34946 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-08-02 17:20:15 +08:00
wei liu	27b6d58981	fix: Set legacy level to l0 segment after qc restart (#35197 ) issue: #35087 after qc restarts, and target is not ready yet, if dist_handler try to update segment dist, it will set legacy level to l0 segment, which may cause l0 segment be moved to other node, cause search/query failed. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-02 10:18:13 +08:00
wei liu	03912a8788	enhance: Avoid balance stuck after segment list become stable (#34728 ) issue: #34715 if collection's segment list doesn't changes anymore, then the next target will be empty at most time, and balance segment will check whether segment exist in both current and next target, so the balance cloud be blocked due to next target is empty. This PR permit segment to be moved if next target is empty, to avoid balance stuck. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-31 18:09:48 +08:00
wei liu	a7f6193bfc	fix: query node may stuck at stopping progress (#33104 ) issue: #33103 when try to do stopping balance for stopping query node, balancer will try to get node list from replica.GetNodes, then check whether node is stopping, if so, stopping balance will be triggered for this replica. after the replica refactor, replica.GetNodes only return rwNodes, and the stopping node maintains in roNodes, so balancer couldn't find replica which contains stopping node, and stopping balance for replica won't be triggered, then query node will stuck forever due to segment/channel doesn't move out. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-20 10:21:38 +08:00
chyezh	a2502bde75	enhance: replica manager enhancement (#31496 ) issue: #30647 - ReplicaManager manage read only node now, and always do persistent of node distribution of replica. - All segment/channel checker using ReplicaManager to get read-only node or read-write node, but not ResourceManager. - ReplicaManager promise that only apply unique querynode to one replica in same collection now (replicas in same collection never hold same querynode at same time). - ReplicaManager promise that fairly node count assignment policy if multi replicas of collection is assigned to one resource group. - Move some parameters check into ReplicaManager to avoid data race. - Allow transfer replica to resource group that already load replica of same collection - Allow transfer node between resource groups that load replica of same collection --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-05 04:57:16 +08:00
wei liu	06b191b164	fix: Balance channel stuck forever due to logic dead lock (#31202 ) issue: #30816 cause balance channel will stuck until leader view catch up the current target, then start to unsub the old delegator. which make sure that the new delegator can provide search before release old delegator. but another logic in segment_checker skip loading segment during balance channel. so during balance channel, if query node crash, new delegator can't catch up target forever, then stuck forever. This PR remove the rule that skip loading segment during balance channel to avoid the logic dead lock here. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-13 15:05:04 +08:00
wei liu	efe8cecc88	enhance: refactor segment dist manager interface (#31073 ) issue: #31091 This PR add `GetByFilter` interface in segment dist manager, instead of all kind of get func Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-08 16:29:01 +08:00
Bingyi Sun	ece9d273a7	enhance: some patches for #30636 (#30664 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-02-26 11:42:55 +08:00
Bingyi Sun	564b12c661	enhance: make balance cost threshold configurable (#30636 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-02-19 15:24:50 +08:00
congqixia	a6d9eb7f20	fix: Remove balance plan of which From, To nodes are same when merging (#30634 ) See also #30627 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-18 17:24:50 +08:00
Bingyi Sun	715f042965	feat: add a balancer based on both of row count and segment count (#30188 ) issue: https://github.com/milvus-io/milvus/issues/30039 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-02-06 17:15:50 +08:00

22 Commits