milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-02-02 01:06:41 +08:00

Author	SHA1	Message	Date
yihao.dai	a7c818cadb	fix: [2.5] Fix no candidate segments error for small import (#41772 ) When autoID is enabled, the preimport task estimates row distribution by evenly dividing the total row count (numRows) across all vchannels: `estimatedCount = numRows / vchannelNum`. However, the actual import task hashes real auto-generated IDs to determine the target vchannel. This mismatch can lead to inaccurate row distribution estimates in corner cases such as: - Importing 1 row into 2 vchannels: • Preimport: 1 / 2 = 0 → both v0 and v1 are estimated to have 0 rows • Import: real autoID (e.g., 457975852966809057) hashes to v1 → actual result: v0 = 0, v1 = 1 To resolve this corner case, we now allocate at least one segment for each vchannel when autoID is enabled, ensuring all vchannels are prepared to receive data even if no rows are estimated for them. issue: https://github.com/milvus-io/milvus/issues/41759 pr: https://github.com/milvus-io/milvus/pull/41771 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-05-14 10:36:22 +08:00
aoiasd	8af350d9db	fix: [2.5] bulk insert should use function runner's input field list instead of schema's (#41561 ) relate: https://github.com/milvus-io/milvus/issues/41213 pr: https://github.com/milvus-io/milvus/pull/41560 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-04-27 22:16:40 +08:00
SimFG	18eb627533	fix: [2.5] Update logging context and upgrade dependencies (#41319 ) - issue: #41291 - pr: #41318 --------- Signed-off-by: SimFG <bang.fu@zilliz.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2025-04-24 23:50:40 +08:00
yihao.dai	1caeac937b	fix: [2.5] Fix delete data loss due to duplicate binlogID (#40985 ) This PR is a supplement to PR [#40960](https://github.com/milvus-io/milvus/pull/40960). issue: https://github.com/milvus-io/milvus/issues/40207 pr: https://github.com/milvus-io/milvus/pull/40960 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-03-31 10:38:31 +08:00
yihao.dai	27ea5d14dc	fix: [2.5] Fix delete data loss due to duplicate binlogID (#40976 ) With concurrent L0 compaction (https://github.com/milvus-io/milvus/pull/36816), delta logs might be written to the same L1 segment, causing logID duplication when using the incremental beginLogID. This PR removes the beginLogID mechanism and instead passes a log ID range, where the number of IDs in the range equals the number of compaction segment binlogs multiplied by an expansion factor. issue: https://github.com/milvus-io/milvus/issues/40207 pr: https://github.com/milvus-io/milvus/pull/40960 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-03-28 14:34:21 +08:00
XuanYang-cn	281260e48a	fix: Massive memory cost when compacting (#40763 ) downloads batch binlogs instead of all segment's binlogs See also: #40761 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-03-20 11:28:11 +08:00
XuanYang-cn	f455923ac9	enhance: Use correct counter metrics for overall wa calculation (#40394 ) (#40679 ) pr: #40394 - Use CounterVec to calculate sum of increase during a time period. - Use entries number instead of binlog size Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-03-17 15:06:19 +08:00
congqixia	709594f158	enhance: [2.5] Use v2 package name for pkg module (#40117 ) Cherry-pick from master pr: #39990 Related to #39095 https://go.dev/doc/modules/version-numbers Update pkg version according to golang dep version convention Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-23 00:46:01 +08:00
XuanYang-cn	8067113133	enhance: [cp25]Enable to observe write amplification (#39661 ) (#39743 ) pr: #39661 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-02-17 16:00:17 +08:00
zhenshan.cao	9918e1008d	fix: Fix import failed due to 0 row num (#39887 ) (#39904 ) issue: https://github.com/milvus-io/milvus/issues/39885 pr: https://github.com/milvus-io/milvus/pull/39886 --------- Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com> Co-authored-by: yihao.dai <yihao.dai@zilliz.com>	2025-02-17 01:36:15 +08:00
congqixia	a48749cc11	enhance: [2.5] Use mockery pkg config for datacoord&datanode (#39567 ) (#39577 ) Cherry-pick from master pr: #39567 Related to #38339 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-01-24 17:21:13 +08:00
XuanYang-cn	afef5fed60	fix: Clustering compaction ignoring deltalogs (#39133 ) See also: #39131 pr: #39132 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-01-10 14:07:05 +08:00
Zhen Ye	95809ca767	enhance: make new go package to manage proto (#39128 ) issue: #39095 pr: #39114 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-01-10 10:53:01 +08:00
jaime	0693634f62	enhance: add db name in replica description (#38673 ) issue: #36621 pr: #38672 Signed-off-by: jaime <yun.zhang@zilliz.com>	2025-01-09 19:43:04 +08:00
XuanYang-cn	b457c2f415	enhance: [2.5]Add missing delete metrics (#38634 ) (#38747 ) Add 2 counter metrics: - Total delete entries from deltalog: milvus_datanode_compaction_delete_count - Total missing deletes: milvus_datanode_compaction_missing_delete_count See also: #34665 pr: #38634 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-01-07 11:20:56 +08:00
aoiasd	6fa096eb39	fix:[Cherry-pick] bm25 import segment loss stats (#38881 ) relate: https://github.com/milvus-io/milvus/issues/38854 pr: https://github.com/milvus-io/milvus/pull/38855 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-12-31 19:24:54 +08:00
jaime	78438ef41e	fix: revert optimize CPU usage for CheckHealth requests (#35589 ) (#38555 ) issue: #35563 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-12-19 00:38:45 +08:00
jaime	29e620fa6d	fix: sync task still running after DataNode has stopped (#38377 ) issue: #38319 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-12-17 18:06:44 +08:00
jaime	28fdbc4e30	enhance: optimize CPU usage for CheckHealth requests (#35589 ) issue: #35563 1. Use an internal health checker to monitor the cluster's health state, storing the latest state on the coordinator node. The CheckHealth request retrieves the cluster's health from this latest state on the proxy sides, which enhances cluster stability. 2. Each health check will assess all collections and channels, with detailed failure messages temporarily saved in the latest state. 3. Use CheckHealth request instead of the heavy GetMetrics request on the querynode and datanode Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-12-17 11:02:45 +08:00
SimFG	2afe2eaf3e	feat: support to replicate collection when the services contains the system tt msg (#37559 ) - issue: #37105 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-12-17 09:08:46 +08:00
tinswzy	27229f7907	enhance: refine exists log print with ctx (#38080 ) issue: #35917 Refines exists log print with ctx Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2024-12-14 22:36:44 +08:00
cai.zhang	6ffc57c8dc	fix: Fix sorting buffer in clustering compaction (#38417 ) issue: #28410 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-12-13 10:12:49 +08:00
Ted Xu	dc85d8e968	enhance: improve mix compaction performance by removing max segment limitations (#38344 ) See #37234 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-12-11 20:38:42 +08:00
yihao.dai	43e0e2b7ed	fix: Fix empty import task result (#38316 ) Ensure the idempotency of import tasks to prevent duplicate tasks in DataNode. issue: https://github.com/milvus-io/milvus/issues/38313 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-12-11 15:42:49 +08:00
cai.zhang	41b19c6b1d	enhance: Determine the number of buffers based on the resource limits of the DataNode (#38209 ) issue: #28410 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-12-08 18:02:40 +08:00
jaime	8ed019735c	enhance: add disk stats within system metrics (#38033 ) issue: #36621 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-12-06 16:32:41 +08:00
jaime	7bbfe86bcd	enhance: add list index and segment index retrieval API for WebUI (#37861 ) issue: #36621 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-11-22 16:58:34 +08:00
cai.zhang	dae4160466	enhance: Whether to enable mergeSort mode when performing mixCompaction (#37664 ) issue: #37579 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-11-19 11:28:31 +08:00
congqixia	b0bd290a6e	enhance: Use internal json(sonic) to replace std json lib (#37708 ) Related to #35020 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-18 10:46:31 +08:00
congqixia	5e90f348fc	enhance: Handle legacy proxy load fields request (#37565 ) Related to #35415 In rolling upgrade, legacy proxy may dispatch load request with an empty load field list. The upgraded querycoord may report error by mistake that load field list is changed. This PR: - Auto fill empty load field list with all user field ids - Refine the error message when load field list updates - Refine load job unit test with service cases Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-11 10:14:26 +08:00
sthuang	70605cf5b3	enhance: Support custom privilege group for RBAC (#37087 ) issue: #37031 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-11-09 08:44:28 +08:00
yihao.dai	81879425e1	enhance: Optimize the performance of stats task (#37374 ) 1. Increase the writer's `batchSize` to avoid multiple serialization operations. 2. Perform asynchronous upload of binlog files to prevent blocking the data processing flow. 3. Reduce multiple calls to `writer.Flush()`. issue: https://github.com/milvus-io/milvus/issues/37373 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-11-08 10:08:27 +08:00
Ted Xu	bc9562feb1	enhance: avoid memory copy and serde in mix compaction (#37479 ) See: #37234 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-11-07 16:30:57 -08:00
wei liu	00f6d0ec51	fix: watch channel stuck due to misuse of timer.Reset (#37433 ) issue: #37166 The misuse of timer.Reset caused the dispatcher to fail to send messages to the virtual channel buffer, so the dispatcher split again and again while holding the dispatcher manager's lock, which blocked channel watching. This PR fixes the misuse of timer.Reset Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-07 14:34:24 +08:00
jaime	f348bd9441	feat: add segment,pipeline, replica and resourcegroup api for WebUI (#37344 ) issue: #36621 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-11-07 11:52:25 +08:00
aoiasd	b4c749dcd5	fix: merge sort segment loss data (#37400 ) relate: https://github.com/milvus-io/milvus/issues/37238 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-11-07 11:18:26 +08:00
Ted Xu	b792b199d7	enhance: load deltalogs on demand when doing compactions (#37310 ) See #37234 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-11-01 16:40:21 +08:00
Zhen Ye	448cc08960	fix: flowgraph crash when channel releasing (#37285 ) issue: #37284 Signed-off-by: chyezh <chyezh@outlook.com>	2024-10-31 16:30:21 +08:00
Ted Xu	262a994d6d	enhance: generally improve the performance of mix compactions (#37163 ) See #37234 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-10-29 18:12:20 +08:00
congqixia	3106384fc4	enhance: Return deltadata for `DeleteCodec.Deserialize` (#37214 ) Related to #35303 #30404 This PR changes the return type of `DeleteCodec.Deserialize` from `storage.DeleteData` to `DeltaData`, which reduces the memory usage of interface headers. It also refines the `storage.DeltaData` methods to make them easier to use. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-29 12:04:24 +08:00
jaime	9d16b972ea	feat: add tasks page into management WebUI (#37002 ) issue: #36621 1. Add API to access task runtime metrics, including: - build index task - compaction task - import task - balance (including load/release of segments/channels and some leader tasks on querycoord) - sync task 2. Add a debug mode to the webpage, enabled or disabled with debug=true or debug=false in the URL query parameters. Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-28 10:13:29 +08:00
yihao.dai	d7b2906318	enhance: Make dataNode.import.maxConcurrentTaskNum dynamic (#37102 ) Resize import execution pool when config `dataNode.import.maxConcurrentTaskNum` update. issue: https://github.com/milvus-io/milvus/issues/37095 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-25 16:51:29 +08:00
jaime	4746f47282	feat: management WebUI homepage (#36822 ) issue: #36784 1. Implement an embedded web server for WebUI access. 2. Complete the homepage development. Home page demo: <img width="2177" alt="iShot_2024-10-10_17 57 34" src="https://github.com/user-attachments/assets/38539917-ce09-4e54-a5b5-7f4f7eaac353"> Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-23 11:29:28 +08:00
cai.zhang	04c306e63f	fix: Fix clustering compaction task leak (#36800 ) issue: #36686 bug reason: - The clustering compaction tasks on the datanode were never cleaned up. - The clustering compaction task contains a mapping from clustering key to buffer, which caused a large memory leak. fix: - clean the tasks on datanode by datacoord when clustering compaction finished. - reset the mapping from clustering key to buffer on datanode when clustering finished. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-10-17 20:43:30 +08:00
XuanYang-cn	b172ea1093	fix: Remove enableLevelZeroSegment config (#36535 ) See also: #36504 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-10-17 11:59:24 +08:00
aoiasd	5ec4163d0f	feat: support bm25 logs mixcompaction (#36072 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-10-14 16:57:22 +08:00
Buqian Zheng	82c5cf2fa2	feat: add bulk insert support for Functions (#36715 ) issue: https://github.com/milvus-io/milvus/issues/35853 and https://github.com/milvus-io/milvus/issues/35856 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-10-12 17:19:20 +08:00
CharlesFeng	7c8b71e26c	fix: BinlogDeserializeReader leak in mix_compactor.go (#36270 ) https://github.com/milvus-io/milvus/issues/36269 Signed-off-by: fengjun2016 <jornfeng@gmail.com>	2024-10-11 15:41:20 +08:00
XuanYang-cn	290ceb4e84	enhance: Add more info in logs (#36731 ) Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-10-10 17:51:25 +08:00
yihao.dai	0fc2a4aa53	enhance: Optimize import scheduling and add time cost metric (#36601 ) 1. Optimize import scheduling strategy: a. Revise slot weights, calculating them based on the number of files and segments for both import and pre-import tasks. b. Ensure that the DN executes tasks in ascending order of task ID. 2. Add time cost metric and log. issue: https://github.com/milvus-io/milvus/issues/36600, https://github.com/milvus-io/milvus/issues/36518 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-09 14:41:20 +08:00
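
The corner case described in commit a7c818cadb (small import with autoID) can be illustrated with a minimal Go sketch. Only the formula `estimatedCount = numRows / vchannelNum` and the "at least one segment per vchannel when autoID is enabled" rule come from the commit message; the function names `estimateRows` and `segmentsToAllocate` are hypothetical and do not reflect the actual Milvus code.

```go
package main

import "fmt"

// estimateRows reproduces the preimport estimate quoted in the commit message:
// estimatedCount = numRows / vchannelNum, using integer division.
// The function name is hypothetical.
func estimateRows(numRows, vchannelNum int) []int {
	est := make([]int, vchannelNum)
	for i := range est {
		est[i] = numRows / vchannelNum
	}
	return est
}

// segmentsToAllocate is a hypothetical helper: it returns how many segments to
// prepare per vchannel. Without autoID, a vchannel with zero estimated rows
// gets no segment; with autoID, every vchannel gets at least one.
func segmentsToAllocate(estimated []int, autoID bool) []int {
	segs := make([]int, len(estimated))
	for i, rows := range estimated {
		if rows > 0 || autoID {
			segs[i] = 1
		}
	}
	return segs
}

func main() {
	est := estimateRows(1, 2)                   // importing 1 row into 2 vchannels
	fmt.Println(est)                            // [0 0]
	fmt.Println(segmentsToAllocate(est, false)) // [0 0]: no candidate segment for the real row
	fmt.Println(segmentsToAllocate(est, true))  // [1 1]: every vchannel can receive data
}
```

With one row and two vchannels the integer division yields zero for both channels, so whichever channel the real autoID hashes to has no candidate segment unless one is allocated unconditionally.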

1151 Commits