issue: #38399
- Make a timetick-commit-based write-ahead buffer on the write side.
- Add a switchable scanner on the read side to transition between
catch-up and tailing reads (sketched below).
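A minimal Go sketch of the read-side idea, assuming hypothetical names (switchableScanner, stateCatchup, stateTailing) rather than the actual streaming-service types: the scanner drains persisted messages up to the committed time tick, then switches to tailing the in-memory write-ahead buffer.

```go
package main

import "fmt"

// scannerState is a hypothetical two-state enum for the switchable scanner.
type scannerState int

const (
	stateCatchup scannerState = iota // replaying persisted messages
	stateTailing                     // reading the in-memory write-ahead buffer
)

// message is a simplified record carrying only a time tick.
type message struct{ timeTick uint64 }

// switchableScanner serves catch-up messages first, then switches to the
// tailing buffer once the committed time tick has been reached.
type switchableScanner struct {
	state     scannerState
	committed uint64    // last committed time tick in the write-ahead buffer
	catchup   []message // persisted messages (e.g. replayed from the WAL)
	tailing   chan message
}

func (s *switchableScanner) next() (message, bool) {
	if s.state == stateCatchup {
		if len(s.catchup) > 0 && s.catchup[0].timeTick <= s.committed {
			m := s.catchup[0]
			s.catchup = s.catchup[1:]
			return m, true
		}
		// Caught up: switch to tailing the in-memory buffer.
		s.state = stateTailing
	}
	m, ok := <-s.tailing
	return m, ok
}

func main() {
	tail := make(chan message, 2)
	tail <- message{timeTick: 11}
	close(tail)

	s := &switchableScanner{
		committed: 10,
		catchup:   []message{{timeTick: 9}, {timeTick: 10}},
		tailing:   tail,
	}
	for m, ok := s.next(); ok; m, ok = s.next() {
		fmt.Println("time tick:", m.timeTick)
	}
}
```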
Signed-off-by: chyezh <chyezh@outlook.com>
This PR limits the maximum number of consumers per pchannel to 10 for
each QueryNode and DataNode.
issue: https://github.com/milvus-io/milvus/issues/37630
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
1. Make the segment loader lock protect only the resource.
2. Optimize GetDiskUsage to avoid excessive overhead (sketched below).
issue: https://github.com/milvus-io/milvus/issues/37630
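One common way to make GetDiskUsage cheap is to memoize the probe behind a TTL; this Go sketch uses illustrative names (diskUsageCache, probe) and is not the actual Milvus implementation.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// diskUsageCache memoizes an expensive disk-usage probe so that frequent
// callers do not each pay the cost of walking the data path.
type diskUsageCache struct {
	mu        sync.Mutex
	ttl       time.Duration
	lastProbe time.Time
	lastBytes uint64
	probe     func() uint64 // the expensive call, e.g. walking the local data dir
}

func (c *diskUsageCache) GetDiskUsage() uint64 {
	c.mu.Lock()
	defer c.mu.Unlock()
	if time.Since(c.lastProbe) < c.ttl {
		return c.lastBytes // serve the cached value, skip the expensive probe
	}
	c.lastBytes = c.probe()
	c.lastProbe = time.Now()
	return c.lastBytes
}

func main() {
	probes := 0
	cache := &diskUsageCache{
		ttl: time.Second,
		probe: func() uint64 {
			probes++
			return 42 << 20 // pretend 42 MiB are in use
		},
	}
	for i := 0; i < 3; i++ {
		fmt.Println(cache.GetDiskUsage(), "bytes, probes so far:", probes)
	}
}
```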
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
1. DataNode: Skip generating BF during the insert phase (BF will be
regenerated during the sync phase).
2. QueryNode: Skip generating or maintaining BF for growing segments;
deletion checks will be handled in the segcore.
issue: https://github.com/milvus-io/milvus/issues/37630
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
issue: #38399
- make the broadcast service available to msgstream by reusing the
streaming service architecture
---------
Signed-off-by: chyezh <chyezh@outlook.com>
issue: #35563
1. Use an internal health checker to monitor the cluster's health state,
storing the latest state on the coordinator node. The CheckHealth
request retrieves the cluster's health from this latest state on the
proxy side, which enhances cluster stability.
2. Each health check will assess all collections and channels, with
detailed failure messages temporarily saved in the latest state.
3. Use the CheckHealth request instead of the heavy GetMetrics request
on the QueryNode and DataNode (see the sketch below).
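A minimal Go sketch of the checker shape, with illustrative names (healthChecker, healthState) rather than the real coordinator types: a background loop stores the latest probe result, and CheckHealth answers from that stored state instead of querying every node.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// healthState is a simplified snapshot of cluster health kept on the coordinator.
type healthState struct {
	IsHealthy bool
	Reasons   []string
}

// healthChecker periodically probes collections/channels and stores the
// latest result, so that CheckHealth can answer from memory.
type healthChecker struct {
	mu     sync.RWMutex
	latest healthState
	probe  func() healthState // checks all collections and channels
}

func (h *healthChecker) start(interval time.Duration, stop <-chan struct{}) {
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for {
		select {
		case <-ticker.C:
			state := h.probe()
			h.mu.Lock()
			h.latest = state
			h.mu.Unlock()
		case <-stop:
			return
		}
	}
}

// CheckHealth returns the most recently stored state instead of fanning out
// heavy metric requests to every node.
func (h *healthChecker) CheckHealth() healthState {
	h.mu.RLock()
	defer h.mu.RUnlock()
	return h.latest
}

func main() {
	stop := make(chan struct{})
	hc := &healthChecker{probe: func() healthState {
		return healthState{IsHealthy: true}
	}}
	go hc.start(10*time.Millisecond, stop)
	time.Sleep(30 * time.Millisecond)
	fmt.Println("healthy:", hc.CheckHealth().IsHealthy)
	close(stop)
}
```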
Signed-off-by: jaime <yun.zhang@zilliz.com>
issue: #38142
The current balance-channel policy only considers the current
collection's distribution, so if every collection has 1 channel and all
channels have been loaded on the same QueryNode, channel balance won't
be triggered after the QueryNode count increases.
This PR enables a score-based balance-channel policy (sketched below) to
achieve:
1. distribute all channels evenly across multiple QueryNodes
2. distribute each collection's channels evenly across multiple
QueryNodes.
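A hedged Go sketch of one possible scoring rule; the weighting and names (nodeChannelScore, collectionWeight) are illustrative, not the exact QueryCoord formula. The score mixes a node's global channel count with the channels it already holds for the collection being balanced, and the next channel goes to the lowest-scoring node.

```go
package main

import "fmt"

// nodeChannelScore mixes the node's global channel count with the channel
// count it already holds for the collection being balanced.
func nodeChannelScore(globalChannels, collectionChannels int) float64 {
	const collectionWeight = 2.0 // bias toward spreading a single collection's channels
	return float64(globalChannels) + collectionWeight*float64(collectionChannels)
}

// pickNode returns the node with the lowest score for the next channel.
func pickNode(global, perCollection map[string]int) string {
	best, bestScore := "", 0.0
	for node, g := range global {
		s := nodeChannelScore(g, perCollection[node])
		if best == "" || s < bestScore {
			best, bestScore = node, s
		}
	}
	return best
}

func main() {
	global := map[string]int{"qn1": 3, "qn2": 0}        // channels per node across all collections
	perCollection := map[string]int{"qn1": 1, "qn2": 0} // channels of this collection per node
	fmt.Println("assign next channel to:", pickNode(global, perCollection))
}
```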
---------
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
issue: #33550
Segment balance and channel balance execute at the same time, which
causes a bunch of corner cases.
This PR disables simultaneous balancing of segments and channels.
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
Limit the maximum concurrency of channel tasks for each DataNode to
prevent excessive subscriptions from causing DataNode OOM.
issue: https://github.com/milvus-io/milvus/issues/37665
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
1. Optimize the import scheduling strategy (sketched below):
a. Revise slot weights, calculating them based on the number of files
and segments for both import and pre-import tasks.
b. Ensure that the DataNode executes tasks in ascending order of task
ID.
2. Add time cost metrics and logs.
issue: https://github.com/milvus-io/milvus/issues/36600,
https://github.com/milvus-io/milvus/issues/36518
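A Go sketch of the scheduling idea under stated assumptions: the slot-weight formula (files + segments) and the names (importTask, slots, schedule) are illustrative, while the ascending task-ID ordering follows the description above.

```go
package main

import (
	"fmt"
	"sort"
)

// importTask is a simplified task descriptor.
type importTask struct {
	ID       int64
	Files    int
	Segments int
}

// slots estimates how many scheduler slots a task occupies, based on its
// file and segment counts.
func slots(t importTask) int {
	return t.Files + t.Segments
}

// schedule picks tasks in ascending task-ID order until the DataNode's
// available slots are exhausted.
func schedule(tasks []importTask, availableSlots int) []importTask {
	sort.Slice(tasks, func(i, j int) bool { return tasks[i].ID < tasks[j].ID })
	var picked []importTask
	for _, t := range tasks {
		if w := slots(t); w <= availableSlots {
			picked = append(picked, t)
			availableSlots -= w
		}
	}
	return picked
}

func main() {
	tasks := []importTask{{ID: 7, Files: 2, Segments: 4}, {ID: 3, Files: 1, Segments: 1}}
	for _, t := range schedule(tasks, 6) {
		fmt.Println("run task", t.ID, "using", slots(t), "slots")
	}
}
```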
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
issue: #35859
This PR introduces two new params: toleranceFactor and checkRequestNum.
After every checkRequestNum requests have been assigned, the policy
recomputes the QueryNodes' workload scores.
If the score difference is less than toleranceFactor, the replica
selection policy falls back to round_robin, which reduces the average
cost to about 500 ns.
If the difference is larger than toleranceFactor, the replica selection
policy computes each QueryNode's score and selects the target node with
the smallest score in every assignment.
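A hedged Go sketch of the two-mode selection; parameter names mirror the PR, but the scoring and the way scores are gathered are illustrative, and the periodic recomputation every checkRequestNum assignments is left out for brevity.

```go
package main

import "fmt"

// selectReplica sketches the two-mode policy: if the spread between the
// lightest and heaviest QueryNode is within toleranceFactor, fall back to
// round_robin; otherwise pick the node with the smallest workload score.
func selectReplica(nodes []int64, scores map[int64]float64, toleranceFactor float64, rrCursor *int) int64 {
	lo, hi := scores[nodes[0]], scores[nodes[0]]
	for _, n := range nodes {
		if scores[n] < lo {
			lo = scores[n]
		}
		if scores[n] > hi {
			hi = scores[n]
		}
	}
	if hi-lo <= toleranceFactor {
		// Workloads are close enough: cheap round_robin selection.
		*rrCursor = (*rrCursor + 1) % len(nodes)
		return nodes[*rrCursor]
	}
	// Workloads diverge: pick the node with the smallest score.
	best := nodes[0]
	for _, n := range nodes {
		if scores[n] < scores[best] {
			best = n
		}
	}
	return best
}

func main() {
	nodes := []int64{1, 2, 3}
	scores := map[int64]float64{1: 10, 2: 55, 3: 12}
	cursor := 0
	fmt.Println("target node:", selectReplica(nodes, scores, 5.0, &cursor))
}
```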
---------
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
issue: #35570
Milvus supports a config cache to speed up config access, but it only
evicts a param's cache entry when that param itself is updated. However,
a param may rely on another param's value: say paramA relies on paramB.
When paramB is updated, only paramB's cache entry is evicted, while
paramA's cache still keeps the stale value.
This PR evicts the entire config cache to solve the above issue, since
dynamic config updates are not very frequent.
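A minimal Go sketch of the eviction idea with illustrative names (configCache, EvictAll), not the actual paramtable API: any dynamic update drops the whole cache so derived params cannot serve stale values.

```go
package main

import (
	"fmt"
	"sync"
)

// configCache caches formatted/derived param values. Because a cached param
// may be derived from another param, the safe response to any dynamic update
// is to drop the whole cache rather than a single key.
type configCache struct {
	mu    sync.RWMutex
	items map[string]string
}

func (c *configCache) Get(key string) (string, bool) {
	c.mu.RLock()
	defer c.mu.RUnlock()
	v, ok := c.items[key]
	return v, ok
}

// EvictAll clears every cached entry; dynamic updates are rare, so the
// occasional full recomputation is cheaper than tracking dependencies.
func (c *configCache) EvictAll() {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.items = make(map[string]string)
}

func main() {
	c := &configCache{items: map[string]string{"paramA": "derived-from-paramB"}}
	// paramB was updated dynamically: drop everything so paramA is recomputed too.
	c.EvictAll()
	_, ok := c.Get("paramA")
	fmt.Println("paramA still cached:", ok)
}
```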
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
issue: #35170
This PR enables setting load configs at the cluster level, such as
replica number and resource groups; these load configs are then applied
when collections are loaded.
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
1. Fix compaction tasks not being cleaned up correctly.
2. Add a new parameter to control the compaction GC loop interval.
3. Remove some useless configs of clustering compaction.
bug: #34764
Signed-off-by: wayblink <anyang.wang@zilliz.com>
issue: #34798
After removing task priority on QueryCoord, to avoid load/release
segment tasks being blocked by too many balance tasks, we limit the
number of balance tasks in each round. At the same time, we reduce the
balance interval to trigger balance more frequently.
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
Seals the largest growing segment if the total size of growing segments
of each shard exceeds the size threshold (default 4 GB). Introducing this
policy can help keep the size of growing segments within a suitable
level, alleviating the pressure on the delegator.
issue: https://github.com/milvus-io/milvus/issues/34554
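A Go sketch of the seal policy as described above; the function and type names are illustrative, not the actual DataCoord code.

```go
package main

import "fmt"

// sealByTotalGrowingSize sketches the policy: when the growing segments of a
// shard together exceed sizeThreshold (4 GB by default per this PR), seal the
// largest one.
func sealByTotalGrowingSize(growingSizes map[int64]int64, sizeThreshold int64) (segmentID int64, seal bool) {
	var total, largest, largestID int64
	for id, size := range growingSizes {
		total += size
		if size > largest {
			largest, largestID = size, id
		}
	}
	if total <= sizeThreshold {
		return 0, false
	}
	return largestID, true
}

func main() {
	const gb = int64(1) << 30
	growing := map[int64]int64{101: 1 * gb, 102: 3 * gb, 103: 1 * gb} // per-segment sizes of one shard
	if id, ok := sealByTotalGrowingSize(growing, 4*gb); ok {
		fmt.Println("seal growing segment", id)
	}
}
```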
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
issue: #34595
When consuming insert data on the delegator node, QueryCoord will move
out some sealed segments to manage its memory usage. After the growing
segment gets flushed, some sealed segments from other workers will be
moved back to the delegator node. To avoid the frequent movement of
segments, we estimate the maximum growing row count and preserve a
fixed-size memory in the delegator node.
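A hedged Go sketch of the reservation idea; the rows-times-bytes estimate and the names (reservedGrowingMemory, sealedBudget) are assumptions for illustration, not the actual QueryCoord formula.

```go
package main

import "fmt"

// reservedGrowingMemory estimates the largest amount of growing data a
// delegator may hold, so that much memory stays free and sealed segments are
// not repeatedly moved out and back in.
func reservedGrowingMemory(maxGrowingRows, bytesPerRow int64) int64 {
	return maxGrowingRows * bytesPerRow
}

// sealedBudget is the memory left for sealed segments on the delegator node.
func sealedBudget(nodeMemory, reserved int64) int64 {
	if reserved > nodeMemory {
		return 0
	}
	return nodeMemory - reserved
}

func main() {
	const gb = int64(1) << 30
	reserved := reservedGrowingMemory(2_000_000, 512) // ~1 GiB reserved for growing data
	fmt.Println("sealed segment budget:", sealedBudget(8*gb, reserved), "bytes")
}
```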
---------
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
See also #34574
Add jitter to the segment seal proportion to avoid bursts of seal
operations in a short period of time.
This PR also fixes the license header in the paramtable pkg.
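A hedged Go sketch of one way such jitter can be applied; the formula and the jitterRatio name are assumptions for illustration, not the actual paramtable logic.

```go
package main

import (
	"fmt"
	"math/rand"
)

// sealProportionWithJitter shows how a small random jitter spreads seal
// decisions over time: each evaluation uses a slightly different effective
// proportion, so segments do not all hit the threshold at the same instant.
func sealProportionWithJitter(base, jitterRatio float64) float64 {
	// Shrink the base proportion by up to jitterRatio, chosen uniformly.
	return base * (1 - jitterRatio*rand.Float64())
}

func main() {
	for i := 0; i < 3; i++ {
		fmt.Printf("effective seal proportion: %.4f\n", sealProportionWithJitter(0.12, 0.1))
	}
}
```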
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
issue: #32995
To speed up the construction and querying of Bloom filters, we chose a
blocked Bloom filter instead of a basic Bloom filter implementation.
WARN: This PR is compatible with the old BF implementation, but if you
fall back to an old Milvus version, Bloom filter deserialization may
fail.
In single Bloom filter test cases with a capacity of 1,000,000 and a
false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times
faster than the basic Bloom filter in both querying and construction, at
the cost of a 30% increase in memory usage.
- Block BF construct time {"time": "54.128131ms"}
- Block BF size {"size": 3021578}
- Block BF Test cost {"time": "55.407352ms"}
- Basic BF construct time {"time": "210.262183ms"}
- Basic BF size {"size": 2396308}
- Basic BF Test cost {"time": "192.596229ms"}
In multi Bloom filter test cases with a capacity of 100,000, an FPR of
0.001, and 100 Bloom filters, we reuse the primary key locations for all
Bloom filters to avoid repeated hash computations. As a result, the
blocked Bloom filter is also 5 times faster than the basic Bloom filter
in querying.
- Block BF TestLocation cost {"time": "529.97183ms"}
- Basic BF TestLocation cost {"time": "3.197430181s"}
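A self-contained Go sketch of the location-reuse trick; the toy filter below is not the blocked Bloom filter Milvus adopted, it only illustrates hashing the primary key once and probing every segment's filter with the same locations.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// toyFilter is a deliberately simple bit-array filter used only to show how
// precomputed locations are shared across many filters.
type toyFilter struct{ bits []bool }

// locations hashes the key once and derives k candidate bit positions via
// double hashing, so every filter can be probed without re-hashing.
func locations(key string, k, m uint64) []uint64 {
	h := fnv.New64a()
	h.Write([]byte(key))
	h1 := h.Sum64()
	h2 := h1>>33 | h1<<31 // second hash derived from the first
	locs := make([]uint64, k)
	for i := uint64(0); i < k; i++ {
		locs[i] = (h1 + i*h2) % m
	}
	return locs
}

func (f *toyFilter) add(locs []uint64) {
	for _, l := range locs {
		f.bits[l] = true
	}
}

func (f *toyFilter) testLocations(locs []uint64) bool {
	for _, l := range locs {
		if !f.bits[l] {
			return false
		}
	}
	return true
}

func main() {
	const m, k = 1 << 16, 4
	segments := []*toyFilter{{bits: make([]bool, m)}, {bits: make([]bool, m)}}
	segments[0].add(locations("pk-42", k, m))

	// Compute the probe key's locations once, reuse them for every segment's filter.
	locs := locations("pk-42", k, m)
	for i, f := range segments {
		fmt.Printf("segment %d may contain pk-42: %v\n", i, f.testLocations(locs))
	}
}
```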
---------
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
YAML automatically parses "off" as a boolean value (see the sketch
below). We should avoid using bare "off" in the future.
issue: https://github.com/milvus-io/milvus/issues/32772
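A small Go illustration of the pitfall, assuming the config keys shown are made up: with a YAML 1.1 parser such as gopkg.in/yaml.v2, a bare off resolves to the boolean false, while a quoted "off" stays a string.

```go
package main

import (
	"fmt"

	"gopkg.in/yaml.v2" // YAML 1.1 resolution treats on/off/yes/no as booleans
)

func main() {
	// Without quotes, "off" resolves to the boolean false rather than a string.
	var cfg map[string]interface{}
	doc := []byte("indexType: off\nquoted: \"off\"\n")
	if err := yaml.Unmarshal(doc, &cfg); err != nil {
		panic(err)
	}
	fmt.Printf("unquoted: %v (%T)\n", cfg["indexType"], cfg["indexType"]) // false (bool)
	fmt.Printf("quoted:   %v (%T)\n", cfg["quoted"], cfg["quoted"])       // off (string)
}
```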
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>