milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-02-02 01:06:41 +08:00

Author	SHA1	Message	Date
jaime	319f5494cd	enhance: optimize CPU usage for CheckHealth requests (#35595 ) issue: #35563 pr: #35589 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-12-04 14:26:41 +08:00
wei liu	93063ce1f9	fix: Prevent simultaneous balance of segments and channels (#37850 ) (#37939 ) issue: #33550 pr: #37850 balance segment and balance segment execute at same time, which will cause bounch of corner case. This PR disable simultaneous balance of segments and channels Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-26 10:26:40 +08:00
yihao.dai	4e0f5845a1	enhance: Limit import job number (#36891 ) (#36892 ) issue: https://github.com/milvus-io/milvus/issues/36890 pr: https://github.com/milvus-io/milvus/pull/36891 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-18 18:13:25 +08:00
yihao.dai	8923936c9a	enhance: Support memory mode chunk cache (#35347 ) (#35836 ) Chunk cache supports loading raw vectors into memory. issue: https://github.com/milvus-io/milvus/issues/35273 pr: https://github.com/milvus-io/milvus/pull/35347 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-18 17:03:25 +08:00
yihao.dai	a4ef93457d	enhance: Optimize import scheduling and add time cost metric (#36601 ) (#36684 ) 1. Optimize import scheduling strategic: a. Revise slot weights, calculating them based on the number of files and segments for both import and pre-import tasks. b. Ensure that the DN executes tasks in ascending order of task ID. 2. Add time cost metric and log. issue: https://github.com/milvus-io/milvus/issues/36600, https://github.com/milvus-io/milvus/issues/36518 pr: https://github.com/milvus-io/milvus/pull/36601 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-11 10:27:22 +08:00
wei liu	ad5d24be65	enhance: Optimize workload based replica selection policy (#36181 ) (#36384 ) issue: #35859 pr: #36181 This PR introduce two new param: toleranceFactor and checkRequestNum, after every checkRequestNum request has been assigned, try to compute querynode's workload score. if the diff is less than the toleranceFactor, replica selection policy will fallback to round_robin, which reduce the average cost to about 500ns. if the diff is larger than the toleranceFactor, replica selection policy will compute querynode's score to select the target node with smallest score in every assigment. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-26 11:19:14 +08:00
SimFG	a35d99eabf	fix: [2.4] long buffering causes mq to be unable to receive messages. (#36425 ) - issue: #36397 - pr: #36420 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-09-23 16:33:17 +08:00
Ted Xu	45b2049d5d	fix: fallback params may be overridden (#35972 ) (#36006 ) See #35756 --------- pr: #35972 Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-09-05 19:05:05 +08:00
wei liu	e014ad9280	fix: fix dynamic update config doesn't works for some param (#35572 ) (#35637 ) issue: #35570 pr: #35572 milvus support config cache to spped up config access, but only evict param's cache when param has been updated. but milvus's param may rely on other param's value, let's say ParamsA relys on paramsB, when paramsB updated, it will evict paramB's cache, but the paramA's cache still keep the old value. This PR evict all config cache to solve the above issue, cause dynamic update config won't be much frequetly. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-22 16:00:58 +08:00
wei liu	14ec3dc357	enhance: Enable ReadOnly/ReadWrite/Admin Privilege Group to simplify RBAC grant progress (#35472 ) (#35543 ) issue: #35471 pr: #35472 #35515 --------- --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-19 16:24:54 +08:00
wei liu	0201e00a2f	enhance: enable to set load config in cluster level (#35293 ) issue: #35170 pr: #35169 This PR enable to set load configs in cluster level, such as replicas and resource groups. then when load collections will use the load config. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-07 12:38:21 +08:00
cai.zhang	2534b30e39	enhance: [cherry-pick] Add monitoring metrics for task execution time in datacoord (#35141 ) issue: #35138 master pr: #35139 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-08-02 19:46:16 +08:00
wei liu	d767f8977a	enhance: Refine param init for MmapDirPath (#35181 ) (#35214 ) pr: #35181 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-02 16:30:15 +08:00
wei liu	5f601fcc50	enhance: Reduce delegator memory overloaded factor to 0.1 (#35092 ) (#35164 ) pr: #35092 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-01 14:20:13 +08:00
cai.zhang	c340f387cf	enhance: [cherry-pick] Change the fixed value to a ratio for clustering segment size (#35075 ) issue: #34495 master pr: #35076 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-31 10:32:00 +08:00
wei liu	b3bc7f3985	enhance: Limit collection's normal balance speed (#34810 ) (#34987 ) issue: #34798 pr: #34810 after we remove the task priority on query coord, to avoid load/release segment blocked by too much balance task, we limit the balance task size in each round. at same time, we reduce the balance interval to trigger balance more frequently. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-26 10:13:46 +08:00
jaime	77ae127a62	fix: check collection health(queryable) fail for releasing collection (#34948 ) issue: #34946 pr: #34947 --------- Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-07-25 10:25:57 +08:00
cai.zhang	74adedf750	enhance: Optimized the GC logic to ensure that memory is released in time (#34950 ) issue: #34703 master pr: #34949 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-24 14:07:43 +08:00
yihao.dai	07bc1b6717	enhance: Seal by total growing segments size (#34692 ) (#34779 ) Seals the largest growing segment if the total size of growing segments of each shard exceeds the size threshold(default 4GB). Introducing this policy can help keep the size of growing segments within a suitable level, alleviating the pressure on the delegator. issue: https://github.com/milvus-io/milvus/issues/34554 pr: https://github.com/milvus-io/milvus/pull/34692 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-19 18:25:50 +08:00
SimFG	0e226502e4	enhance: [2.4] pick default root password and log level pr (#34777 ) default root password - issue: #33058 - pr: #34752 set log level - issue: #34756 - pr: #34757 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-07-18 13:45:43 +08:00
wayblink	a26e965e6a	enhance:[cherry-pick] Add compaction task slot usage logic (#34625 ) issue: #34544 pr: #34581 --------- Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-07-18 09:55:43 +08:00
wayblink	83fc26c31a	fix: [cherry-pick] compaction task not be cleaned correctly (#34766 ) 1.fix compaction task not be cleaned correctly 2.add a new parameter to control compaction gc loop interval 3.remove some useless configs of clustering compaction bug: #34764 pr: #34765 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-07-17 22:17:42 +08:00
wei liu	cf701a9bf0	enhance: Preserve fixed-size memory in delegator node for growing segment (#34600 ) issue: #34595 pr: #34596 When consuming insert data on the delegator node, QueryCoord will move out some sealed segments to manage its memory usage. After the growing segment gets flushed, some sealed segments from other workers will be moved back to the delegator node. To avoid the frequent movement of segments, we estimate the maximum growing row count and preserve a fixed-size memory in the delegator node. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-15 20:51:46 +08:00
wei liu	d3e94f9861	enhance: Use Blocked Bloom Filter instead of basic bloom fitler impl (#34377 ) issue: #32995 pr: #33405 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with old version bf impl, but if fall back to old milvus version, it may causes bloom filter deserialize failed. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. Block BF construct time {"time": "54.128131ms"} Block BF size {"size": 3021578} Block BF Test cost {"time": "55.407352ms"} Basic BF construct time {"time": "210.262183ms"} Basic BF size {"size": 2396308} Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. Block BF TestLocation cost {"time": "529.97183ms"} Basic BF TestLocation cost {"time": "3.197430181s"} Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-05 17:04:10 +08:00
wayblink	c62bf8a0b0	fix: [Cherry-pick]Pick major compaction fixs and optimizations (#34360 ) This PR cherry-picks the following commits: - fix: sync partitiion stats blocking balance task #33742 - fix: Fix meta prefix overlap bug #33830 - fix: Small fixs of major compaction #33929 - fix: Fix memory buffer error & some renaming #33850 - fix: sync part stats task cannot be finished #34027 - Add an option to enable/disable vector field clustering key #34097 - fix: fix error ignore in compactor #34169 - fix:load major compaction partial result #34052 - Use new stream segment reader in clustering compaction #34232 issue: #30633 pr: #33742 #33830 #33929 #33850 #34027 #34097 #34169 #34052 #34232 --------- Signed-off-by: MrPresent-Han <chun.han@zilliz.com> Signed-off-by: wayblink <anyang.wang@zilliz.com> Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: Chun Han <116052805+MrPresent-Han@users.noreply.github.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-03 09:53:37 +08:00
cai.zhang	c924b0b502	enhance: [cherry-pick] Refine index code and support analyze data (#34311 ) This PR primary picks up the support analyzing functionality, including the following commits: - main functionality: https://github.com/milvus-io/milvus/pull/33651 - refine indexnode code: https://github.com/milvus-io/milvus/pull/33458 - related fixes: - https://github.com/milvus-io/milvus/pull/33832 - https://github.com/milvus-io/milvus/pull/33161 issue: #30633 master prs: #33651 , #33458 , #33832 , #33161 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com> Co-authored-by: chasingegg <chao.gao@zilliz.com> Co-authored-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-07-02 09:50:39 +08:00
yihao.dai	b1e74dc7cb	enhance: [cherry-pick] Decouple compaction from shard (#34157 ) This PR cherry-picks the following commits: - Implement task limit control logic in datanode. https://github.com/milvus-io/milvus/pull/32881 - Load bf from storage instead of memory during L0 compaction. https://github.com/milvus-io/milvus/pull/32913 - Remove dependencies on shards (e.g. SyncSegments, injection). https://github.com/milvus-io/milvus/pull/33138 - Rename Compaction interface to CompactionV2. https://github.com/milvus-io/milvus/pull/33858 - Remove the unused residual compaction logic. https://github.com/milvus-io/milvus/pull/33932 issue: https://github.com/milvus-io/milvus/issues/32809 pr: https://github.com/milvus-io/milvus/pull/32881, https://github.com/milvus-io/milvus/pull/32913, https://github.com/milvus-io/milvus/pull/33138, https://github.com/milvus-io/milvus/pull/33858, https://github.com/milvus-io/milvus/pull/33932 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-25 20:22:03 +08:00
wei liu	25d8b74f71	enhance: Execute bloom filter apply in parallel to speed up segment predict (#33793 ) issue: #33610 pr: #33792 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-13 14:14:04 +08:00
wei liu	54feef30e7	enhance: Use BatchPkExist to reduce bloom filter func call cost (#33752 ) issue: #33610 pr: #33611 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-12 17:45:58 +08:00
SimFG	f664b51ebe	enhance: [2.4] try to speed up the loading of small collections (#33746 ) - issue: #33569 - pr: #33570 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-06-11 15:07:55 +08:00
yihao.dai	7384bfe3f8	fix: use seperate warmup pool and disable warmup by default (#33348 ) (#33349 ) 1. use a small warmup pool to reduce the impact of warmup 2. change the warmup pool to nonblocking mode 3. disable warmup by default 4. remove the maximum size limit of 16 for the load pool issue: https://github.com/milvus-io/milvus/issues/32772 pr: https://github.com/milvus-io/milvus/pull/33348 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com> Co-authored-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-05-28 19:27:43 +08:00
foxspy	f6777267e3	enhance: add score compute consistency config for knowhere (#32997 ) issue: https://github.com/milvus-io/milvus/issues/32583 related: #32584 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-05-13 14:21:31 +08:00
Bingyi Sun	4724779b3b	enhance: remove fallback keys for config generator (#32946 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-05-13 13:33:31 +08:00
wei liu	e2332bdc17	enhance: Enable channel exclusive balance policy (#32911 ) issue: #32910 * split replica's node list to channels when create replicas * balance nodes among channels when node change happens * implement channel level balance, let balance happens in channel level Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-10 17:27:31 +08:00
chyezh	641f702f64	fix: add request resource timeout for lazy load, refactor context usage in cache (#32709 ) issue: #32663 - Use new param to control request resource timeout for lazy load. - Remove the timeout parameter of `Do`, remove `DoWait`. use `context` to control the timeout. - Use `VersionedNotifier` to avoid notify event lost and broadcast, remove the redundant goroutine in cache. related dev pr: #32684 Signed-off-by: chyezh <chyezh@outlook.com>	2024-05-07 16:33:30 +08:00
SimFG	09cd56d44f	enhance: add the skip auto id and partition key check config (#32592 ) /kind improvement issue: #32591 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-04-29 10:29:26 +08:00
SimFG	8594b55ad5	enhance: add `max insert request size` and `must use partition key` configs (#32433 ) issue: https://github.com/milvus-io/milvus/issues/30577 /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-04-19 10:31:20 +08:00
yihao.dai	49d109de18	enhance: Use an individual buffer size parameter for imports (#31833 ) Use an individual buffer size parameter for imports and set buffer size to 64MB. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-08 21:07:18 +08:00
chyezh	7b400252ff	fix: add configuration disk capacity config for lru and fix some bug (#31977 ) issue: #30361 - Add configurable disk capacity limit - fix bitset reset logic - make insert record reinsert after clear Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-08 15:55:16 +08:00
yihao.dai	4e264003bf	enhance: Ensure ImportV2 waits for the index to be built and refine some logic (#31629 ) Feature Introduced: 1. Ensure ImportV2 waits for the index to be built Enhancements Introduced: 1. Utilization of local time for timeout ts instead of allocating ts from rootcoord. 3. Enhanced input file length check for binlog import. 4. Removal of duplicated manager in datanode. 5. Renaming of executor to scheduler in datanode. 6. Utilization of a thread pool in the scheduler in datanode. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-01 20:09:13 +08:00
yihao.dai	f65a796d18	enhance: Add max file num limit and max file size limit for import (#31497 ) The max number of import files per request should not exceed 1024 by default (configurable). The import file size allowed for importing should not exceed 16GB by default (configurable). issue: https://github.com/milvus-io/milvus/issues/28521 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-22 18:13:06 +08:00
yihao.dai	0fe5e90e8b	enhance: Remove import v1 (#31403 ) Remove all code and logic related to import v1. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-22 15:29:09 +08:00
wei liu	06b191b164	fix: Balance channel stuck forever due to logic dead lock (#31202 ) issue: #30816 cause balance channel will stuck until leader view catch up the current target, then start to unsub the old delegator. which make sure that the new delegator can provide search before release old delegator. but another logic in segment_checker skip loading segment during balance channel. so during balance channel, if query node crash, new delegator can't catch up target forever, then stuck forever. This PR remove the rule that skip loading segment during balance channel to avoid the logic dead lock here. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-13 15:05:04 +08:00
Chun Han	3298e64bd3	enhance: cache config values for saving cpu cycles to parse config item (#30947 ) related: #30958 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-03-12 11:09:04 +08:00
yihao.dai	c411cb4a49	enhance: Prevent the backlog of channelCP update tasks, perform batch updates of channelCPs (#30941 ) This PR includes the following adjustments: 1. To prevent channelCP update task backlog, only one task with the same vchannel is retained in the updater. Additionally, the lastUpdateTime is refreshed after the flowgraph submits the update task, rather than in the callBack function. 2. Batch updates of multiple vchannel checkpoints are performed in the UpdateChannelCheckpoint RPC (default batch size is 128). Additionally, the lock for channelCPs in DataCoord meta has been switched from key lock to global lock. 3. The concurrency of UpdateChannelCheckpoint RPCs in the datanode has been reduced from 1000 to 10. issue: https://github.com/milvus-io/milvus/issues/30004 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com> Co-authored-by: jaime <yun.zhang@zilliz.com> Co-authored-by: congqixia <congqi.xia@zilliz.com>	2024-03-07 20:39:02 +08:00
yihao.dai	a434d33e75	feat: Add import scheduler and manager (#29367 ) This PR introduces novel managerial roles for importv2: 1. ImportMeta: To manage all the import tasks; 2. ImportScheduler: To process tasks and modify their states; 3. ImportChecker: To ascertain the completion of all tasks and instigate relevant operations. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-01 18:31:02 +08:00
chyezh	dd957cf9e3	enhance: add configurable memory index load predict memory usage factor (#30561 ) related pr: https://github.com/milvus-io/milvus/pull/30475 Signed-off-by: chyezh <chyezh@outlook.com>	2024-03-01 15:23:00 +08:00
chyezh	0c7474d7e8	enhance: add graceful stop timeout to avoid node stop hang under extreme cases (#30317 ) 1. add coordinator graceful stop timeout to 5s 2. change the order of datacoord component while stop 3. change querynode grace stop timeout to 900s, and we should potentially change this to 600s when graceful stop is smooth issue: #30310 also see pr: #30306 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-02-29 17:01:50 +08:00
yihao.dai	c5918290e6	feat: Add import executor and manager for datanode (#29438 ) This PR introduces novel importv2 roles for datanode: 1. Executor: To execute tasks, a import task will be divided into the following steps: read data -> hash data -> sync data; 2. Manager: To manage all the tasks; issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-31 20:45:04 +08:00
yihao.dai	c02fb64ad6	enhance: Allows proactive warming up of chunk cache (#30182 ) Allows proactive warming up of chunk cache. Original vector data will be asynchronously loaded into the chunk cache during the load process. It has the potential to significantly reduce query/search latency for a certain duration after the load, albeit with a concurrent increase in disk usage. issue: https://github.com/milvus-io/milvus/issues/30181 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-25 19:55:39 +08:00

1 2

89 Commits