milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-07 19:31:51 +08:00

Author	SHA1	Message	Date
wei liu	e70c01362d	enhance: Add resource exhaustion querynode penalty policy (#45808 ) issue: #40513 For a querynode that returns a resource exhausted error, add a penalty duration to it and suspend loading new resources on it until the penalty duration expires. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-02 16:59:11 +08:00
congqixia	c01fd94a6a	enhance: integrate Storage V2 FFI interface for unified storage access (#45723 ) Related #44956 This commit integrates the Storage V2 FFI (Foreign Function Interface) interface throughout the Milvus codebase, enabling unified storage access through the Loon FFI layer. This is a significant step towards standardizing storage operations across different storage versions. 1. Configuration Support - configs/milvus.yaml: Added `useLoonFFI` configuration flag under `common.storage.file.splitByAvgSize` section - Allows runtime toggle between traditional binlog readers and new FFI-based manifest readers - Default: `false` (maintains backward compatibility) 2. Core FFI Infrastructure Enhanced Utilities (internal/core/src/storage/loon_ffi/util.cpp/h) - ToCStorageConfig(): Converts Go's `StorageConfig` to C's `CStorageConfig` struct for FFI calls - GetManifest(): Parses manifest JSON and retrieves latest column groups using FFI - Accepts manifest path with `base_path` and `ver` fields - Calls `get_latest_column_groups()` FFI function - Returns column group information as string - Comprehensive error handling for JSON parsing and FFI errors 3. Dependency Updates - internal/core/thirdparty/milvus-storage/CMakeLists.txt: - Updated milvus-storage version from `0883026` to `302143c` - Ensures compatibility with latest FFI interfaces 4. 
Data Coordinator Changes All compaction task builders now include manifest path in segment binlogs: - compaction_task_clustering.go: Added `Manifest: segInfo.GetManifestPath()` to segment binlogs - compaction_task_l0.go: Added manifest path to both L0 segment selection and compaction plan building - compaction_task_mix.go: Added manifest path to mixed compaction segment binlogs - meta.go: Updated metadata completion logic: - `completeClusterCompactionMutation()`: Set `ManifestPath` in new segment info - `completeMixCompactionMutation()`: Preserve manifest path in compacted segments - `completeSortCompactionMutation()`: Include manifest path in sorted segments 5. Data Node Compactor Enhancements All compactors updated to support dual-mode reading (binlog vs manifest): 6. Flush & Sync Manager Updates Pack Writer V2 (pack_writer_v2.go) - BulkPackWriterV2.Write(): Extended return signature to include `manifest string` - Implementation: - Generate manifest path: `path.Join(pack.segmentID, "manifest.json")` - Write packed data using FFI-based writer - Return manifest path along with binlogs, deltas, and stats Task Handling (task.go) - Updated all sync task result handling to accommodate new manifest return value - Ensured backward compatibility for callers not using manifest 7. Go Storage Layer Integration New Interfaces and Implementations - record_reader.go: Interface for unified record reading across storage versions - record_writer.go: Interface for unified record writing across storage versions - binlog_record_writer.go: Concrete implementation for traditional binlog-based writing Enhanced Schema Support (schema.go, schema_test.go) - Schema conversion utilities to support FFI-based storage operations - Ensures proper Arrow schema mapping for V2 storage Serialization Updates - serde.go, serde_events.go, serde_events_v2.go: Updated to work with new reader/writer interfaces - Test files updated to validate dual-mode serialization 8. 
Storage V2 Packed Format FFI Common (storagev2/packed/ffi_common.go) - Common FFI utilities and type conversions for packed storage format Packed Writer FFI (storagev2/packed/packed_writer_ffi.go) - FFI-based implementation of packed writer - Integrates with Loon storage layer for efficient columnar writes Packed Reader FFI (storagev2/packed/packed_reader_ffi.go) - Already existed, now complemented by writer implementation 9. Protocol Buffer Updates data_coord.proto & datapb/data_coord.pb.go - Added `manifest` field to compaction segment messages - Enables passing manifest metadata through compaction pipeline worker.proto & workerpb/worker.pb.go - Added compaction parameter for `useLoonFFI` flag - Allows workers to receive FFI configuration from coordinator 10. Parameter Configuration component_param.go - Added `UseLoonFFI` parameter to compaction configuration - Reads from `common.storage.file.useLoonFFI` config path - Default: `false` for safe rollout 11. Test Updates - clustering_compactor_storage_v2_test.go: Updated signatures to handle manifest return value - mix_compactor_storage_v2_test.go: Updated test helpers for manifest support - namespace_compactor_test.go: Adjusted writer calls to expect manifest - pack_writer_v2_test.go: Validated manifest generation in pack writing This integration follows a dual-mode approach: 1. Legacy Path: Traditional binlog-based reading/writing (when `useLoonFFI=false` or no manifest) 2. FFI Path: Manifest-based reading/writing through Loon FFI (when `useLoonFFI=true` and manifest exists) --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-11-24 19:57:07 +08:00
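The dual-mode rule stated at the end of the Storage V2 FFI commit (#45723) can be sketched in Go as below. This is only an illustration of the stated condition: `ReadMode`, `chooseReadMode`, and the use of an empty string to mean "no manifest" are hypothetical names and conventions, not taken from the Milvus code.

```go
package main

import "fmt"

// ReadMode names the two paths described in the commit message.
// The type and its values are hypothetical, for illustration only.
type ReadMode int

const (
	LegacyBinlog ReadMode = iota // traditional binlog-based reading/writing
	ManifestFFI                  // manifest-based access through the Loon FFI layer
)

// String returns a short label for the mode.
func (m ReadMode) String() string {
	if m == ManifestFFI {
		return "ffi"
	}
	return "legacy"
}

// chooseReadMode applies the rule from the commit message: the FFI path is
// taken only when the flag is on AND a manifest exists; every other
// combination falls back to the legacy path.
// Assumption: an empty manifestPath stands for "no manifest".
func chooseReadMode(useLoonFFI bool, manifestPath string) ReadMode {
	if useLoonFFI && manifestPath != "" {
		return ManifestFFI
	}
	return LegacyBinlog
}

func main() {
	fmt.Println(chooseReadMode(false, "100/manifest.json"))
	fmt.Println(chooseReadMode(true, ""))
	fmt.Println(chooseReadMode(true, "100/manifest.json"))
}
```

Keeping the decision in one small function means a segment written before the flag was enabled (and so without a manifest) keeps using the legacy path even after `useLoonFFI` is turned on.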
tinswzy	1427825133	enhance: improve WAL retention strategy (#45350 ) issue: #44369 woodpecker related issue: [#59](https://github.com/zilliztech/woodpecker/issues/59) Refactor the WAL retention logic in Milvus StreamingNode: - Remove the simple sampling-based truncation mechanism. - After flush, WAL data is directly truncated. - The retention control is now delegated to the underlying message queue (MQ) implementation. Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2025-11-23 21:41:05 +08:00
Xiaofan	a9895bb904	enhance: add robust handle etcd servercrash (#45304 ) related to #45303 Fix: the milvus pod may restart when the etcd pod starts. Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2025-11-13 10:23:36 +08:00
wei liu	931d4bf95f	enhance: increase session TTL from 10s to 30s (#45228 ) issue: #45227 Increase the default session TTL to 30 seconds to tolerate etcd failover time. This prevents session expiration during etcd cluster failover, improving system stability. When etcd undergoes failover (leader election or node restart), the previous 10s TTL was too short to survive the failover window, causing unnecessary session expiration and component restarts. The new 30s TTL provides sufficient buffer for etcd to complete failover while maintaining session liveness. Changes: - Update DefaultSessionTTL constant from 10 to 30 - Update SessionTTL ParamItem DefaultValue from "10" to "30" Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-11-10 16:47:36 +08:00
yihao.dai	e8bdc3f2a1	enhance: Add RBAC support for UpdateReplicateConfiguration (#45123 ) issue: https://github.com/milvus-io/milvus/issues/44123 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-11-07 17:11:34 +08:00
sparknack	9032bb7668	enhance: unify the aligned buffer for both buffered and direct I/O (#45323 ) issue: #43040 Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-11-06 10:53:33 +08:00
zhagnlu	792e931fcb	enhance: rename jsonstats related user config params (#45254 ) #44132 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-11-04 20:21:36 +08:00
cai.zhang	ed8ba4a28c	enhance: Make GeometryCache an optional configuration (#45192 ) issue: #45187 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-11-03 19:59:32 +08:00
Zhen Ye	309d564796	enhance: support collection and index with WAL-based DDL framework (#45033 ) issue: #43897 - Part of collection/index related DDL is implemented by WAL-based DDL framework now. - Support following message type in wal, CreateCollection, DropCollection, CreatePartition, DropPartition, CreateIndex, AlterIndex, DropIndex. - Part of collection/index related DDL can be synced by new CDC now. - Refactor some UT for collection/index DDL. - Add Tombstone scheduler to manage the tombstone GC for collection or partition meta. - Move the vchannel allocation into streaming pchannel manager. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-10-30 14:24:08 +08:00
wei liu	3566cb745c	enhance: remove max vector field number limit (#45151 ) issue: #45150 Removed the maximum limit constraint (value range [1, 10]) for vector fields in a collection to support more flexible schema design. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-10-30 12:42:07 +08:00
sparknack	935160840c	enhance: add a disk quota for the loaded binlog size to prevent load failure of querynode (#44893 ) issue: #41435 --------- Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-10-19 19:44:01 +08:00
foxspy	b91878857e	fix: update aisaq param (#44861 ) issue: #44365 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-10-15 19:18:00 +08:00
Buqian Zheng	3140bd0ca6	enhance: enable default json stats (#44810 ) issue: https://github.com/milvus-io/milvus/issues/44132 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-10-13 21:29:59 +08:00
sparknack	6d5b41644b	enhance: remove logical usage checks during segment loading (#44743 ) issue: #41435 Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-10-13 14:21:58 +08:00
Xiaofan	7c00f292bc	enhance: add config for meta batch(#44569 ) (#44645 ) fix: https://github.com/milvus-io/milvus/issues/44569 add a new config to control meta batch to avoid too large Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2025-09-30 17:31:02 +08:00
zhagnlu	4c49295c3d	Revert "enhance: enable default json stats (#44559 )" (#44644 ) This reverts commit 1b5191974c71eee342e4f7a8c804e1d95cfd094b. #44132 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-09-30 12:11:53 +08:00
zhagnlu	1b5191974c	enhance: enable default json stats (#44559 ) #44132 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-09-28 10:45:04 +08:00
Gao	539f17f1ad	enhance: tiered index updates (#44433 ) issue: #42032 #44212 - special case for warmup param and cell storage size for tiered index - add a config to enable/disable storage usage tracking --------- Signed-off-by: chasingegg <chao.gao@zilliz.com>	2025-09-22 21:34:11 +08:00
sthuang	edd250ffef	fix: [StorageV2] force virtual host for oss and cos (#44484 ) related: #44481 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-09-22 16:58:11 +08:00
Bingyi Sun	94d53a5ac6	feat: encode cluster id in auto id (#44471 ) https://github.com/milvus-io/milvus/issues/44326 prev: [physical_ts][logical_ts] after [sign_bit][cluster_id][physical_ts][logical_ts] --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-09-22 10:40:02 +08:00
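The layout change described in #44471 (`[physical_ts][logical_ts]` becoming `[sign_bit][cluster_id][physical_ts][logical_ts]`) can be sketched as bit packing into a signed 64-bit ID. The commit message gives only the field order, so the field widths below (7, 38, and 18 bits) and the function names are assumptions chosen for the example, not the values used by Milvus.

```go
package main

import "fmt"

// Field widths are assumptions made for this sketch; the commit message
// gives only the order of the fields, not their sizes.
const (
	clusterBits  = 7
	logicalBits  = 18
	physicalBits = 63 - clusterBits - logicalBits // the 64th bit is the sign bit
)

// mask returns a value with the lowest `bits` bits set.
func mask(bits uint) uint64 { return (uint64(1) << bits) - 1 }

// EncodeAutoID packs [sign_bit=0][cluster_id][physical_ts][logical_ts].
// Inputs wider than their field are truncated to the field width.
func EncodeAutoID(clusterID, physicalTS, logicalTS uint64) int64 {
	id := ((clusterID & mask(clusterBits)) << (physicalBits + logicalBits)) |
		((physicalTS & mask(physicalBits)) << logicalBits) |
		(logicalTS & mask(logicalBits))
	return int64(id)
}

// DecodeAutoID reverses EncodeAutoID.
func DecodeAutoID(id int64) (clusterID, physicalTS, logicalTS uint64) {
	u := uint64(id)
	logicalTS = u & mask(logicalBits)
	physicalTS = (u >> logicalBits) & mask(physicalBits)
	clusterID = (u >> (physicalBits + logicalBits)) & mask(clusterBits)
	return clusterID, physicalTS, logicalTS
}

func main() {
	id := EncodeAutoID(5, 123456789, 42)
	c, p, l := DecodeAutoID(id)
	fmt.Println(id > 0, c, p, l)
}
```

Because the sign bit is left at zero, every ID produced this way is non-negative, and IDs from the same cluster still sort by physical and then logical timestamp.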
tinswzy	c7f21d5a06	enhance: purge small files right after wp segment compaction (#44473 ) #43638 improve wp log output [wp#43](https://github.com/zilliztech/woodpecker/issues/43) intro purge small files right after segment compaction [wp#47](https://github.com/zilliztech/woodpecker/issues/47) The rootpath configured by milvus is uniformly used as the base for wp local fs storage. update to v0.1.5 Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2025-09-21 16:32:01 +08:00
Bingyi Sun	5cd2d99799	enhance: Revert "feat: encode cluster id in auto id (#44324 )" (#44426 ) This reverts commit 7af159410395f0e7079d4875d96544c01f1d477b	2025-09-17 17:56:01 +08:00
Bingyi Sun	7af1594103	feat: encode cluster id in auto id (#44324 ) https://github.com/milvus-io/milvus/issues/44326 prev: `[physical_ts][logical_ts]` after `[sign_bit][cluster_id][physical_ts][logical_ts]` --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-09-17 16:56:01 +08:00
congqixia	fc968ff1c2	enhance: [StorageV2] Pass args for avg size split policy (#44301 ) Related to #44257 This PR - Pass column stats for avg size split policy - Add param items for policy configuration --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-09-11 10:43:57 +08:00
sparknack	4a01c726f3	enhance: cachinglayer: some metric and params update (#44276 ) issue: #41435 --------- Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-09-10 11:03:57 +08:00
zhagnlu	d67f1ea0ab	enhance: add param to modify dump snapshot batch size (#44215 ) issue: #44216 Signed-off-by: luzhang <luzhang@zilliz.com>	2025-09-05 14:29:54 +08:00
wei liu	db6595d7a5	enhance: Reduce compaction task cleanup tolerance time (#44207 ) issue: #43858 Reduce CompactionDropToleranceInSeconds from 24 hours to 1 hour to improve memory efficiency and faster task metadata cleanup. Changes include: - Update default value from 86400s (24h) to 3600s (1h) in component_param.go - Update corresponding configuration in milvus.yaml - Faster cleanup of completed compaction task metadata - Reduce memory footprint by shorter retention period Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-09-05 10:33:54 +08:00
Bingyi Sun	e3ecacca9e	feat: Add namespace prop (#43962 ) issue: https://github.com/milvus-io/milvus/issues/44011 Namespace is an alias for tenant. If this property is enabled, milvus will add a __namespace_id field. Future modifications will use this property for compaction and search. --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-09-03 12:57:53 +08:00
Jean-Francois Weber-Marx	330a871979	enhance: add configuration to allow custom characters in names (#42417 ) (#44063 ) related: #42417 - Add NameValidationAllowedChars and RoleNameValidationAllowedChars configuration parameters to specify additional characters allowed in (generic) names and in role names, respectively - All validation in the validateName method is moved to the new method validateNameWithCustomChars, which is called by both validateName and ValidateRoleName with the allowed characters specified Signed-off-by: Jean-Francois Weber-Marx <jfwm@hotmail.com> Signed-off-by: Jean-Francois Weber-Marx <jf.webermarx@criteo.com>	2025-09-02 11:57:52 +08:00
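The idea in #44063, one shared validator that takes the extra allowed characters as a parameter, might look roughly like the Go sketch below. The function name `checkName` and the base rule used here (first character a letter or underscore, later characters letters, digits, or underscores) are assumptions for illustration; the actual signature and rules of `validateNameWithCustomChars` are not shown in the commit message.

```go
package main

import (
	"fmt"
	"strings"
)

// checkName is a hypothetical validator. Assumed base rule: the first
// character is a letter or underscore, and later characters are letters,
// digits, or underscores. allowedChars lists extra characters that are
// accepted after the first position.
func checkName(name, allowedChars string) error {
	if name == "" {
		return fmt.Errorf("name must not be empty")
	}
	for i, r := range name {
		isLetter := (r >= 'a' && r <= 'z') || (r >= 'A' && r <= 'Z')
		isDigit := r >= '0' && r <= '9'
		switch {
		case isLetter || r == '_':
			continue
		case i > 0 && (isDigit || strings.ContainsRune(allowedChars, r)):
			continue
		default:
			return fmt.Errorf("invalid character %q at byte offset %d", r, i)
		}
	}
	return nil
}

func main() {
	// Generic names and role names share the helper but pass different
	// allow-lists, mirroring the two configuration parameters.
	fmt.Println(checkName("my_collection", ""))
	fmt.Println(checkName("team-a.reader", "-."))
	fmt.Println(checkName("team-a.reader", ""))
}
```

Passing the allow-list as an argument keeps a single code path for both kinds of name, so a rule change only has to be made once.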
nish112022	1e704ecf9f	fix: Add Kafka buffer size limit to prevent DataNode OOM (#44106 ) issue: https://github.com/milvus-io/milvus/issues/44105 - I have added support to set this property queued.max.messages.kbytes in kafka consumers from the user side. - It limits the size (in KB) of the consumer’s local message queue (buffer) where messages are temporarily stored after being fetched from Kafka but before your application actually processes them --------- Signed-off-by: Nischay Yadav <Nischay.Yadav@ibm.com>	2025-09-01 18:19:21 +08:00
zhagnlu	fc876639cf	enhance: support json stats with shredding design (#42534 ) #42533 Co-authored-by: luzhang <luzhang@zilliz.com>	2025-09-01 10:49:52 +08:00
Chun Han	da156981c6	feat: milvus support posix-compatible mode(milvus-io#43942) (#43944 ) related: #43942 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-08-27 16:29:50 +08:00
Zhen Ye	5bdc593b8a	enhance: use v0.15.1 official pulsar client and add logging for pulsar client (#43913 ) issue: #43785 - The pulsar client now logs through the milvus logger. - The pulsar client enables metrics by default. - Upgrade the pulsar client to v0.15.1 and use the official repo. - The fixes from milvus-io/pulsar-client-go are already covered by the official v0.15.1. Signed-off-by: chyezh <chyezh@outlook.com>	2025-08-26 16:45:53 +08:00
zhagnlu	8934c18792	enhance: support cache result cache for expr (#43923 ) issue: #43878 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-08-26 10:55:52 +08:00
junjiejiangjjj	f1ce84996d	enhance: refactor model service configuration and environment variables (#44036 ) - Add enable configuration for all model service providers - Migrate environment variables from MILVUSAI_* to MILVUS_* prefix with backward compatibility - Unify model service enable/disable logic using configuration - Add tests for environment variable parsing with fallback support #35856 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-08-26 10:49:52 +08:00
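The prefix migration with backward compatibility described in #44036 can be illustrated with a small lookup helper. The helper name `lookupModelEnv`, the variable suffix `EXAMPLE_API_KEY`, and the choice to let the new `MILVUS_` name win over the old `MILVUSAI_` name are all assumptions; the commit message states only that the old prefix remains supported as a fallback.

```go
package main

import (
	"fmt"
	"os"
)

const (
	newPrefix = "MILVUS_"   // prefix after the migration
	oldPrefix = "MILVUSAI_" // legacy prefix kept for backward compatibility
)

// lookupModelEnv resolves a variable by suffix. It tries the new prefix
// first and falls back to the legacy prefix (assumed precedence).
// The lookup function is injected so the logic can be tested without
// touching the real process environment.
func lookupModelEnv(suffix string, lookup func(string) (string, bool)) (string, bool) {
	if v, ok := lookup(newPrefix + suffix); ok {
		return v, true
	}
	return lookup(oldPrefix + suffix)
}

func main() {
	// EXAMPLE_API_KEY is a made-up suffix used only for this demo.
	v, ok := lookupModelEnv("EXAMPLE_API_KEY", os.LookupEnv)
	fmt.Println(v, ok)
}
```

In production code the helper would be called with `os.LookupEnv`, as in `main`; tests can pass a map-backed function instead.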
zhagnlu	1a30012014	enhance: support trace log level for segcore (#44003 ) #43230 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-08-25 17:55:52 +08:00
cqy123456	d987dd7103	enhance: Make build ratio of interim index configurable (#43939 ) issue: https://github.com/milvus-io/milvus/issues/43993 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2025-08-25 14:43:51 +08:00
sparknack	4fae074d56	enhance: add write rate limit for disk file writer (#43912 ) issue: #43040 --------- Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-08-25 10:27:47 +08:00
junjiejiangjjj	f3d7e47227	feat: Supports more rerankers (#43270 ) https://github.com/milvus-io/milvus/issues/35856 Signed-off-by: junjiejiangjjj <junjie.jiang@zilliz.com>	2025-08-22 17:29:47 +08:00
Zhen Ye	a86b6f2a54	enhance: extend the stats manage at streaming shard manager for L0 (#43371 ) issue: #42416 - Rename the InsertMetric into ModifiedMetric. - Add L0 control configuration. - Add some L0 current state collect. Signed-off-by: chyezh <chyezh@outlook.com>	2025-08-18 20:41:46 +08:00
sparknack	9a31d88d49	fix: cachinglayer: config: align the watermark ratio in the YAML file… (#43910 ) issue: #41435 Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-08-18 17:53:44 +08:00
cai.zhang	d8a3236e44	fix: Reorder worker proto fields to ensure compatibility (#43735 ) issue: #43734 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-08-05 14:59:38 +08:00
sparknack	544c7c0600	enhance: update cachinglayer default cache ratio to 0.3 (#43723 ) issue: #41435 --------- Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-08-05 01:35:39 +08:00
aoiasd	4f02b06abc	enhance: support set lindera dict build dir and download url in yaml (#43541 ) relate: https://github.com/milvus-io/milvus/issues/43120 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-08-04 09:47:38 +08:00
sparknack	bdd65871ea	enhance: tiered storage: estimate segment loading resource usage while considering eviction (#43323 ) issue: #41435 After introducing the caching layer's lazy loading and eviction mechanisms, most parts of a segment won't be loaded into memory or disk immediately, even if the segment is marked as LOADED. This means physical resource usage may be very low. However, we still need to reserve enough resources for the segments marked as LOADED. Thus, the logic of resource usage estimation during segment loading, which is currently based on physical resource usage only, should be changed. To address this issue, we introduced the concept of logical resource usage in this patch. This can be thought of as the base reserved resource for each LOADED segment. A segment’s logical resource usage is derived from its final evictable and inevictable resource usage and calculated as follows: ``` SLR = SFPIER + evitable_cache_ratio * SFPER ``` which is equivalent to ``` SLR = (SFPIER + SFPER) - (1.0 - evitable_cache_ratio) * SFPER ``` `SLR`: The logical resource usage of a segment. `SFPIER`: The final physical inevictable resource usage of a segment. `SFPER`: The final physical evictable resource usage of a segment. `evitable_cache_ratio`: The ratio of a segment's evictable resources that can be cached locally. The higher the ratio, the more physical memory is reserved for evictable memory. When loading a segment, two types of resource usage are taken into account. First is the estimated maximum physical resource usage: ``` PPR = HPR + CPR + SMPR - SFPER ``` `PPR`: The predicted physical resource usage after the current segment is allowed to load. `HPR`: The physical resource usage obtained from hardware information. `CPR`: The total physical resource usage of segments that have been committed but not yet loaded. When a new segment is allowed to load, `CPR' = CPR + (SMR - SER)`. When one of the committed segments is loaded, `CPR' = CPR - (SMR - SER)`. `SMPR`: The maximum physical resource usage of the current segment. `SFPER`: The final physical evictable resource usage of the current segment. Second is the estimated logical resource usage; this check applies only when eviction is enabled: ``` PLR = LLR + CLR + SLR ``` `PLR`: The predicted logical resource usage after the current segment is allowed to load. `LLR`: The total logical resource usage of all loaded segments. When a new segment is loaded, `LLR` should be updated to `LLR' = LLR + SLR`. `CLR`: The total logical resource usage of segments that have been committed but not yet loaded. When a new segment is allowed to load, `CLR' = CLR + SLR`. When one of the committed segments is loaded, `CLR' = CLR - SLR`. `SLR`: The logical resource usage of the current segment. Only when `PPR < PRL && PLR < PRL` (`PRL`: Physical resource limit of the querynode), the segment is allowed to be loaded. --------- Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>	2025-08-01 21:31:37 +08:00
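The admission rule in #43323 can be written out as a short Go sketch. The formulas for `SLR`, `PPR`, and `PLR` and the final condition come from the commit message; the struct and function names are invented for the example and units are left abstract. Note that a later commit in this log (#44743) removes the logical usage checks during segment loading, so this reflects the patch as described, not necessarily current behavior.

```go
package main

import "fmt"

// SegmentUsage holds the per-segment figures named in the commit message.
// Type and field names are hypothetical.
type SegmentUsage struct {
	MaxPhysical      float64 // SMPR
	FinalInevictable float64 // SFPIER
	FinalEvictable   float64 // SFPER
}

// NodeState holds the querynode-level figures.
type NodeState struct {
	HardwarePhysical  float64 // HPR
	CommittedPhysical float64 // CPR
	LoadedLogical     float64 // LLR
	CommittedLogical  float64 // CLR
	PhysicalLimit     float64 // PRL
}

// LogicalUsage computes SLR = SFPIER + ratio * SFPER, where ratio is the
// value the commit message calls evitable_cache_ratio.
func LogicalUsage(s SegmentUsage, ratio float64) float64 {
	return s.FinalInevictable + ratio*s.FinalEvictable
}

// CanLoad applies PPR < PRL, and additionally PLR < PRL when eviction
// is enabled.
func CanLoad(n NodeState, s SegmentUsage, ratio float64, evictionEnabled bool) bool {
	// PPR = HPR + CPR + SMPR - SFPER
	ppr := n.HardwarePhysical + n.CommittedPhysical + s.MaxPhysical - s.FinalEvictable
	if ppr >= n.PhysicalLimit {
		return false
	}
	if !evictionEnabled {
		return true // the logical check applies only with eviction enabled
	}
	// PLR = LLR + CLR + SLR
	plr := n.LoadedLogical + n.CommittedLogical + LogicalUsage(s, ratio)
	return plr < n.PhysicalLimit
}

func main() {
	seg := SegmentUsage{MaxPhysical: 10, FinalInevictable: 2, FinalEvictable: 6}
	node := NodeState{
		HardwarePhysical: 50, CommittedPhysical: 20,
		LoadedLogical: 40, CommittedLogical: 10, PhysicalLimit: 100,
	}
	fmt.Println(LogicalUsage(seg, 0.5), CanLoad(node, seg, 0.5, true))
}
```

With these numbers, `SLR = 2 + 0.5 * 6 = 5`, `PPR = 50 + 20 + 10 - 6 = 74`, and `PLR = 40 + 10 + 5 = 55`, so both values are below the limit of 100 and the segment is admitted.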
Ted Xu	e37cd19da2	enhance: enable storage v2 by default (#43652 ) Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2025-08-01 08:59:36 +08:00
sthuang	df02014b3b	enhance: [rbac] privilege groups add import and add field privileges (#43664 ) related: https://github.com/milvus-io/milvus/issues/29367, https://github.com/milvus-io/milvus/pull/42687 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-07-31 20:47:36 +08:00
Zhen Ye	0d5e0ca795	fix: close timetick protection by default (#43650 ) issue: #43266 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-30 19:51:37 +08:00
Buqian Zheng	052fb6c562	feat: add time based eviction to data managed by cachinglayer (#43490 ) issue: https://github.com/milvus-io/milvus/issues/41435 also added disk capacity protection --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-07-29 16:17:35 +08:00


778 Commits