milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-07 19:31:51 +08:00

Author	SHA1	Message	Date
congqixia	8962b0058d	fix: [StorageV2] Check writer nil when closing not written one (#43056 ) Related to #43047 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-07-02 14:22:43 +08:00
Zhen Ye	09c6df62d8	fix: use impl and remove the close method of broadcast service (#42992 ) issue: #38399 Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-02 10:30:44 +08:00
wei liu	c381bf3e41	enhance: add logs for count(*) (#43001 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-01 19:36:43 +08:00
Zhen Ye	08fff353af	fix: Revert "enhance: Enable mergeSort by default starting from version 2.6.0 (#42981 )" (#43046 ) issue: #43034 - implementation of mergeSortMultipleSegments is wrong. Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-01 17:30:29 +08:00
Spade A	26ec841feb	feat: optimize `Like` query with n-gram (#41803 ) Ref #42053 This is the first PR for optimizing `LIKE` with ngram inverted index. Now, only VARCHAR data type is supported and only InnerMatch LIKE (%xxx%) query is supported. How to use it: ``` milvus_client = MilvusClient("http://localhost:19530") schema = milvus_client.create_schema() ... schema.add_field("content_ngram", DataType.VARCHAR, max_length=10000) ... index_params = milvus_client.prepare_index_params() index_params.add_index(field_name="content_ngram", index_type="NGRAM", index_name="ngram_index", min_gram=2, max_gram=3) milvus_client.create_collection(COLLECTION_NAME, ...) ``` min_gram and max_gram controls how we tokenize the documents. For example, for min_gram=2 and max_gram=4, we will tokenize each document with 2-gram, 3-gram and 4-gram. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>	2025-07-01 10:08:44 +08:00
wei liu	396120ade5	enhance: Improve delegator serviceable check with coordinator sync state (#42975 ) issue: #42404 Add syncedByCoord field to ensure delegator only becomes serviceable after coordinator sync, preventing unreliable service state when memory is insufficient. Issue: When memory is low, delegator may become serviceable before current target is ready, but segments can be released at any time, making the serviceable state unreliable. Changes include: - Add syncedByCoord field to track coordinator sync status - Update Serviceable() to require both data readiness and coord sync - Set syncedByCoord=true in SyncTargetVersion - Add comprehensive test coverage Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-01 10:00:43 +08:00
Zhen Ye	ecb24e7232	enhance: use multi-process framework in integration test (#42976 ) issue: #41609 - add env `MILVUS_NODE_ID_FOR_TESTING` to set up a node id for milvus process. - add env `MILVUS_CONFIG_REFRESH_INTERVAL` to set up the refresh interval of paramtable. - Init paramtable when calling `paramtable.Get()`. - add new multi process framework for integration test. - change all integration test into multi process. - merge some test case into one suite to speed up it. - modify some test, which need to wait for issue #42966, #42685. - remove the waittssync for delete collection to fix issue: #42989 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-30 14:22:43 +08:00
wei liu	c919340763	enhance: Optimize channel node balancing for uneven QN distribution (#42786 ) issue: #42860 Fix channel node allocation when QueryNode count is not a multiple of channel count. The previous algorithm used simple division which caused uneven distribution with remainders. Key improvements: - Implement smart remainder distribution algorithm - Refactor large function into focused helper functions - Support two-phase rebalancing (release then allocate) - Handle edge cases like insufficient nodes gracefully --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-30 12:14:42 +08:00
rhys	48661655d6	fix: streamingcoord and streamingnode client support internal tls (#42685 ) https://github.com/milvus-io/milvus/issues/42680 streamingnode/streamingcoord support internal tls Signed-off-by: rhys <sdbwlr@163.com>	2025-06-27 17:50:42 +08:00
Zhen Ye	8367e4ec6a	fix: set 72h for wal retention (#42910 ) issue: #42706 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-27 17:36:43 +08:00
Bingyi Sun	23c784cf69	fix: Fix querynode crash caused by json index (#42982 ) issue: https://github.com/milvus-io/milvus/issues/42978 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-27 16:44:41 +08:00
XuanYang-cn	17f1ab71bb	enhance: Remove not inused BuildIndexInfo (#42926 ) 1. removed not inuse cgo methods in index_c.h/cpp 2. removed indexcogowrapper/build_index_info.go See also: #39242 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-06-27 15:00:42 +08:00
congqixia	9b06ecb72f	enhance: [StorageV2] Release record and close reader (#42983 ) Related to #39173 This PR - Close packed reader after sort - Release arrow.Record preventing memory leakage - Invoke `pack_reader->Close()` for CloseReader --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-27 14:46:43 +08:00
sthuang	238bd30f42	fix: [StorageV2] end to end minor issues for sync, stats, and load (#42948 ) Fix issues in end-to-end tests: 1. Split column groups based on schema, rather than estimating by average chunk row size. Ensure column group consistency within a segment, to avoid errors caused by loading multiple column group chunks simultaneously. 2. Use sorted segmentId when generating the stats binlog path, to ensure consistent and correct file path resolution. 3. Determine field IDs as follows: For multi-column column groups, retrieve the field ID list from metadata. For single-column column groups, use the column group ID directly as the field ID. related: #39173 fix: #42862 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-27 14:44:42 +08:00
Zhen Ye	2d73e6eaa8	fix: mixcoord will not handle timetick anymore (#42965 ) issue: #42954 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-26 19:14:42 +08:00
Zhen Ye	3602817c53	fix: dynamic log level for streaming node (#42964 ) issue: #42963 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-26 19:12:50 +08:00
congqixia	5dd1f841d2	enhance: [AddField] Add Restful API for addfield (#42972 ) Related to #39718 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-26 18:46:41 +08:00
Bingyi Sun	289b8b85d3	enhance: remove name check for alter index task (#42953 ) issue: https://github.com/milvus-io/milvus/issues/42952 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-26 16:32:41 +08:00
foxspy	be05b653c1	enhance: update knowhere version (#42938 ) issue: #42937 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-06-26 01:22:41 +08:00
yihao.dai	d7c9914eff	fix: Consider fields number when preallocating ids for import (#42810 ) In corner cases where there are many fields but only a small number of rows to import, the default preallocated IDs may be insufficient. To address this, consider the number of fields when preallocating IDs. issue: https://github.com/milvus-io/milvus/issues/42518 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-06-25 23:38:41 +08:00
wei liu	be492c2939	fix: Add missing keylocks in ReleasePartition operation (#42940 ) issue: #42098 Fix concurrent access issue by adding proper locking around ReleasePartition operation to prevent race conditions when releasing partitions on the same collection. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-25 21:48:42 +08:00
congqixia	336e743b55	fix: [AddField] Respect growing mmap setting adding empty field (#42933 ) Related to #42856 Data under mmapped growing segment shall be treated respecting growingMmap setting. Otherwise, varchar datatype could be treated with logic error. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-25 21:10:42 +08:00
congqixia	942055fa7d	fix: Use task timestamp to calculate TTL timestamp (#42920 ) Related to #42918 Previously the `CollectionTtlTimestamp` could be overflowed when the guarantee_ts==1, which means using `Eventually` consistency level. This patch use task timestamp, allocated by scheduler, to generate ttl timestamp ignore the potential very small timestamp being used. Also add overflow check for ttl timestamp calculated. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-25 20:48:42 +08:00
zhagnlu	69872f45ad	fix: fix is_not_in for trie index (#42716 ) #42604 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-25 16:52:42 +08:00
cai.zhang	ebe1c95bb1	enhance: Add Size interface to FileReader to eliminate the StatObject call during Read (#42908 ) issue: #42907 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-25 14:36:41 +08:00
aoiasd	e2566c0e92	enhance: bm25 stats local cache use local storage path (#42923 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-06-25 13:44:46 +08:00
XuanYang-cn	0dfe5308e1	enhance: Tidy Download and decode in segcore storage (#42902 ) 1. Unify calling from GetObjectData 2. Move SetData inside Deserialize See also: #40013 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-06-25 11:10:43 +08:00
sthuang	0d57acb13a	enhance: [StorageV2] field id as meta path for wide column when load (#42863 ) related: #42862 #39173 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-25 11:08:48 +08:00
sthuang	d4260b47fa	fix: [StorageV2] sync panic with add field (#42932 ) related: #39663 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-25 10:08:40 +08:00
sthuang	ad6d620e9f	fix: [StorageV2] Compiling debug mode throw DCHECK s3 initialize error (#42922 ) related: https://github.com/milvus-io/milvus/issues/42844 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-24 19:30:41 +08:00
Spade A	50f7579d8f	fix: fix some bugs discovered by chaos tests (#42906 ) fix: https://github.com/milvus-io/milvus/issues/42870 This PR fixes: 1. SetBitset fn shuold consider growing segments with concurrent write 2. avoid using from_raw_parts directly --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-06-24 16:32:42 +08:00
XuanYang-cn	0adf44e6f8	enhance: Check if segment has too many deletions together (#42668 ) This PR moves the deltalog file count check inside hasTooManyDeletions check. Unifies the logic on checking if a segment has too many deletions including: delta log count, deleted rows ratio and deltalog size. This change removes several uncessary traverse through segment's binlogs and deltalogs. And add more clear trigger logs Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-06-24 16:30:49 +08:00
Bingyi Sun	669ea51ce5	enhance: Make json index compatible with caching layer (#42484 ) issue: https://github.com/milvus-io/milvus/issues/42483 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-24 15:16:41 +08:00
congqixia	718cd203c6	fix: OR binary expr is prunable only when both children are prunable (#42912 ) Related to #42903 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-24 09:38:24 +08:00
zhagnlu	1024121ad9	fix:fix incorrect use of class member (#42885 ) #39173 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-23 20:36:46 +08:00
congqixia	0a0a6b3471	enhance: Fill dbName for `OperatePrivilegeV2Request` in interceptor (#42898 ) Related to #40340 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-23 18:04:41 +08:00
cai.zhang	59b003adac	enhance: Skip modify field meta when rename collection or rename dbName (#42875 ) issue: #42873 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-23 17:04:41 +08:00
congqixia	ee056f0bff	fix: [AddField] Fill default value in serde logic when field missing (#42891 ) Related to #42856 Default value will be missing after segment get sorted/compacted. This PR is a temp workaround since in long term default value shall be filled with storage engine instead. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-23 14:20:41 +08:00
Bingyi Sun	24e24caf14	fix: Remove cached null expr result (#42818 ) issue: #42698 cached result may be changed in caller so there is no need to cache it Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-23 10:44:40 +08:00
Zhen Ye	a081906fb4	enhance: smaller backoff configuration for wal balancer to make faster recovery (#42869 ) issue: #42835 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-23 10:32:40 +08:00
Xianhui Lin	b902960057	fix: revert remote jsonstats path (#42882 ) fix: revert remote jsonstats path relate-pr:https://github.com/milvus-io/milvus/pull/42676 issue:https://github.com/milvus-io/milvus/issues/42872 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-06-21 13:24:39 +08:00
cai.zhang	8f8ffe9989	fix: Reduce task slot for standalone to 1/4 of normal datanode (#42808 ) issue: #42129 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-06-20 16:38:46 +08:00
Spade A	e15926b40c	enhance: optimize tantivy cargo config (#42880 ) fix: https://github.com/milvus-io/milvus/issues/42879 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-06-20 16:17:49 +08:00
aoiasd	43a9f7a79e	enhance: Add and run rust format command in makefile (#42807 ) relate: https://github.com/milvus-io/milvus/issues/42806 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-06-20 10:22:39 +08:00
Zhen Ye	6798fdc3b3	fix: rocksmq cannot graceful stop (#42841 ) issue: #40532 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-19 19:38:39 +08:00
congqixia	74ea57bac1	enhance: Remove unused load field check from proxy (#42816 ) Related to #42489 Since load list works as hint after cachelayer implemented, the related check logic could be removed to keep code logic clean. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-19 19:34:47 +08:00
Zhen Ye	fadc053d7a	fix: filter new proxy when initializing proxy session at timeticksync (#42831 ) issue: #40532 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-19 16:44:40 +08:00
Zhen Ye	2fd8f910b0	fix: data duplicated when msgdispatcher make splitting (#42827 ) issue: #41570 Signed-off-by: chyezh <chyezh@outlook.com>	2025-06-19 16:32:39 +08:00
junjiejiangjjj	9865d672f7	fix: Model rerank supports Truncate (#42643 ) https://github.com/milvus-io/milvus/issues/42632 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-06-19 15:02:41 +08:00
sthuang	4a0a2441f2	enhance: [StorageV2] field id as meta path for wide column (#42787 ) related: #39173 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-06-19 15:00:38 +08:00

1 2 3 4 5 ...

10757 Commits