23363 Commits

Author SHA1 Message Date
github-actions[bot]
c8799279fd Update Builder image changes 2025-11-11 12:54:42 +00:00
zhenshan.cao
2f6940253d
fix: Add tzdata dependency to enable IANA Time Zone ID recognition (#45495)
Also Set timezone to UTC and ensure tzdata support (#45483)

issue: https://github.com/milvus-io/milvus/issues/45473
pr: https://github.com/milvus-io/milvus/pull/45475
https://github.com/milvus-io/milvus/pull/45483

Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>
2025-11-11 20:45:37 +08:00
congqixia
9ff5731c7d
fix: [2.6] Support JSON default values in FillFieldData (#45455) (#45470)
Cherry-pick from master
pr: #45455
Related to #45445

Previously, FillFieldData for JSON fields would assert and fail when a
default_value was provided, blocking index creation for JSON fields with
default values (including dynamic fields like $meta).

This change enables JSON default value support by:
- Removing the assertion that blocked default values
- Parsing bytes_data into Json objects when default_value is present
- Properly filling data_ array and setting valid_data_ bitset to true
- Maintaining null behavior when no default_value is provided

Impact:
- Fixes index creation failure for JSON fields with default values
- Resolves upgrade issues from 2.5 to 2.6.5 where dynamic fields with
default values couldn't be indexed
- Index builds that were stuck in InProgress state can now complete

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-11-11 18:21:37 +08:00
congqixia
7d13bdcf4c
fix: [2.6] correct field data offset calculation in rerank functions for bulk search (#45444) (#45482)
Cherry-pick from master
pr: #45444
Related to #45338

When using bulk vector search in hybrid search with rerank functions,
the output field values for different queries were all equal to the
values returned by the first query, instead of the correct values
belonging to each document ID. The document IDs were correct, but the
entity field values were wrong.

In rerank functions (RRF, weighted, decay, model), when processing
multiple queries in a batch, the `idLocations` stored only the relative
offset within each result set (`idx`), not accounting for the absolute
position within the entire batch. This caused `FillFieldData` to
retrieve field data from the wrong positions, always using offsets
relative to the first query.

This fix ensures that when processing bulk searches with rerank
functions, each result correctly retrieves its corresponding field data
based on the absolute offset within the entire batch, resolving the
issue where all queries returned the first query's field values.

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-11-11 18:15:37 +08:00
sparknack
aaa8f4335d
enhance: [2.6] some optimization of scalar field fetching in tiered storage scenarios (#45361)
issue: #43611
pr: #45360

---------

Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>
2025-11-11 17:27:40 +08:00
cai.zhang
8b16216e01
fix: [2.6]Fix filter geometry for growing with mmap (#45465)
issue: #45450
master pr: #45464

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-11 15:41:40 +08:00
Chun Han
85c8cca094
feat: milvus support huawei cloud iam verification(#45298) (#45312)
related: #45298
pr: https://github.com/milvus-io/milvus/pull/45457

Signed-off-by: MrPresent-Han <chun.han@gmail.com>
Co-authored-by: MrPresent-Han <chun.han@gmail.com>
2025-11-11 15:11:36 +08:00
Spade A
7cee398df1
fix: nextFieldID does not consider STRUCT [2.6] (#45438)
issue: https://github.com/milvus-io/milvus/issues/45362
pr: https://github.com/milvus-io/milvus/pull/45437

Signed-off-by: SpadeA <tangchenjie1210@gmail.com>
2025-11-11 11:29:35 +08:00
aoiasd
2637854a12
enhance: [2.6] fix typo of analyzer params (#45434)
pr: https://github.com/milvus-io/milvus/pull/45299

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2025-11-11 10:33:36 +08:00
Gao
1398a069d3
enhance: override index_type while creating segment index (#45417)
issue: #44752
pr: #45416

---------

Signed-off-by: chasingegg <chao.gao@zilliz.com>
2025-11-11 09:45:36 +08:00
zhuwenxing
7380684b9e
test: [2.6]add struct array mmap testcases (#45453)
master pr: https://github.com/milvus-io/milvus/pull/45309

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
2025-11-10 19:49:36 +08:00
Chun Han
94563fb4f2
fix: Group value is nil(#45418) (#45419)
related: #45418 
pr: https://github.com/milvus-io/milvus/pull/45422

Signed-off-by: MrPresent-Han <chun.han@gmail.com>
Co-authored-by: MrPresent-Han <chun.han@gmail.com>
2025-11-10 16:17:38 +08:00
yihao.dai
58f4afd3f0
enhance: [2.6] Add RBAC support for UpdateReplicateConfiguration (#45236)
issue: https://github.com/milvus-io/milvus/issues/44123

pr: https://github.com/milvus-io/milvus/pull/45123

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2025-11-08 17:43:34 +08:00
XuanYang-cn
7c37e444a2
fix: [cp26]Accurate size estimation for sliced arrow arrays in compaction (#45352)
Sliced arrow arrays "incorrectly" returned the original array's size via
SizeInBytes(), causing inaccurate memory estimates during compaction.

This resulted in segments closing prematurely in mergeSplit mode -
expected 500MB compactions produced 4x100+MB segments instead.

Fixed by calculating actual byte size of sliced arrays, ensuring proper
segment sizing and more accurate memory usage tracking.

See also: #45293
pr: #45294

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2025-11-07 18:07:35 +08:00
yihao.dai
0bfb5f6012
fix: [2.6] Fix data race in replicate stream client (#45347)
issue: https://github.com/milvus-io/milvus/issues/44123

pr: https://github.com/milvus-io/milvus/pull/45346

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2025-11-07 16:43:35 +08:00
XuanYang-cn
750af91c5a
test: Increase PyMilvus version to 2.6.4rc3 for 2.6 branch (#45386)
Automated daily bump from pymilvus 2.6 branch. Updates
tests/python_client/requirements.txt.

Signed-off-by: XuanYang-cn <xuan.yang@zilliz.com>
2025-11-07 15:23:38 +08:00
cai.zhang
12e3fb7655
fix: [2.6]Skip building text index for newly added columns (#45317)
issue: #45315 
master pr: #45316

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-07 15:21:42 +08:00
XuanYang-cn
f3e5a53fc5
fix: [2.6]Accidentally ignored sealed segments in L0 Compaction (#45341)
When there're no growing segments in the collection, L0 Compaction will
try to choose all L0 segments that hits all L1/L2 segments.

However, if there's Sealed Segment still under flushing in DataNode at
the same time L0 Compaction selects satisfied L1/L2 segments, L0
Compaction will ignore this Segment because it's not in "FlushState",
which is wrong, causing missing deletes on the Sealed Segment.

This quick solution here is to fail this L0 compaction task once
selected a Sealed segment.

See also: #45339
pr: #45340

---------

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2025-11-07 11:49:34 +08:00
congqixia
b856b396da
enhance: [2.6] Bump go version to 1.24.9 (#45369)
Cherry-pick from master
pr: #45359 
Fixing CVE-2025-58187

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-11-07 11:23:34 +08:00
cai.zhang
b33c58807a
enhance: [2.6] [test] Move R-Tree index tests into the implementation package (#45356)
master pr: #45355

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-07 10:05:35 +08:00
zhagnlu
27d737bbaf
enhance: disable jsonshredding for default config (#45349)
#42533

Signed-off-by: luzhang <luzhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-11-06 19:21:30 +08:00
congqixia
2c50d7e1f8
fix: [2.6] Move FinishLoad before text index creation to ensure raw data availability (#45335)
Cherry-pick from master
pr: #45334
Related to #45333

Fix segment loading failure when adding fields with text match enabled.
The issue occurred because text indexes were being loaded before
FinishLoad() was called, meaning raw data was not properly available
when text index creation attempted to access it, resulting in "failed to
create text index, neither raw data nor index are found" errors.

Solution is to move the FinishLoad() call to execute after raw data
loading but before text index loading. This ensures that:
1. Raw data is properly loaded and available in memory
2. Text indexes can access the raw data they need during creation
3. The segment is in the correct state before any index operations

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-11-06 17:11:34 +08:00
XuanYang-cn
19fe1423d6
test: Increase PyMilvus version to 2.6.4rc2 for 2.6 branch (#45272)
Automated daily bump from pymilvus 2.6 branch. Updates
tests/python_client/requirements.txt.

Signed-off-by: XuanYang-cn <xuan.yang@zilliz.com>
2025-11-06 16:53:38 +08:00
zhagnlu
d91470e59e
fix: not use json_shredding for json path is null (#45311)
pr: #45310

Signed-off-by: luzhang <luzhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-11-06 11:49:33 +08:00
zhenshan.cao
a42d3248e1
fix: cherry pick fixes related to timestamptz (#45321)
pr: https://github.com/milvus-io/milvus/pull/45111
https://github.com/milvus-io/milvus/pull/45287
issue: https://github.com/milvus-io/milvus/issues/44527
https://github.com/milvus-io/milvus/issues/44537
https://github.com/milvus-io/milvus/issues/44538
https://github.com/milvus-io/milvus/issues/44585
https://github.com/milvus-io/milvus/issues/44622
https://github.com/milvus-io/milvus/issues/44585

---------

Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>
2025-11-06 11:05:34 +08:00
sparknack
fb1b16186a
enhance: [2.6] unify the aligned buffer for both buffered and direct I/O (#45325)
issue: https://github.com/milvus-io/milvus/issues/43040
pr: https://github.com/milvus-io/milvus/pull/45323

Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>
2025-11-06 10:59:33 +08:00
yihao.dai
c19fa9f5c3
fix: [2.6] Fix load segment failed due to get disk usage error (#45300)
When getting disk usage, files or directories may be removed
concurrently due to segment release. This PR ignores “file or directory
does not exist” errors in such cases.

issue: https://github.com/milvus-io/milvus/issues/45239

pr: https://github.com/milvus-io/milvus/pull/45255

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2025-11-06 10:35:34 +08:00
congqixia
9a26b11614
fix: [2.6] Support JSON default value in compaction (#45331)
Cherry-pick from master
pr: #45330
Related to #45329

Fix compaction failure when handling newly added dynamic fields with
storage v1 binlogs. The issue occurred because the
`GenerateEmptyArrayFromSchema` function did not support JSON data type
default values, causing "Unexpected default value" errors during
compaction.

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-11-06 10:21:34 +08:00
cai.zhang
3df5f89cb0
fix: [2.6] Compute the correct batch size for the geometry index of the growing segment (#45261)
issue: #44648 
master pr: #45253

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-05 15:51:33 +08:00
foxspy
e1ea30b04c
enhance: [2.6] update knowhere version (#45271)
issue: #42937 
pr: #45270

Signed-off-by: xianliang.li <xianliang.li@zilliz.com>
2025-11-05 11:11:33 +08:00
zhagnlu
8942bb7594
enhance: rename jsonstats related user config params (#45252)
pr: #45254

Signed-off-by: luzhang <luzhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-11-05 10:35:32 +08:00
Zhen Ye
40806f9162
fix: ddl framework bug patch (#45292)
issue: #45080, #45274, #45285
pr: #45290

- LoadCollection doesn't ignore the ignorable request, for false field
array.
- CreatIndex doesn't ignore the ignorable request, for wrong index.
- index meta is not thread safe.
- lost parameter check of DDL.
- DDL Ack scheduler may get stuck and DDL is block until next incoming
DDL.

Signed-off-by: chyezh <chyezh@outlook.com>
2025-11-05 00:29:34 +08:00
Gao
844fa8c999
enhance: [2.6] make knowhere thread pool config refreshable (#45191)
pr: #45190

---------

Signed-off-by: chasingegg <chao.gao@zilliz.com>
2025-11-04 20:43:34 +08:00
Spade A
282798371d
fix: alter collection failed with MMAP setting for STRUCT [2.6] (#45240)
pr: https://github.com/milvus-io/milvus/pull/45173
issue: https://github.com/milvus-io/milvus/issues/45001
ref: https://github.com/milvus-io/milvus/issues/42148

---------

Signed-off-by: SpadeA <tangchenjie1210@gmail.com>
Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>
2025-11-04 20:25:37 +08:00
Zhen Ye
122d024df4
enhance: cherry pick patch of new DDL framework and CDC 3 (#45280)
issue: #43897, #44123
pr: #45266
also pick pr: #45237, #45264,#45244,#45275

fix: kafka should auto reset the offset from earliest to read (#45237)

issue: #44172, #45210, #44851,#45244

kafka will auto reset the offset to "latest" if the offset is
Out-of-range. the recovery of milvus wal cannot read any message from
that. So once the offset is out-of-range, kafka should read from eariest
to read the latest uncleared data.


https://kafka.apache.org/documentation/#consumerconfigs_auto.offset.reset

enhance: support alter collection/database with WAL-based DDL framework
(#45266)

issue: #43897

- Alter collection/database is implemented by WAL-based DDL framework
now.
- Support AlterCollection/AlterDatabase in wal now.
- Alter operation can be synced by new CDC now.
- Refactor some UT for alter DDL.

fix: milvus role cannot stop at initializing state (#45244)

issue: #45243

fix: support upgrading from 2.6.x -> 2.6.5 (#45264)

issue: #43897

---------

Signed-off-by: chyezh <chyezh@outlook.com>
2025-11-04 20:21:37 +08:00
congqixia
9d7ef929e1
fix: [2.6] Initialize timestamp range in composite binlog writer (#45283)
Related to #45282

Initialize `tsFrom` and `tsTo` fields in the composite binlog record
writer constructor to prevent timestamp range information loss in stats
tasks.

The composite binlog writer now properly initializes the timestamp range
fields, ensuring that:
1. The first timestamp update will correctly set the minimum (`tsFrom`)
2. The first timestamp update will correctly set the maximum (`tsTo`)
3. All subsequent data writes will maintain accurate timestamp range
tracking

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-11-04 19:47:34 +08:00
zhuwenxing
af766513f6
test: add struct array testcases (#44973)
/kind improvement

master pr: https://github.com/milvus-io/milvus/pull/44940
https://github.com/milvus-io/milvus/pull/45146

---------

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
2025-11-04 17:45:34 +08:00
cai.zhang
852b801e90
fix: [2.6] Skip create tmp dir for growing R-Tree index (#45257)
issue: #45181 

master pr: #45256

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-04 17:43:33 +08:00
congqixia
d490a5b4bf
enhance: [2.6] set schema version when creating new collection (#45263) (#45269)
Cherry pick from master
pr: #45263
Related to #43028

Initialize the schema version field when creating a new collection
instance in QueryNode. The schema version is extracted from loadMetaInfo
and assigned to the collection, ensuring proper schema version tracking
and consistency across the distributed system.

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-11-04 17:05:34 +08:00
sparknack
efaa538238
fix:[2.6] avoid potential race conditions when updating the executor (#45232)
issue: #43040 
pr: #45230

Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>
2025-11-04 16:11:33 +08:00
groot
f21d7ce05b
enhance: Support JSONL/NDJSON files for bulkinsert (#44602) (#44717)
issue: https://github.com/milvus-io/milvus/issues/44567
pr: https://github.com/milvus-io/milvus/pull/44602

Signed-off-by: yhmo <yihua.mo@zilliz.com>
2025-11-04 16:01:33 +08:00
yihao.dai
b8257facf2
enhance: [2.6] Wait for replicate stream client to finish (#45260)
Make channel replicator stop more gracefully.

issue: https://github.com/milvus-io/milvus/issues/44123

pr: https://github.com/milvus-io/milvus/pull/45259

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2025-11-04 14:23:33 +08:00
Spade A
bc47935600
feat: impl StructArray -- support diskann index [2.6] (#45234)
pr: https://github.com/milvus-io/milvus/pull/45223
issue: https://github.com/milvus-io/milvus/issues/42148

---------

Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>
Signed-off-by: SpadeA <tangchenjie1210@gmail.com>
2025-11-04 12:13:34 +08:00
Spade A
f26ce204ce
fix: allow "[" and "]" in index name [2.6] (#45194)
pr: https://github.com/milvus-io/milvus/pull/45193
issue: https://github.com/milvus-io/milvus/issues/42148

---------

Signed-off-by: SpadeA <tangchenjie1210@gmail.com>
Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>
2025-11-04 12:11:34 +08:00
zhagnlu
cb2bc2b41b
fix: fix bug for shredding json when empty but not null json (#45214)
pr: #45221

Signed-off-by: luzhang <luzhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-11-04 11:13:34 +08:00
cai.zhang
4bc2dc86a0
enhance: [2.6]Make GeometryCache an optional configuration (#45196)
issue: #45187 
master pr: #45192

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-04 09:51:33 +08:00
Zhen Ye
02e2170601
enhance: cherry pick patch of new DDL framework and CDC 2 (#45241)
issue: #43897, #44123
pr: #45224
also pick pr: #45216,#45154,#45033,#45145,#45092,#45058,#45029

enhance: Close channel replicator more gracefully (#45029)

issue: https://github.com/milvus-io/milvus/issues/44123

enhance: Show create time for import job (#45058)

issue: https://github.com/milvus-io/milvus/issues/45056

fix: wal state may be unconsistent after recovering from crash (#45092)

issue: #45088, #45086

- Message on control channel should trigger the checkpoint update.
- LastConfrimedMessageID should be recovered from the minimum of
checkpoint or the LastConfirmedMessageID of uncommitted txn.
- Add more log info for wal debugging.

fix: make ack of broadcaster cannot canceled by client (#45145)

issue: #45141

- make ack of broadcaster cannot canceled by rpc.
- make clone for assignment snapshot of wal balancer.
- add server id for GetReplicateCheckpoint to avoid failure.

enhance: support collection and index with WAL-based DDL framework
(#45033)

issue: #43897

- Part of collection/index related DDL is implemented by WAL-based DDL
framework now.
- Support following message type in wal, CreateCollection,
DropCollection, CreatePartition, DropPartition, CreateIndex, AlterIndex,
DropIndex.
- Part of collection/index related DDL can be synced by new CDC now.
- Refactor some UT for collection/index DDL.
- Add Tombstone scheduler to manage the tombstone GC for collection or
partition meta.
- Move the vchannel allocation into streaming pchannel manager.

enhance: support load/release collection/partition with WAL-based DDL
framework (#45154)

issue: #43897

- Load/Release collection/partition is implemented by WAL-based DDL
framework now.
- Support AlterLoadConfig/DropLoadConfig in wal now.
- Load/Release operation can be synced by new CDC now.
- Refactor some UT for load/release DDL.

enhance: Don't start cdc by default (#45216)

issue: https://github.com/milvus-io/milvus/issues/44123


fix: unrecoverable when replicate from old (#45224)

issue: #44962

---------

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
Signed-off-by: chyezh <chyezh@outlook.com>
Co-authored-by: yihao.dai <yihao.dai@zilliz.com>
2025-11-04 01:35:33 +08:00
yihao.dai
cefdd25ef7
enhance: [2.6] Don't start cdc by default (#45217)
issue: https://github.com/milvus-io/milvus/issues/44123

pr: https://github.com/milvus-io/milvus/pull/45216

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2025-11-03 20:51:33 +08:00
cai.zhang
7451d89a22
enhance: [2.6]Add log to debug index task (#45199)
master pr: #45198

---------

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-03 20:03:34 +08:00
Spade A
902eea8e2c
feat: implement ngram tokenizer with token_chars and custom_token_chars [2.6] (#45046)
pr: https://github.com/milvus-io/milvus/pull/45040
issue: https://github.com/milvus-io/milvus/issues/45039

Signed-off-by: SpadeA <tangchenjie1210@gmail.com>
2025-11-03 18:11:34 +08:00