23470 Commits

Author SHA1 Message Date
Zhen Ye
3439c3eb45
fix: interleave the go and cpp log (#46005)
issue: #45640
pr: #46004

Signed-off-by: chyezh <chyezh@outlook.com>
v2.6.7 pkg/v2.6.7
2025-12-03 14:27:12 +08:00
sre-ci-robot
7109b2062a
test: e2e tests support to run on gcp and aws on 2.6 branch [p0] (#45980)
pr: #45621

---------

Signed-off-by: Zhikun Yao <zhikun.yao@zilliz.com>
Co-authored-by: zhikunyao <zhikun.yao@zilliz.com>
2025-12-03 11:01:10 +08:00
nico
7b6aa2175f
test: update sdk version (#46002)
pr: #45809

Signed-off-by: nico <cheng.yuan@zilliz.com>
2025-12-02 19:05:11 +08:00
wei liu
1e89b8b94a
enhance: [2.6] Upgrade pulsar-client-go to v0.17.0(#46007) (#46013)
issue: #46006
pr: #46007

Upgrade apache/pulsar-client-go from v0.15.1 to v0.17.0 to fix send
buffer race condition (apache/pulsar-client-go#1394)

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2025-12-02 18:47:10 +08:00
wei liu
c258d03a01
fix: [2.6] Enable leader checker to sync segment distribution to RO nodes (#45949) (#45991)
issue: #45865
pr: #45949

- Modified leader_checker.go to include all nodes (RO + RW) instead of
only RW nodes, preventing channel balance from getting stuck on RO nodes
- Added debug logging in segment_checker.go when no shard leader found
- Enhanced target_observer.go with detailed logging for delegator check
failures to improve debugging visibility
- Fixed integration tests:
- Temporarily disabled partial result counter assertion in
partial_result_on_node_down_test.go pending concurrent issue fix
- Increased transfer channel timeout from 10s to 20s in
manual_rolling_upgrade_test.go to avoid flaky test caused by target
update interval (10s)

---------

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2025-12-02 17:47:10 +08:00
congqixia
99eccffe03
enhance: [2.6] Bump Go SDK pkg dependency fixing code checker (#45992)
Follow up for #45971

Update the `milvus/pkg/v2` dependency in both root and client modules to
align with the latest v2.6.7 release, and improve Makefile lint-fix
target logging.

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-12-02 13:09:10 +08:00
Zhen Ye
b71a123d25
fix: write ahead buffer unittest failure (#45979)
issue: #45977 
pr: #45978

Signed-off-by: chyezh <chyezh@outlook.com>
2025-12-02 11:39:09 +08:00
zhuwenxing
ce762358af
test: [cp2.6]unify schema retrieval to use get_schema() method in chaos checker (#45986)
master pr: https://github.com/milvus-io/milvus/pull/45985



Replace direct self.schema access and describe_collection() calls with
get_schema() method to ensure consistent schema handling with complete
struct_fields information. Also fix FlushChecker error handling and
change schema log level from info to debug.

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
2025-12-02 10:51:10 +08:00
congqixia
af64f2acba
enhance: Bump milvus & proto version to v2.6.7 (#45971)
Bump milvus & proto version

Also bump golang.org/x/crypto to v0.45.0 fixing CVE-2025-47914

Related to #45976

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-12-01 20:03:10 +08:00
zhuwenxing
9b0e5cf032
test: [cp2.6]refactor connection method to prioritize uri/token and add query limit (#45948)
master pr: #45901

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
2025-12-01 19:55:10 +08:00
Zhen Ye
e9d920a785
enhance: support proxy DML forward (#45922)
issue: #45812
pr: #45921

- The 2.6 proxy will try to forward DML to the 2.5 proxy if the streaming
service is not ready

Signed-off-by: chyezh <chyezh@outlook.com>
2025-12-01 19:39:10 +08:00
Zhen Ye
9942454811
enhance: remove watch at session liveness check (#45974)
issue: #45724
pr: #45968

---------

Signed-off-by: chyezh <chyezh@outlook.com>
2025-12-01 19:29:10 +08:00
zhagnlu
731178a97f
enhance: remove some meta cache for json shredding (#45889)
pr: #45888

Signed-off-by: luzhang <luzhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-12-01 15:55:09 +08:00
zhagnlu
8ee392a682
enhance: make estimate json stats size more accurate (#45876)
pr: #45875

Signed-off-by: luzhang <luzhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-12-01 15:49:09 +08:00
zhuwenxing
511516a61b
test: [cp2.6]add dynamicfield.enabled property alter in chaos checker (#45950)
pr: #45625

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
2025-12-01 15:45:10 +08:00
zhuwenxing
9ac5d66dc6
test: [cp 2.6]fix apikey setting in restful v2 testcases (#45940)
master pr: #45396

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
2025-12-01 15:43:15 +08:00
junjiejiangjjj
2989fbc666
fix: [2.6] Sum AllSearchCount from multiple search results (#45904)
https://github.com/milvus-io/milvus/issues/45842
pr: https://github.com/milvus-io/milvus/pull/45914

Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>
2025-12-01 14:45:10 +08:00
congqixia
17ac58a731
fix: [2.6] always call handleNodeUp in rewatchNodes for proper stopping balance (#45963)
Cherry-pick from master
pr: #45961 
Related to #45960

When QueryCoord restarts or reconnects to etcd, the rewatchNodes
function previously skipped handleNodeUp for QueryNodes in stopping
state. This caused stopping balance to fail because necessary components
were not initialized:
- Task scheduler executor was not added
- Dist handler was not started
- Node was not registered in resource manager

This fix ensures handleNodeUp is always called for new nodes regardless
of their stopping state, followed by handleNodeStopping if the node is
stopping. This allows the graceful shutdown process to correctly migrate
segments and channels away from stopping nodes.
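
The ordering described above can be sketched as follows. This is a minimal illustration, not the QueryCoord code: the `coordinator` and `nodeInfo` types are hypothetical, and the handlers only record that they ran, standing in for the executor, dist handler, and resource manager setup.

```go
package main

import "fmt"

// nodeInfo is a hypothetical stand-in for a QueryNode session entry.
type nodeInfo struct {
	ID       int64
	Stopping bool
}

// coordinator is a hypothetical, stripped-down coordinator that only records
// which handlers ran, in order.
type coordinator struct {
	known  map[int64]bool
	events []string
}

// handleNodeUp stands in for the per-node initialization work.
func (c *coordinator) handleNodeUp(id int64) {
	c.events = append(c.events, fmt.Sprintf("up:%d", id))
}

// handleNodeStopping stands in for marking the node as draining.
func (c *coordinator) handleNodeStopping(id int64) {
	c.events = append(c.events, fmt.Sprintf("stopping:%d", id))
}

// rewatchNodes initializes every newly seen node first, and only then marks
// it as stopping when its session says so.
func (c *coordinator) rewatchNodes(sessions []nodeInfo) {
	for _, n := range sessions {
		if c.known[n.ID] {
			continue
		}
		c.known[n.ID] = true
		c.handleNodeUp(n.ID)
		if n.Stopping {
			c.handleNodeStopping(n.ID)
		}
	}
}

func main() {
	c := &coordinator{known: map[int64]bool{}}
	c.rewatchNodes([]nodeInfo{{ID: 1}, {ID: 2, Stopping: true}})
	fmt.Println(c.events)
}
```

The point of the sketch is the order of calls: a stopping node still goes through the "up" path, so whatever migrates its segments and channels has the state it needs.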

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-12-01 14:03:09 +08:00
Buqian Zheng
5b77d45a27
fix: [2.6] term expr to correctly handle in of string in json (#45956)
issue: https://github.com/milvus-io/milvus/issues/45887
pr: https://github.com/milvus-io/milvus/pull/45955

---------

Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>
2025-11-29 21:55:09 +08:00
sparknack
45bda55716
enhance: [2.6] always use buffered io for high load priority (#45958)
issue: #43040
pr: #45900

Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>
2025-11-29 20:53:08 +08:00
Xiaofan
b8e28d227d
enhance: [2.6]Upgrade etcd to 3.5.23 (#45953)
Upgrade the etcd client and dependency to 3.5.23
related: #45947
pr: #44666
issue: #44614

Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>
2025-11-29 16:38:09 +08:00
sparknack
52aa31324c
fix: [2.6] milvus-common update (#45930)
issue: #41435
pr: #45929

Fix some usage-tracking bugs in the caching layer.

Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>
2025-11-29 09:39:06 +08:00
sparknack
e0fe0d7169
enhance: [2.6] check both eviction and warmup when estimate segment loading size (#45891)
issue: #44857
pr: #45222

Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>
2025-11-29 02:19:09 +08:00
sparknack
7afa1e7ce4
enhance: [2.6] mmap once for each group chunk (#45893)
issue: #45486
pr: #45487

This commit refactors the chunk writing system by introducing a two-phase
approach: size calculation followed by writing to a target. This enables
efficient group chunk creation where multiple fields share a single mmap
region, significantly reducing the number of mmap system calls and VMAs.

- Optimize `mmap` usage: single `mmap` per group chunk instead of per
field
- Split ChunkWriter into two phases:
  - `calculate_size()`: Pre-compute required memory without allocation
  - `write_to_target()`: Write data to a provided ChunkTarget
- Implement `ChunkMmapGuard` for unified mmap region lifecycle
management
  - Handles `munmap` and file cleanup via RAII
  - Shared via `std::shared_ptr` across multiple chunks in a group
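
The two-phase idea can be illustrated with a small Go analogue. This is only a sketch under stated assumptions: the actual change is in C++, a plain byte slice stands in for the shared mmap region, alignment and file cleanup are ignored, and `fieldWriter`, `calculateSize`, `writeTo`, and `buildGroupChunk` are hypothetical names.

```go
package main

import "fmt"

// fieldWriter is a hypothetical two-phase writer for one field's data.
type fieldWriter struct {
	name string
	data []byte
}

// calculateSize reports how many bytes the field needs, without allocating.
func (w fieldWriter) calculateSize() int { return len(w.data) }

// writeTo copies the field into the provided target and returns the number
// of bytes written.
func (w fieldWriter) writeTo(target []byte) int { return copy(target, w.data) }

// buildGroupChunk sizes all fields first, allocates one region for the whole
// group (a stand-in for a single mmap call), then lets each field write into
// its own window of that region.
func buildGroupChunk(fields []fieldWriter) ([]byte, map[string][2]int) {
	total := 0
	for _, f := range fields {
		total += f.calculateSize()
	}
	region := make([]byte, total)
	offsets := make(map[string][2]int, len(fields))
	pos := 0
	for _, f := range fields {
		n := f.writeTo(region[pos : pos+f.calculateSize()])
		offsets[f.name] = [2]int{pos, pos + n}
		pos += n
	}
	return region, offsets
}

func main() {
	region, offsets := buildGroupChunk([]fieldWriter{
		{name: "a", data: []byte("abc")},
		{name: "b", data: []byte("de")},
	})
	fmt.Println(string(region), offsets["a"], offsets["b"])
}
```

Because sizes are known before anything is written, one allocation covers the whole group instead of one per field.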



---------

Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>
2025-11-28 23:55:09 +08:00
sparknack
5d3e4dd038
enhance: [2.6] add cancellation checking in each operator and expr (#45894)
issue: #45353
pr: #45354

Signed-off-by: Shawn Wang <shawn.wang@zilliz.com>
2025-11-28 19:41:08 +08:00
wei liu
b2ef076597
fix: [2.6] prevent panic in standby mixcoord during shutdown #45859 (#45898)
issue: #45728
pr: #45730
When mixcoord is in standby mode and shutdown is triggered, the
ProcessActiveStandBy goroutine may panic if context cancellation occurs.
This happens because the error handling didn't check for
context.Canceled errors before panicking.

Changes:
- Add context cancellation check in mix_coord Register() before panic
- Check s.ctx.Err() == context.Canceled and gracefully exit
- Remove unused ForceActiveStandby() function from session_util

This ensures standby mixcoord can shutdown gracefully without panic when
context is cancelled during the standby process.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2025-11-28 19:33:09 +08:00
Spade A
5ba7c4ed35
fix: fix false negative panic on missing fields [2.6] (#45903)
pr: https://github.com/milvus-io/milvus/pull/45902
issue: https://github.com/milvus-io/milvus/issues/45834

Signed-off-by: SpadeA <tangchenjie1210@gmail.com>
2025-11-28 18:53:08 +08:00
Feilong Hou
eaed10538d
test: add e2e test cases for Timestamptz (#45800)
Issue: #44518, #45756
pr: #44871, #45128, #45770, #45524,  #44794, #45014

---------

Signed-off-by: Eric Hou <eric.hou@zilliz.com>
Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
Co-authored-by: Eric Hou <eric.hou@zilliz.com>
Co-authored-by: zhuwenxing <wenxing.zhu@zilliz.com>
2025-11-28 18:01:09 +08:00
zhenshan.cao
b948c62413
feat: Add /livez for Liveness Probes (#45454) (#45481)
issue: https://github.com/milvus-io/milvus/issues/45443
pr: https://github.com/milvus-io/milvus/pull/45454

Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>
2025-11-28 18:00:00 +08:00
Zhen Ye
3e96bcc979
enhance: support async write syncer for milvus logging (#45806)
issue: #45640
pr: #45805

- Logs may be dropped if the underlying file system is busy.
- Use an async write syncer so that log operations do not block the main
Milvus system.
- Remove some log dependencies from the util functions to avoid a
dependency loop.

---------

Signed-off-by: chyezh <chyezh@outlook.com>
2025-11-28 17:55:08 +08:00
yihao.dai
20592fb236
enhance: [2.6] Skip redundant failure marking for completed import jobs (#45768)
Skip redundant failure marking for completed import jobs when the
collection is dropped or the import jobs time out.

issue: https://github.com/milvus-io/milvus/issues/45766

pr: https://github.com/milvus-io/milvus/pull/45767

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2025-11-28 17:47:08 +08:00
Buqian Zheng
adf2be6506
enhance: batch cp optimizations to 2.6 (#45869)
issue: #44452
pr: #45829
pr: #45328
pr: #45307
pr: #45008
pr: #44634

---------

Signed-off-by: zhagnlu <lu.zhang@zilliz.com>
Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>
Co-authored-by: zhagnlu <lu.zhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-11-28 17:43:11 +08:00
cai.zhang
07727e7f25
fix: [2.6] Ensure the proxy's shard-leader cache remains stable for coord down test (#45909)
issue: #45847 
master pr: #45908 

After a collection is successfully loaded, the shard-leader state on the
QC may still not be marked as serviceable. It becomes serviceable only
after the scheduled distribution update runs, which will also invalidate
the shard-leader cache on the proxy. Therefore, even if queries are
already executable, the shard-leader mapping on the proxy may still
change afterward.

Try to ensure—as much as possible—that the proxy’s shard-leader cache
remains stable before killing the mixcoord.

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-28 17:29:08 +08:00
Zhen Ye
da601d2a3c
fix: remove the streamingnode checking when loading segment (#45860)
issue: #43117
pr: #45859

If the check is enabled when loading segments, every segment must be
loaded by a streamingnode rather than a 2.5 querynode, which causes some
search and query requests to fail during an upgrade. With the check
disabled, some search and query results may be wrong during an upgrade.
We disable the check for now to keep search and query available while
upgrading.

also see pr: #43346

Signed-off-by: chyezh <chyezh@outlook.com>
2025-11-28 11:23:08 +08:00
congqixia
18493d5cf5
fix: [2.6] Add EmptySessionWatcher to prevent panic in IndexNodeBinding mode (#45912)
Cherry-pick from master
pr: #45911
Related to #45910

When IndexNodeBinding mode is enabled, DataCoord skips session watching
for datanodes but the dnSessionWatcher field remains nil. This causes a
panic when other code attempts to access the watcher.

This fix introduces an EmptySessionWatcher as a placeholder for the
IndexNodeBinding mode scenario. The empty watcher implements the
SessionWatcher interface with no-op methods, preventing nil pointer
dereferences while maintaining the expected interface contract.
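
The placeholder pattern can be sketched as below. The interface shape here (`EventChannel` and `Stop`) is an assumption made for illustration, and `newDataNodeWatcher` and `chanWatcher` are hypothetical; the real SessionWatcher interface may declare different methods.

```go
package main

import "fmt"

// SessionEvent and SessionWatcher are simplified, assumed shapes.
type SessionEvent struct{ NodeID int64 }

type SessionWatcher interface {
	EventChannel() <-chan *SessionEvent
	Stop()
}

// emptySessionWatcher is a no-op placeholder used when nothing is watched.
type emptySessionWatcher struct{}

// EventChannel returns a nil channel: a receive on it is never ready, so a
// select on it simply never fires.
func (emptySessionWatcher) EventChannel() <-chan *SessionEvent { return nil }

// Stop does nothing because there is nothing to tear down.
func (emptySessionWatcher) Stop() {}

// chanWatcher is a hypothetical "real" watcher backed by a channel.
type chanWatcher struct{ ch chan *SessionEvent }

func (w *chanWatcher) EventChannel() <-chan *SessionEvent { return w.ch }
func (w *chanWatcher) Stop()                              { close(w.ch) }

// newDataNodeWatcher always returns a usable watcher, never nil.
func newDataNodeWatcher(indexNodeBinding bool) SessionWatcher {
	if indexNodeBinding {
		return emptySessionWatcher{}
	}
	return &chanWatcher{ch: make(chan *SessionEvent, 1)}
}

func main() {
	w := newDataNodeWatcher(true)
	select {
	case ev := <-w.EventChannel():
		fmt.Println("event:", ev)
	default:
		fmt.Println("no events")
	}
	w.Stop() // safe: the field is never nil, so no nil pointer dereference
}
```

Callers keep a single code path and do not need a nil check at every use of the watcher.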

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2025-11-28 10:21:08 +08:00
tinswzy
1403bfdc68
enhance: [2.6] improve WAL retention strategy (#45784)
pr: #45350
issue: #44369

Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>
2025-11-28 10:07:08 +08:00
Zhen Ye
d20b332e63
fix: executor/scheduler should be latest replica meta but not replica copy (#45878)
issue: #45865
pr: #45877

---------

Signed-off-by: chyezh <chyezh@outlook.com>
2025-11-28 08:49:08 +08:00
zhenshan.cao
4cdeea5ddd
enhance: timestamptz support groupby (#45763)
issue: https://github.com/milvus-io/milvus/issues/45761
pr: https://github.com/milvus-io/milvus/pull/45762

Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>
2025-11-27 20:01:08 +08:00
zhagnlu
49c6a5382c
enhance: support mmap for jsonstats shared key index (#45861)
pr: #44914

Signed-off-by: luzhang <luzhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-11-27 16:07:08 +08:00
cai.zhang
58e4673081
fix:[2.6]Reduce qc check node in replica interval for test (#45838)
issue: #45791 
master pr: #45837

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-27 15:09:09 +08:00
Chun Han
195022a8f0
enhance: remove useless code(#30376) (#45713)
related: #30376
pr: https://github.com/milvus-io/milvus/pull/45685

Signed-off-by: MrPresent-Han <chun.han@gmail.com>
Co-authored-by: MrPresent-Han <chun.han@gmail.com>
2025-11-27 13:59:08 +08:00
Zhen Ye
507fe62027
fix: LastConfirmedMessageID may be wrong if high concurrent writing (#45874)
issue: #45872
pr: #45873

Signed-off-by: chyezh <chyezh@outlook.com>
2025-11-27 12:03:07 +08:00
cai.zhang
1a2c21205d
fix: [2.6] Increase the random suffix of the import test collection name (#45855)
issue: #45853 
master pr: #45854

---------

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-27 10:09:07 +08:00
Xiaofan
cbb79dae8c
enhance: [2.6] add robust handle etcd servercrash (#45633)
issue: #45303
pr: #45304
Fix: the Milvus pod may restart when the etcd pod starts.

---------

Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>
2025-11-27 07:45:08 +08:00
Xiaofan
607c9ef40c
fix: [2.6]listImport and getImportProgress should follow import access (#45862)
issue: #45709
pr: #45822

Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>
2025-11-27 02:11:10 +08:00
zhagnlu
fafec35bdc
fix:fix undefined behavior for dump snapshot (#45612)
pr: #45611

Signed-off-by: luzhang <luzhang@zilliz.com>
Co-authored-by: luzhang <luzhang@zilliz.com>
2025-11-26 20:17:12 +08:00
jac
59e1b0b5b7
test: update pymilvus to 2.6.4rc13 for milvus2.6 (#45836)
Signed-off-by: silas.jiang <silas.jiang@zilliz.com>
Co-authored-by: silas.jiang <silas.jiang@zilliz.com>
2025-11-26 15:47:09 +08:00
Buqian Zheng
ce302c1555
enhance: [2.6] add vector reserve to improve memory allocation in segcore (#45759)
This commit optimizes std::vector usage across segcore by adding
reserve() calls where the size is known in advance, reducing memory
reallocations during push_back operations.

Changes:

- TimestampIndex.cpp: Reserve space for prefix_sums and timestamp_barriers
- SegmentGrowingImpl.cpp: Reserve space for binlog info vectors
- ChunkedSegmentSealedImpl.cpp: Reserve space for futures and field data
vectors
- storagev2translator/GroupChunkTranslator.cpp: Reserve space for metadata
vectors

This improves performance by avoiding multiple memory reallocations when
the vector size is predictable.

issue: https://github.com/milvus-io/milvus/issues/45679
pr: https://github.com/milvus-io/milvus/pull/45757

Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>
2025-11-26 10:33:09 +08:00
cai.zhang
feffbf90d2
fix: [2.6]Search before kill coord to ensure proxy init shard leader cache (#45849)
issue: #45847 
master pr: #45848

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2025-11-26 04:05:11 +08:00
Bingyi Sun
a5008ea509
fix: Replace json.doc() calls with json.dom_doc() in JsonContainsExpr (#45786)
issue: https://github.com/milvus-io/milvus/issues/45783
pr: https://github.com/milvus-io/milvus/pull/45573

Signed-off-by: sunby <sunbingyi1992@gmail.com>
2025-11-25 20:15:07 +08:00