20620 Commits

Author SHA1 Message Date
wei liu
93063ce1f9
fix: Prevent simultaneous balance of segments and channels (#37850) (#37939)
issue: #33550
pr: #37850
balance segment and balance segment execute at same time, which will
cause bounch of corner case.

This PR disable simultaneous balance of segments and channels

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-26 10:26:40 +08:00
congqixia
8601f3ed66
enhance: [2.4] Refine Replica manager colle2Replicas secondary index (#37906) (#37970)
Cherry-pick from master
pr: #37906
Related to #37630

This PR add a new util coll2Replicas secondary index to reduce map
access & iteration while get replicas by collection

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-26 10:20:35 +08:00
wei liu
b24510164e
enhance: Decouple shard client manager from shard cache (#37371) (#37753)
issue: #37115
pr: #37371 #37646 #37729
the old implementation update shard cache and shard client manager at
same time, which causes lots of conor case due to concurrent issue
without lock.

This PR decouple shard client manager from shard cache, so only shard
cache will be updated if delegator changes. and make sure shard client
manager will always return the right client, and create a new client if
not exist. in case of client leak, shard client manager will purge
client in async for every 10 minutes.

---------

---------

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Co-authored-by: congqixia <congqi.xia@zilliz.com>
2024-11-25 17:50:34 +08:00
Ted Xu
e928e15bfc
fix: refuse schedule compaction tasks if there is no slot (#37809)
See #37621


pr: #37589

---------

Signed-off-by: Ted Xu <ted.xu@zilliz.com>
Signed-off-by: Yinzuo Jiang <jiangyinzuo@foxmail.com>
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
Co-authored-by: Yinzuo Jiang <jiangyinzuo@foxmail.com>
Co-authored-by: yangxuan <xuan.yang@zilliz.com>
Co-authored-by: wei liu <wei.liu@zilliz.com>
2024-11-25 14:02:34 +08:00
wei liu
370f39db67
enhance: Remove unnecessary stack trace in error (#37816) (#37941)
pr: #37816

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-25 11:28:35 +08:00
wei liu
bb66636448
fix: Channel may be released after balance (#37862) (#37940)
issue: #37830
pr: #37862
casue dist handler doesn't set channel's version, so if channel checker
try to dedup channel, it may release the new delegator after balance
finished.

this PR fix the way to set proper version for channel.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-25 11:26:44 +08:00
sthuang
d8f1af68e9
enhance: [2.4] RBAC built in privilege groups and grant v2 (#37787)
cherry-pick from master: https://github.com/milvus-io/milvus/pull/37720,
https://github.com/milvus-io/milvus/pull/37785
issue: https://github.com/milvus-io/milvus/issues/37031

Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>
2024-11-25 11:24:54 +08:00
wei liu
ff6e8e2f2b
fix: [skip e2e] unstable ut TestResourceManager (#37761) (#37936)
issue: #37760
pr: #37761

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-25 11:06:34 +08:00
zhikunyao
cf1c423e9b
enhance: [skip e2e]2.4 update workflow macos to 13 (#37942)
Signed-off-by: Zhikun Yao <zhikun.yao@zilliz.com>
2024-11-22 18:56:32 +08:00
sre-ci-robot
3ceb494403
[automated] Bump milvus version to v2.4.17 (#37935)
Bump milvus version to v2.4.17
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-11-22 17:42:32 +08:00
yellow-shine
fc96133374
enhance: [2.4 pipeline] limit compute resource (#37889)
Signed-off-by: Yellow Shine <sammy.huang@zilliz.com>
v2.4.17
2024-11-22 14:14:38 +08:00
congqixia
4aca68a739
enhance: Bump milvus & proto verison to v2.4.17 (#37920)
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-22 14:08:34 +08:00
zhenshan.cao
9b3de3ac3e
fix: Revert "enhance: [2.4] Enable RemoteLoad l0 forward policy" (#37875)
issue https://github.com/milvus-io/milvus/issues/35303
pr: https://github.com/milvus-io/milvus/pull/37867
This reverts commit cdf703aabc2ec7e4addded68e808ba6add3ab2cb.

Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>
2024-11-22 12:40:33 +08:00
wei liu
e63a2f3559
fix: unstable integration test caused by paramtable.GetNodeID (#37910)
issue: #37908
pr: #37909
cause paramtable is global single instance, which cause
paramtable.GetNodeID may return wrong server id in integration test.

This PR use node.GetNodeID to replace paramtable.GetNodeID

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-22 10:14:33 +08:00
congqixia
0bd26171d5
enhance: [2.4] Provide secondary index criteria when filter leaderview (#37777) (#37802)
Cherry-pick from master
pr: #37777 
Related to #37630

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-21 10:48:33 +08:00
congqixia
28adfe4629
enhance: [2.4] Remove unnecessary segment clone updating dist (#37797) (#37833)
Cherry-pick from master
pr: #37797
Related to #37630

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-20 19:48:33 +08:00
sre-ci-robot
5ac4e4839e
[automated] Bump milvus version to v2.4.16 (#37790)
Bump milvus version to v2.4.16
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-11-20 11:46:37 +08:00
congqixia
cffde80e68
enhance: [2.4] Prevent generate "null" search params (#37811)
pr: #37812
Preventing generating null search params in restful search request

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
v2.4.16
2024-11-19 18:20:32 +08:00
Zhen Ye
ebfd917bb6
fix: make asan avaiable when building milvus image (#37804)
issue: #35854
pr: #37041

- USE_ASAN will not enable the Debug mode.
- replace USE_ASAN by `ldd`  to make generate right so in milvus image.

Signed-off-by: chyezh <chyezh@outlook.com>
Co-authored-by: yellow-shine <sammy.huang@zilliz.com>
2024-11-19 17:28:32 +08:00
congqixia
a10f95d71c
enhance: Bump milvus & proto version to v2.4.16 (#37762)
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-18 20:36:31 +08:00
congqixia
876e06b862
fix: [2.4] Load l0 delta for growings when using RemoteLoad (#37772)
Cherry-pick from master
pr: #37771
Related to #37574

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-18 20:26:31 +08:00
smellthemoon
46692d7525
enhance: support upsert autoid==true in Restful API and fix some bugs(#37072)(#37487) (#37766)
pr: #37072
pr: #37487

---------

Signed-off-by: lixinguo <xinguo.li@zilliz.com>
Co-authored-by: lixinguo <xinguo.li@zilliz.com>
2024-11-18 19:44:31 +08:00
wei liu
2a4f54cd4f
fix: L0 segment has been loaded to worker during channel balance (#37758)
issue: https://github.com/milvus-io/milvus/issues/37703
pr: https://github.com/milvus-io/milvus/pull/37748

---------

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-18 17:00:32 +08:00
foxspy
cabb55595a
enhance: update knowhere version (#37763)
/kind branch-feature

knowhere release note :
https://github.com/zilliztech/knowhere/releases/tag/v2.3.13

Signed-off-by: xianliang.li <xianliang.li@zilliz.com>
2024-11-18 16:30:32 +08:00
wei liu
79f676e7d8
enhance: Use batch to speed up list collections from meta kv (#37752)
issue: #36228
pr: #37742

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-18 15:58:33 +08:00
nico
bbd96e1829
test: update pymilvus version and test cases (#37711)
Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-11-18 14:14:32 +08:00
jaime
3ce27ca689
enhance: remove collection queryable check from health check (#37731)
pr: #37712

Signed-off-by: jaime <yun.zhang@zilliz.com>
2024-11-18 10:50:38 +08:00
yihao.dai
13f83df019
enhance: [2.4] Remove segment-level tag from monitoring metrics (#37737)
When there are a large number of segments, the metrics consume a lot of
memory. This PR Remove segment-level tag from monitoring metrics.

issue: https://github.com/milvus-io/milvus/issues/37636

pr: https://github.com/milvus-io/milvus/pull/37696

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2024-11-16 23:04:33 +08:00
yihao.dai
d29573551b
enhance: [2.4] Remove unnecessary clone in SetState (#37736)
issue: https://github.com/milvus-io/milvus/issues/37637

pr: https://github.com/milvus-io/milvus/pull/37697

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2024-11-16 19:04:34 +08:00
congqixia
cdf703aabc
enhance: [2.4] Enable RemoteLoad l0 forward policy by default (#37678) (#37713)
Cherry-pick from master
pr: #37678
Related to #35303

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-15 18:28:31 +08:00
smellthemoon
b3e6482367
enhance: add search params in search request in restful(#36304) (#37673)
pr: #36304 
pr: #36714 
pr: #36448

---------

Signed-off-by: lixinguo <xinguo.li@zilliz.com>
Co-authored-by: lixinguo <xinguo.li@zilliz.com>
Co-authored-by: zhuwenxing <wenxing.zhu@zilliz.com>
2024-11-15 17:54:30 +08:00
Zhen Ye
4e11fe7adf
enhance: make milvus image with asan available (#37682)
issue: #35854
pr: #37050

Signed-off-by: chyezh <chyezh@outlook.com>
2024-11-15 17:10:30 +08:00
wei liu
1bd502b585
fix: Delegator stuck at unserviceable status (#37694) (#37702)
issue: #37679
pr: #37694

pr #36549 introduce the logic error which update current target when
only parts of channel is ready.

This PR fix the logic error and let dist handler keep pull distribution
on querynode until all delegator becomes serviceable.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-15 14:52:30 +08:00
congqixia
e222289038
fix: [2.4] Store default value if ErrKeyNotFound is returned (#37691) (#37705)
Cherry-pick from master
pr: #37691
Related to #37690

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-15 14:50:32 +08:00
XuanYang-cn
5d5f899274
fix: [cp24]Change memoryCheck write lock to read lock (#37526)
pr: #37525

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2024-11-15 14:42:31 +08:00
wei liu
c50cb8d3ef
fix: Make GetShardLeaders only retries on retriable error (#37687)
issue: #37532
pr: #37684

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-14 21:14:40 +08:00
XuanYang-cn
d5cad01c22
enhance: [cp24]tidy compaction logs (#37595) (#37647)
Remove some annoying logs and lower a log level from warn to info

pr: #37595

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2024-11-14 18:44:31 +08:00
nico
2bf8773d58
enhance: update sdk version (#37661)
pr: #37660

Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-11-14 17:46:39 +08:00
XuanYang-cn
d23da2db4f
fix: [cp24]Correct varchar primarykey size calculation (#37619)
See also: #37582
pr: #37617

---------

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2024-11-14 14:16:38 +08:00
wei liu
28bcd85bd0
fix: Balance channel may stuck at increasing replica number case (#37642)
issue: #37640
pr: #37641
fix the pr #36549
cause balance channel will wait until new delegator becomes serviceable,
but new delegator need to sync target version then becomes serviceable,
and sync target version need to be wait all replica load done. so if
increasing replica number and balance channel happens at same time,
logic dead lock occurs.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-13 14:26:30 +08:00
congqixia
8801322371
enhance: [2.4] Invalidate collection cache when release collection (#37577) (#37628)
Cherry-pick from master
pr: #37577
Related to #37395

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-13 14:00:31 +08:00
congqixia
d073f322a4
enhance: [2.4] Add cgo call metrics for load/write API (#37405) (#37627)
Cherry-pick from master
pr: #37405

Cgo API cost is not observerable since not metrics is related to them.
This PR add metrics for some sync cgo call related to load & write

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-13 13:58:30 +08:00
wei liu
6dc879b1e2
enhance: Enable node assign policy on resource group (#36968) (#37588)
issue: #36977
pr: #36968
with node_label_filter on resource group, user can add label on
querynode with env `MILVUS_COMPONENT_LABEL`, then resource group will
prefer to accept node which match it's node_label_filter.

then querynode's can't be group by labels, and put querynodes with same
label to same resource groups.

---------

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-13 11:10:29 +08:00
wei liu
7d1c899155
fix: Search may return less result after qn recover (#36549) (#37610)
issue: #36293 #36242
pr: #36549
after qn recover, delegator may be loaded in new node, after all segment
has been loaded, delegator becomes serviceable. but delegator's target
version hasn't been synced, and if search/query comes, delegator will
use wrong target version to filter out a empty segment list, which
caused empty search result.

This pr will block delegator's serviceable status until target version
is synced

---------

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-12 19:16:30 +08:00
cai.zhang
3456e241ac
fix: [2.4]Fix the bug that retrieved from wrong field for L0 segments (#37599)
issue: #37574 

master pr: #37598

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2024-11-12 19:02:31 +08:00
wei liu
074f8ee696
enhance: optimize describe collection and index (#37490) (#37605)
fix #37489
pr: #34790
combine multiple describe collection and list index into one call

Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
Co-authored-by: Xiaofan <83447078+xiaofan-luan@users.noreply.github.com>
2024-11-12 16:54:29 +08:00
wei liu
25c96991f6
fix: Lost loading collection's updateTs after qc restart (#37538) (#37580)
issue: #37537
pr: #37538

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-11-11 17:50:30 +08:00
sthuang
9e8b6ace6d
enhance: [2.4] RBAC custom privilege group (#37560)
Cherry-pick from master
pr: https://github.com/milvus-io/milvus/pull/37087,
https://github.com/milvus-io/milvus/pull/37558
issue: #37031

---------

Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>
2024-11-11 14:20:29 +08:00
congqixia
2fbb157dc8
enhance: [2.4] Handle legacy proxy load fields request (#37565) (#37569)
Cherry-pick from master
pr: #37565
Related to #35415

In rolling upgrade, legacy proxy may dispatch load request wit empty
load field list. The upgraded querycoord may report error by mistake
that load field list is changed.

This PR:

- Auto field empty load field list with all user field ids
- Refine the error messag when load field list updates
- Refine load job unit test with service cases

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-11 14:06:29 +08:00
congqixia
4f4261157d
fix: [2.4] Add IP address validation from paramtable (#37416) (#37500)
Cherry-pick from master
pr: #37416
See also #37404 #37402

IP address in paramtable need validation and fail fast with reasonable
error message

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-11-11 10:12:28 +08:00