18783 Commits

Author SHA1 Message Date
aoiasd
cf02c623ab
fix: fix injection invalid bug by add inject task to handler inject when queue was empty (#31819)
relate: https://github.com/milvus-io/milvus/issues/31548

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2024-04-03 14:05:14 +08:00
PowderLi
a992334d0e
fix: [cherry-pick] use milvus-io/vcpkg (#31808)
issue: #31769
master pr: #31770
3.4 pr: #31799

GitHub Disables The XZ Repository because of
https://github.com/advisories/GHSA-rxwq-x6h5-x525

Signed-off-by: PowderLi <min.li@zilliz.com>
2024-04-02 11:09:14 +08:00
cqy123456
47f767cf32
enhance: remove float16 in 2.3 branch (#31720)
issue: https://github.com/milvus-io/milvus/issues/31696

Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>
2024-03-30 10:49:13 +08:00
foxspy
7394c5156c
enhance: Update Knowhere Cloud version (#31729)
/kind branch-feature

Signed-off-by: xianliang.li <xianliang.li@zilliz.com>
2024-03-29 18:15:17 +08:00
nico
c1ca0b2b0d
test: update chart version and some cases (#31686)
Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-03-29 14:09:18 +08:00
XuanYang-cn
055dd7ea1d
fix: Clear compaction tasks when release channel (#31694)
See also: #31648
pr: #31666

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2024-03-29 10:53:12 +08:00
cai.zhang
4a4a4b1061
fix: [Pick]Check whether the DataType of the column is legal (#31699)
issue: #31696 
master pr: #31697 
2.4 pr: #31698

---------

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2024-03-29 09:31:12 +08:00
XuanYang-cn
69931a6e7f
fix: Skip changing meta if nodeID not match with channel (#31665)
See also: #31648
pr: #31666

Signed-off-by: yangxuan <xuan.yang@zilliz.com>
2024-03-28 16:05:11 +08:00
congqixia
4021f4478f
enhance: [Cherry-pick] Save collection targets by batches (#31616) (#31655)
Cherry-pick from master
pr: #31616 
See also #28491 #31240

When colleciton number is large, querycoord saves collection target one
by one, which is slow and may block querycoord exits.

In local run, 500 collections scenario may lead to about 40 seconds
saving collection targets.

This PR changes the `SaveCollectionTarget` interface into batch one and
organizes the collection in 16 per bundle batches to accelerate this
procedure.

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-28 10:25:11 +08:00
congqixia
35100ba2e5
fix: [2.3] Make target observer auto/manual task mutual exclusive (#31584) (#31603)
Cherry-pick from master
pr: #31584
See also #30867

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-27 19:51:14 +08:00
sammy.huang
9d9f2cdf4d
enhance: enable docker buildkit (#31599)
pr:#31600

Signed-off-by: Liang Huang <sammy.huang@zilliz.com>
2024-03-27 16:47:11 +08:00
groot
91cdada12a
fix: minio ssl compatible issue (#31619)
issue: https://github.com/milvus-io/milvus/issues/30709
pr: https://github.com/milvus-io/milvus/pull/31607

Signed-off-by: yhmo <yihua.mo@zilliz.com>
2024-03-27 14:41:20 +08:00
jaime
8713765f4d
fix: proxy ctx cancel before revoking session (#31595)
issue: #31219
pr: #31596

Signed-off-by: jaime <yun.zhang@zilliz.com>
2024-03-26 11:45:09 +08:00
zhuwenxing
a6fe96f20a
test: [cherry-pick][2.3] add restful v2 testcase (#31583)
pr: https://github.com/milvus-io/milvus/pull/31404

---------

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>
2024-03-26 10:49:09 +08:00
aoiasd
50315282bb
fix: [Cherry-Pick] delegator filter out all partition's delete msg when loading segment (#31587)
May cause deleted data queryable a period of time.
issue : #31484 
pr: https://github.com/milvus-io/milvus/pull/31585

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2024-03-25 20:33:09 +08:00
congqixia
f9c8cd8e7b
enhance: Bump milvus & proto version to v2.3.13 (#31579)
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-25 18:03:10 +08:00
sammy.huang
f98e9c9100
enhance: to reduce container image size (#31570)
related pr:#31576

Signed-off-by: Liang Huang <sammy.huang@zilliz.com>
2024-03-25 18:01:17 +08:00
sre-ci-robot
2f0fa8488c
[automated] Update Pytest image changes (#31559)
Update Pytest image changes
See changes:
6b9004c728
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-03-25 16:43:08 +08:00
nico
6b9004c728
enhance: update pymilvus version (#31524)
pr: #31529

Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-03-25 10:47:08 +08:00
PowderLi
f2f0d44a5d
feat: [cherry-pick] restful phase two (#30430)
issue: #28348 #29732

Support to trace the grpc request, pr: #28349
Support to trace restful request and request error, pr: #28685

restful phase two, pr: #29728 #30343
include: collections, entities, partitions, users, roles, indexes,
aliases, import jobs

---------

Signed-off-by: SimFG <bang.fu@zilliz.com>
Signed-off-by: PowderLi <min.li@zilliz.com>
Co-authored-by: SimFG <bang.fu@zilliz.com>
2024-03-25 10:39:09 +08:00
aoiasd
7c234f23c3
fix: double buffer was invalid when put entry which size larger than max size (#31549)
relate: https://github.com/milvus-io/milvus/issues/31548

---------

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2024-03-23 21:09:07 +08:00
congqixia
368180bce4
fix: [2.3] Check nodeID before update channel checkpoint (#31473) (#31508)
Cherry-pick from master
pr: #31473
See also #31470 #31506

This PR adds nodeID assignment verification before updating channel
checkpoints.

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-23 07:07:07 +08:00
Jiquan Long
ab059bb064
enhance: add more metrics (#31271) (#31511)
/kind improvement
pr: #31271 
fix: https://github.com/milvus-io/milvus/issues/31272

This pr add more metrics, which are:

Slow query count, which the duration considered as slow can be
configurable;
Number of deleted entities;
Number of entities per collection;
Number of loaded entities per collection;
Number of indexed entities;
Number of indexed entities, per collection, per index and whether it's a
vetor index;
Quota states (LongTimeTickDelay, MemoryExhuasted, DiskQuotaExhuasted)
per database;

---------

Signed-off-by: longjiquan <jiquan.long@zilliz.com>
2024-03-22 16:11:07 +08:00
wei liu
ef523bfef3
fix: Unstable ut TestGetClientFailed (#31296) (#31472)
issue: #31295
pr: #31296

This PR fix unstable ut TestGetClientFailed

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-22 11:01:07 +08:00
wei liu
0bf595a513
enhance: Speed up target recovery after query coord restart (#31240) (#31449)
issue: #28491
pr: #31240

after querycoord restart, it will pull a new target, which include
channel and segment list. when segments loaded on querynode has reached
the target, the collection could provide search/query. but if segment
list changes by time, ater querycoord pull a new target, it will takes a
few minutes to catch up the target's segment distribution. and before
that, query/search will fail due to lack of segments.

This PR save the current loaded target to meta storein querycoord's stop
progress, and recover it when query coord starts, to speed up the target
recovery time.

---------

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-22 10:27:17 +08:00
wei liu
f8496dbc73
fix: Balance channel stuck forever due to logic dead lock (#31202) (#31455)
issue: #30816
pr: #31202

cause balance channel will stuck until leader view catch up the current
target, then start to unsub the old delegator. which make sure that the
new delegator can provide search before release old delegator. but
another logic in segment_checker skip loading segment during balance
channel. so during balance channel, if query node crash, new delegator
can't catch up target forever, then stuck forever.

This PR remove the rule that skip loading segment during balance channel
to avoid the logic dead lock here.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-21 18:11:07 +08:00
wei liu
c8658d17f8
fix: Grpcclient return unrecoverable error (#31256) (#31452)
issue: #31222
pr: #31256

grpcclient's `call` func return a unrecoverable error, then the caller's
retry policy also breaks due to this unrecoverable error.

This PR introduce `retry.Handle`, the new func use `func() (bool,
error)` as input parameters, which return `shouldRetry` directly, to
avoid grpcclient return a unrecoverable error

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-21 11:59:12 +08:00
wei liu
6b761204ce
fix: Set node unreachable when get shard client failed (#31277) (#31451)
issue: #30531
pr: #31277

cause get client from `shardClientMgr`, doesn't means query node is
unavailable. because of the ref counter policy in `shardClientMgr`,
which will clean the client, if no collection use qn as shard leader.

This PR fix that set node unreachable when get shard client failed.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-21 11:57:08 +08:00
wei liu
5994b6a7b0
fix: Search doesn't expire shard leader cache (#31380) (#31450)
issue: #31351
pr: #31380
This PR fixed that search doesn't expire shard leader cache when send
request to query node failed, which make every request keep trying to
connect a offline query node

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-21 11:55:07 +08:00
groot
1ca7cba222
enhance: Support MinIO TLS connection (#31292)
issue: https://github.com/milvus-io/milvus/issues/30709
master pr: #31311

Signed-off-by: yhmo <yihua.mo@zilliz.com>
Co-authored-by: Chen Rao <chenrao317328@163.com>
2024-03-21 11:15:20 +08:00
congqixia
94f3aec80a
enhance: [Cherry-pick] Add metrics for querycoord current target cp lag (#31391) (#31463)
Cherry-pick from master
pr: #31391 #31399
See also #31390

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-21 10:17:07 +08:00
wei liu
fef430daed
fix: Wrong behavior of CurrentTargetFirst/NextTargetFirst in target manager(#31379) (#31419)
issue: #31162
pr: #31379

when give scope CurrentTargetFirst/NextTargetFirst, it's expected to
scan both current and next target.

This PR fixed wrong behavior of CurrentTargetFirst/NextTargetFirst in
target manager, which may cause unexpected task generated, and load
collection may stuck forever due to dirty leader view.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-20 23:39:07 +08:00
cai.zhang
52a7eb9548
fix: Fix bug for get segment index state (#31429)
issue: #31361 
master pr: #31427 
2.4 pr: #31428

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2024-03-20 15:05:06 +08:00
congqixia
86e347a1a4
enhance: [2.3] Cache formatted key for param item (#31388) (#31402)
Cherry-pick from master
pr: #31388 
See also #30806

`formatKey` may cost lots of CPU on string processing under high QPS
scenario, this PR adds a formattedKeys cache preventing string operation
in each param get value.

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-19 19:25:10 +08:00
cai.zhang
ef530a2324
enhance: When describing an index, fetch the index info in batches (#31239)
issue: #29313 
master pr: #31238

Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
2024-03-15 16:37:09 +08:00
sre-ci-robot
e77afcb5d5
[automated] Bump milvus version to v2.3.12 (#31303)
Bump milvus version to v2.3.12
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-03-15 16:19:05 +08:00
nico
75a86bc2d3
test: update test cases (#31253)
Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-03-15 15:23:10 +08:00
Jiquan Long
50bfde92f2
fix: wrong num_entities used when mmap variable length data (#30848) (#31274)
https://github.com/milvus-io/milvus/issues/30728
pr: #30848

Signed-off-by: longjiquan <jiquan.long@zilliz.com>
v2.3.12
2024-03-14 20:33:03 +08:00
congqixia
4e48a4de0e
enhance: Bump milvus & proto version to v2.3.12 (#31193)
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-14 19:09:04 +08:00
jaime
5ddb0b435f
fix: revoke session may be ignored due to server context cancellation in advance (#31213)
issue: #31219
pr: #31220

Signed-off-by: jaime <yun.zhang@zilliz.com>
2024-03-14 19:05:04 +08:00
sre-ci-robot
a33751a2d7
[automated] Update Pytest image changes (#31235)
Update Pytest image changes
See changes:
645cc0bdc3
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-03-14 09:59:11 +08:00
nico
645cc0bdc3
test: update test cases (#31161)
Signed-off-by: nico <cheng.yuan@zilliz.com>
2024-03-13 19:05:11 +08:00
sre-ci-robot
5386a2c43e
[automated] Update Pytest image changes (#31108)
Update Pytest image changes
See changes:
005dbf2b24
Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2024-03-13 11:21:19 +08:00
chyezh
7105e0b261
fix: lost dbname when only passing collection id to describeCollection (#31177)
issue: #30931
pr: #31167

Signed-off-by: chyezh <chyezh@outlook.com>
2024-03-11 19:51:03 +08:00
aoiasd
e747f15c80
fix: flush insert data with nil buffer (#31159)
relate: https://github.com/milvus-io/milvus/issues/31165

Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>
2024-03-11 17:43:03 +08:00
wei liu
9d712f4dd4
fix: Balance param use duplicated key (#31112) (#31141)
pr: #31112
issue: #31115
This PR fix balance check interval  param use duplicated key

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-11 15:03:02 +08:00
wei liu
855f71ac89
fix: Dirty sealed segment won't release after channel balance (#31095) (#31126)
issue: #31074
pr: #31095
This PR fix dirty sealed segment doesn't release after channel balance,
dirty sealed segment means segment doesn't exist in targets.

Signed-off-by: Wei Liu <wei.liu@zilliz.com>
2024-03-11 15:01:11 +08:00
congqixia
3e7f2e8e7d
enhance: [Cherry-Pick] Use ListIndexes instead of DescribeIndex for qc broker (#31163)
Cherry pick from master 
pr: #31122

See also #31103

Since querycoord need index meta information from datacoord only, broker
shall use `ListIndexes` to skip segment index building check logic in
datacoord

This PR is also related to #30538, in which DescribeIndex caused lots of
memory usage and lead to OOM eventually

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-11 14:41:02 +08:00
pingliu
1dd4f4b4dc
enhance: jemalloc aarch64 platform use 64k pagesize. (#31114)
pr: https://github.com/milvus-io/milvus/pull/29522
enhance: jemalloc aarch64 platform use 64k pagesize.

Signed-off-by: ping.liu <ping.liu@zilliz.com>
2024-03-11 12:03:02 +08:00
congqixia
3c90475d55
enhance: [Cherry-pick] Add ListIndexes API from datacoord (#31104) (#31150)
Cherry-pick from master
pr: #31104
See also #31103

This PR add `listIndexes` API for datacoor server to list all indexes
for provided collection.
Comparing to the existing `DescribeIndex` API, the new one does NOT
check the segment index building progress to ease the burden when
invoking it

---------

Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
2024-03-11 10:47:02 +08:00