milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-02-02 01:06:41 +08:00

Author	SHA1	Message	Date
congqixia	c86d68bea5	enhance: [2.5] Bump arrow/go to v17 (#44663 ) Related to #40777 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-09 11:47:57 +08:00
ZhuXi	cd931a0388	feat:Geospatial Data Type and GIS Function support for milvus (#43661 ) issue: #43427 pr: #37417 This pr's main goal is merge #37417 to milvus 2.5 without conflicts. # Main Goals 1. Create and describe collections with geospatial type 2. Insert geospatial data into the insert binlog 3. Load segments containing geospatial data into memory 4. Enable query and search can display geospatial data 5. Support using GIS funtions like ST_EQUALS in query # Solution 1. Add Type: Modify the Milvus core by adding a Geospatial type in both the C++ and Go code layers, defining the Geospatial data structure and the corresponding interfaces. 2. Dependency Libraries: Introduce necessary geospatial data processing libraries. In the C++ source code, use Conan package management to include the GDAL library. In the Go source code, add the go-geom library to the go.mod file. 3. Protocol Interface: Revise the Milvus protocol to provide mechanisms for Geospatial message serialization and deserialization. 4. Data Pipeline: Facilitate interaction between the client and proxy using the WKT format for geospatial data. The proxy will convert all data into WKB format for downstream processing, providing column data interfaces, segment encapsulation, segment loading, payload writing, and cache block management. 5. Query Operators: Implement simple display and support for filter queries. Initially, focus on filtering based on spatial relationships for a single column of geospatial literal values, providing parsing and execution for query expressions.Now only support brutal search 6. Client Modification: Enable the client to handle user input for geospatial data and facilitate end-to-end testing.Check the modification in pymilvus. --------- Signed-off-by: Yinwei Li <yinwei.li@zilliz.com> Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Co-authored-by: cai.zhang <cai.zhang@zilliz.com>	2025-08-26 19:11:55 +08:00
wei liu	4631657304	fix: Unstable integration case TestBalanceOnSingleReplica (#43552 ) issue: #42930 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-25 10:52:55 +08:00
wei liu	ad0bf9cad8	enhance: Optimize channel node balancing for uneven QN distribution (#42786 ) (#43423 ) issue: #42860 pr: #42786 Fix channel node allocation when QueryNode count is not a multiple of channel count. The previous algorithm used simple division which caused uneven distribution with remainders. Key improvements: - Implement smart remainder distribution algorithm - Refactor large function into focused helper functions - Support two-phase rebalancing (release then allocate) - Handle edge cases like insufficient nodes gracefully --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-21 17:04:54 +08:00
wei liu	b08d9efe69	fix: Prevent delegator unserviceable due to shard leader change (#42689 ) (#43309 ) issue: #42098 #42404 pr: #42689 Fix critical issue where concurrent balance segment and balance channel operations cause delegator view inconsistency. When shard leader switches between load and release phases of segment balance, it results in loading segments on old delegator but releasing on new delegator, making the new delegator unserviceable. The root cause is that balance segment modifies delegator views, and if these modifications happen on different delegators due to leader change, it corrupts the delegator state and affects query availability. Changes include: - Add shardLeaderID field to SegmentTask to track delegator for load - Record shard leader ID during segment loading in move operations - Skip release if shard leader changed from the one used for loading - Add comprehensive unit tests for leader change scenarios This ensures balance segment operations are atomic on single delegator, preventing view corruption and maintaining delegator serviceability. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-07-15 17:46:51 +08:00
yihao.dai	f978641d6a	enhance: [2.5] Enhance import integration tests and logs (#42696 ) 1. Optimize the import process: skip subsequent steps and mark the task as complete if the number of imported rows is 0. 2. Improve import integration tests: a. Add a test to verify that autoIDs are not duplicated b. Add a test for the corner case where all data is deleted c. Shorten test execution time 3. Enhance import logging: a. Print imported segment information upon completion b. Include file name in failure logs issue: https://github.com/milvus-io/milvus/issues/42488, https://github.com/milvus-io/milvus/issues/42518 pr: https://github.com/milvus-io/milvus/pull/42612 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-06-16 20:06:38 +08:00
yihao.dai	a7c818cadb	fix: [2.5] Fix no candidate segments error for small import (#41772 ) When autoID is enabled, the preimport task estimates row distribution by evenly dividing the total row count (numRows) across all vchannels: `estimatedCount = numRows / vchannelNum`. However, the actual import task hashes real auto-generated IDs to determine the target vchannel. This mismatch can lead to inaccurate row distribution estimation in such corner cases: - Importing 1 row into 2 vchannels: • Preimport: 1 / 2 = 0 → both v0 and v1 are estimated to have 0 rows • Import: real autoID (e.g., 457975852966809057) hashes to v1 → actual result: v0 = 0, v1 = 1 To resolve such corner case, we now allocate at least one segment for each vchannel when autoID is enabled, ensuring all vchannels are prepared to receive data even if no rows are estimated for them. issue: https://github.com/milvus-io/milvus/issues/41759 pr: https://github.com/milvus-io/milvus/pull/41771 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-05-14 10:36:22 +08:00
SimFG	18eb627533	fix: [2.5] Update logging context and upgrade dependencies (#41319 ) - issue: #41291 - pr: #41318 --------- Signed-off-by: SimFG <bang.fu@zilliz.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2025-04-24 23:50:40 +08:00
SimFG	a945345110	fix: [2.5] use the different msg type for the OperatePrivilegeV2 api (#40193 ) - issue: #40178 - pr: #40192 Signed-off-by: SimFG <bang.fu@zilliz.com>	2025-03-03 10:20:01 +08:00
congqixia	709594f158	enhance: [2.5] Use v2 package name for pkg module (#40117 ) Cherry-pick from master pr: #39990 Related to #39095 https://go.dev/doc/modules/version-numbers Update pkg version according to golang dep version convention Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-23 00:46:01 +08:00
wei liu	d92ffb66a1	fix: [skip e2e] [2.5]data race in load test (#39846 ) Related to https://github.com/milvus-io/milvus/pull/39701 pr: #39845 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-02-18 15:04:50 +08:00
wei liu	11cba57dc7	fix: [2.5] load collection stucks if compaction/gc happens (#39761 ) issue: #39680 pr: #39701 if compaction/gc happens, load collection may stuck due to SegmentNotFound, we should trigger UpdateNextTarget to get a new data view to execute loading operation. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-02-11 15:48:50 +08:00
Zhen Ye	858dc10ef9	enhance: broadcast with event-based notification (#39550 ) issue: #38399 pr: #39522 - broadcast message can carry multi resource key now. - implement event-based notification for broadcast messages - broadcast message use broadcast id as a unique identifier in message - broadcasted message on vchannels keep the broadcasted vchannel now. - broadcasted message and broadcast message have a common broadcast header now. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-02-07 11:50:50 +08:00
sthuang	4a47f760b3	fix: [2.5] rbac custom group privilege level check (#39164 ) (#39224 ) cherry-pick from master: https://github.com/milvus-io/milvus/pull/39164 related: https://github.com/milvus-io/milvus/issues/39086 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-01-14 16:50:59 +08:00
Zhen Ye	54036bcafd	enhance: add broadcast operation for msgstream (#39119 ) issue: #38399 pr: #39040 - make broadcast service available for msgstream by reusing the architecture streaming service --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-01-14 15:11:00 +08:00
sthuang	3059fae40d	fix: [2.5] fix restore rbac empty meta crash (#39143 ) cp from master: https://github.com/milvus-io/milvus/pull/39141 related: https://github.com/milvus-io/milvus/issues/38985 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2025-01-13 12:04:58 +08:00
Zhen Ye	95809ca767	enhance: make new go package to manage proto (#39128 ) issue: #39095 pr: #39114 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-01-10 10:53:01 +08:00
sthuang	a4901ef7ec	fix: [2.5] fix privilege group list and list collections (#38738 ) cherry-pick from: https://github.com/milvus-io/milvus/pull/38684 related: https://github.com/milvus-io/milvus/issues/37031 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-12-25 18:06:50 +08:00
wei liu	b16d04d7cc	fix: Fix update loading collection's load config doesn't work (#38737 ) issue: #38594 pr: #38595 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-12-25 15:02:50 +08:00
XuanYang-cn	ca7ec23198	enhance: Use partitionID when delete by partitionKey (#38231 ) When delete by partition_key, Milvus will generates L0 segments globally. During L0 Compaction, those L0 segments will touch all partitions collection wise. Due to the false-positive rate of segment bloomfilters, L0 compactions will append false deltalogs to completed irrelevant partitions, which causes *partition deletion amplification. This PR uses partition_key to set targeted partitionID when producing deleteMsgs into MsgStreams. This'll narrow down L0 segments scope to partition level, and remove the false-positive influence collection-wise. However, due to DeleteMsg structure, we can only label one partition to one deleteMsg, so this enhancement fails if user wants to delete over 2 partition_keys in one deletion. See also: #34665 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-12-20 11:18:46 +08:00
tinswzy	27229f7907	enhance: refine exists log print with ctx (#38080 ) issue: #35917 Refines exists log print with ctx Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2024-12-14 22:36:44 +08:00
wei liu	a118ca14a7	fix: Fix role be dropped when grant still exist. (#38342 ) issue: #38325 the old impl only to check grant in default db before drop role, which may cause role be dropped when grant still exist. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-12-11 11:24:42 +08:00
cai.zhang	73aa95f596	fix: Add version to the proxy cache to resolve concurrency issues (#38067 ) issue: #36989 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-12-04 18:06:39 +08:00
shaoyue	1f66b9ebfb	feat: add config field to set internal tls sni (#38124 ) /cc @xiaofan-luan @jaime0815 @nish112022 part of https://github.com/milvus-io/milvus/issues/36864 Signed-off-by: haorenfsa <haorenfsa@gmail.com>	2024-12-04 14:56:47 +08:00
sthuang	a5e0a56a8e	fix: move grant/revoke v2 params check from rootcoord to proxy (#38130 ) related issue: https://github.com/milvus-io/milvus/issues/37031 fixed issues #38042: The interface "grant_v2" does not support empty collectionName while the error says it does Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-12-02 19:48:37 +08:00
sthuang	23dc313c44	fix: fix grant/revoke v2 meta and unclear error messages (#38110 ) related issue: https://github.com/milvus-io/milvus/issues/37031 fixed issues: #37974: better error messages for grant v2 interface #37903: fix meta built-in privilege group object name #37843: better error messages for custom privilege group interface #38002: fix built-in privilege group meta to pass proxy interceptor check #38008: fix revoke v2 to support revoking v1 granted privileges Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-12-02 11:36:39 +08:00
sthuang	19572f5b06	enhance: RBAC new grant/revoke privilege (#37785 ) issue: https://github.com/milvus-io/milvus/issues/37031 also fix issues: https://github.com/milvus-io/milvus/issues/37843, https://github.com/milvus-io/milvus/issues/37842, https://github.com/milvus-io/milvus/issues/37887 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-11-21 22:20:34 +08:00
nish112022	484c6b5c44	feat: Added code for Internal-tls (#36865 ) issue : https://github.com/milvus-io/milvus/issues/36864 I have a few questions regarding my approach.I will consolidate them here for feedback and review.Thanks --------- Signed-off-by: Nischay Yadav <nischay.yadav@ibm.com> Signed-off-by: Nischay <Nischay.Yadav@ibm.com>	2024-11-20 06:00:32 +08:00
sthuang	2d72ad33f2	enhance: RBAC built in privilege groups (#37720 ) issue: #37031 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-11-18 20:38:39 +08:00
wei liu	a1b6be1253	fix: Delegator stuck at unserviceable status (#37694 ) issue: #37679 pr #36549 introduce the logic error which update current target when only parts of channel is ready. This PR fix the logic error and let dist handler keep pull distribution on querynode until all delegator becomes serviceable. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-15 10:20:31 +08:00
jaime	1e8ea4a7e7	feat: add segment/channel/task/slow query render (#37561 ) issue: #36621 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-11-12 17:44:29 +08:00
wei liu	266f8ef1f5	fix: Search may return less result after qn recover (#36549 ) issue: #36293 #36242 after qn recover, delegator may be loaded in new node, after all segment has been loaded, delegator becomes serviceable. but delegator's target version hasn't been synced, and if search/query comes, delegator will use wrong target version to filter out a empty segment list, which caused empty search result. This pr will block delegator's serviceable status until target version is synced --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-12 16:34:28 +08:00
Zhen Ye	b5b003551e	enhance: use localhost for it and ut (#37529 ) issue: #37528 Signed-off-by: chyezh <chyezh@outlook.com>	2024-11-12 11:36:27 +08:00
XuanYang-cn	a45a288a25	fix: Separate L0 and Mix trigger interval (#37190 ) See also: #37108 - Add MixCompactionTriggerInterval, default 60s - Add L0CompactionTriggerInterval, default 10s - Export Single related compaction configs - Raise SingleCompactionDeltaLogMaxSize from 2MB to 16MB --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-11-12 10:56:37 +08:00
sthuang	ff00a12805	enhance: RBAC custom privilege group ut coverage (#37558 ) issue: https://github.com/milvus-io/milvus/issues/37031 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-11-09 20:40:25 +08:00
sthuang	70605cf5b3	enhance: Support custom privilege group for RBAC (#37087 ) issue: #37031 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-11-09 08:44:28 +08:00
cai.zhang	ae227e3934	enhance: Add integration test for stats task (#37506 ) issue: #33744 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-11-08 15:48:26 +08:00
wei liu	7cfd609ebc	fix: [skip e2e]unstable integration test TestNodeDownOnSingleReplica (#37480 ) issue: #37289 cause pr #37116 introduce retry on get shard leader, which make search won't fail during query node down. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-06 17:08:21 -08:00
wei liu	f190e5d802	fix: [skip e2e] TestNodeDownOnSingleReplica has unstable result (#37288 ) issue: #37289 those test case use search to verify replica's status, but if the search gap is 1s, the node down's effect may be fixed up by balance. This PR remove the 1 second gap between search operation. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-01 10:50:21 +08:00
zhenshan.cao	63843dce33	fix: Fix conan gdal building problem (#37338 ) issue:https://github.com/milvus-io/milvus/issues/27576 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-10-31 21:04:16 +08:00
Hao Tan	67c4340565	feat: Geospatial Data Type and GIS Function Support for milvus server (#35990 ) issue:https://github.com/milvus-io/milvus/issues/27576 # Main Goals 1. Create and describe collections with geospatial fields, enabling both client and server to recognize and process geo fields. 2. Insert geospatial data as payload values in the insert binlog, and print the values for verification. 3. Load segments containing geospatial data into memory. 4. Ensure query outputs can display geospatial data. 5. Support filtering on GIS functions for geospatial columns. # Solution 1. Add Type: Modify the Milvus core by adding a Geospatial type in both the C++ and Go code layers, defining the Geospatial data structure and the corresponding interfaces. 2. Dependency Libraries: Introduce necessary geospatial data processing libraries. In the C++ source code, use Conan package management to include the GDAL library. In the Go source code, add the go-geom library to the go.mod file. 3. Protocol Interface: Revise the Milvus protocol to provide mechanisms for Geospatial message serialization and deserialization. 4. Data Pipeline: Facilitate interaction between the client and proxy using the WKT format for geospatial data. The proxy will convert all data into WKB format for downstream processing, providing column data interfaces, segment encapsulation, segment loading, payload writing, and cache block management. 5. Query Operators: Implement simple display and support for filter queries. Initially, focus on filtering based on spatial relationships for a single column of geospatial literal values, providing parsing and execution for query expressions. 6. Client Modification: Enable the client to handle user input for geospatial data and facilitate end-to-end testing.Check the modification in pymilvus. --------- Signed-off-by: tasty-gumi <1021989072@qq.com>	2024-10-31 20:58:20 +08:00
foxspy	d7b2ffe5aa	enhance: add an unify vector index config checker (#36844 ) issue: #34298 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-10-28 10:11:37 +08:00
yihao.dai	ed37c27bda	fix: Fix collection leak in querynode (#37061 ) Unref the removed L0 segment count. issue: https://github.com/milvus-io/milvus/issues/36918 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-25 19:59:29 +08:00
jaime	4746f47282	feat: management WebUI homepage (#36822 ) issue: #36784 1. Implement an embedded web server for WebUI access. 2. Complete the homepage development. Home page demo: <img width="2177" alt="iShot_2024-10-10_17 57 34" src="https://github.com/user-attachments/assets/38539917-ce09-4e54-a5b5-7f4f7eaac353"> Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-23 11:29:28 +08:00
jaime	ef1832ff9c	enhance: enable manual compaction for collections without indexes (#36577 ) issue: #36576 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-08 19:57:18 +08:00
wayblink	00a5025949	enhance: support clustering compaction on null value (#36372 ) issue: #36055 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-09-30 14:33:17 +08:00
Zhen Ye	d29e01e284	fix: port listen racing in mix or standalone mode (#36442 ) issue: #36441 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-09-26 21:23:16 +08:00
wei liu	3cd0b26285	enhance: Enable dynamic update loaded collection's replica (#35822 ) issue: #35821 After collection loaded, if we need to increase/decrease collection's replica, we need to release and load it again. milvus offers 4 solution to update loaded collection's replica, this PR aims to dynamic change the replica number without release, and after replica number changed, milvus will execute load replica or release replica in async, and the replica loaded status can be checked by getReplicas API. Notice that if set too much replicas than querynode can afford，the new replica won't be loaded successfully until enough querynode joins. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-25 10:13:18 +08:00
smellthemoon	89397d1e66	enhance: adjust parquet reader type check with null type (#36266 ) #36252 remove no need type check. if users use null type writer to write parquet, hope it successfully. Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-09-19 18:43:10 +08:00
congqixia	c0317ce672	fix: Wait check node id goroutine in case of data race (#36302 ) Resolves: #36301 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-19 10:41:10 +08:00

1 2 3 4 5

226 Commits