milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-07 19:31:51 +08:00

Author	SHA1	Message	Date
Spade A	0454cdaab3	fix: remove validateFieldName in dropIndex (#45460 ) issue: https://github.com/milvus-io/milvus/issues/45459 This check is unnecessary when dropping index. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-11-14 10:17:37 +08:00
junjiejiangjjj	102481e53f	feat: Support add_function/alter_function/drop_function (#44895 ) https://github.com/milvus-io/milvus/issues/44053 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-11-13 20:53:39 +08:00
Spade A	929dc65882	fix: fix index compatibility after upgrade (#45373 ) issue: https://github.com/milvus-io/milvus/issues/45380 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-11-13 12:59:38 +08:00
junjiejiangjjj	50f198e346	feat: Support zilliz models (#45168 ) https://github.com/milvus-io/milvus/issues/35856 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-11-13 12:55:37 +08:00
Chun Han	87b466fd83	fix: Group value is nil(#45418 ) (#45422 ) related: #45418 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-11-08 10:29:33 +08:00
aoiasd	6102f001a9	enhance: skip check source id (#45377 ) relate:https://github.com/milvus-io/milvus/issues/45381 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-11-07 15:19:34 +08:00
zhenshan.cao	4a936fae8e	fix: timezone parameter ingored in Search/Query (#45320 ) issue: https://github.com/milvus-io/milvus/issues/44598 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2025-11-05 19:19:33 +08:00
zhenshan.cao	6327c9a514	fix: Fix bugs related to TimestampTz (#45111 ) issue: https://github.com/milvus-io/milvus/issues/44527 https://github.com/milvus-io/milvus/issues/44537 https://github.com/milvus-io/milvus/issues/44538 https://github.com/milvus-io/milvus/issues/44585 https://github.com/milvus-io/milvus/issues/44622 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2025-11-04 16:51:33 +08:00
Spade A	2b5241fe5a	fix: allow "[" and "]" in index name (#45193 ) issue: https://github.com/milvus-io/milvus/issues/42148 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>	2025-11-04 11:59:34 +08:00
aoiasd	ed69375f00	enhance: remove resource type from file resource config (#45103 ) File resource type was useless till now, remove it before new release. relate: https://github.com/milvus-io/milvus/issues/43687 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-11-03 10:15:32 +08:00
Zhen Ye	309d564796	enhance: support collection and index with WAL-based DDL framework (#45033 ) issue: #43897 - Part of collection/index related DDL is implemented by WAL-based DDL framework now. - Support following message type in wal, CreateCollection, DropCollection, CreatePartition, DropPartition, CreateIndex, AlterIndex, DropIndex. - Part of collection/index related DDL can be synced by new CDC now. - Refactor some UT for collection/index DDL. - Add Tombstone scheduler to manage the tombstone GC for collection or partition meta. - Move the vchannel allocation into streaming pchannel manager. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-10-30 14:24:08 +08:00
cai.zhang	c33d221536	fix: Fix bug for importing Geometry data (#45089 ) issue: #44787 , #45012 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-10-27 20:34:11 +08:00
yihao.dai	8d11373376	enhance: Show create time for import job (#45058 ) issue: https://github.com/milvus-io/milvus/issues/45056 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-10-27 12:14:08 +08:00
Spade A	d8591f9548	fix: csv/json import with STRUCT adapts concatenated struct name (#45000 ) After https://github.com/milvus-io/milvus/pull/44557, the field name in STRUCT field becomes STRUCT_NAME[FIELD_NAME] This PR make import consider the change. issue: https://github.com/milvus-io/milvus/issues/45006 ref: https://github.com/milvus-io/milvus/issues/42148 TODO: parquet is much more complex than csv/json, and I will leave it to a separate PR. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-10-24 10:22:15 +08:00
aoiasd	cfeb095ad7	enhance: forbid build analyzer at proxy (#44067 ) relate: https://github.com/milvus-io/milvus/issues/43687 We used to run the temporary analyzer and validate analyzer on the proxy, but the proxy should not be a computation-heavy node. This PR move all analyzer calculations to the streaming node. --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-10-23 10:58:12 +08:00
congqixia	314ee81712	enhance: standardize mockery config for proxy package (#45030 ) Related to #44761 #38339 This commit consolidates the proxy package's mockery generation to use a centralized `.mockery.yaml` configuration file, aligning with the pattern used by other packages like querycoordv2. Changes - Makefile: Replace multiple individual mockery commands with a single config-based invocation for `generate-mockery-proxy` target - internal/proxy/.mockery.yaml: Add mockery configuration defining all mock interfaces for proxy and proxy/shardclient packages - Mock files: Regenerate mocks using the new configuration: - `mock_cache.go`: Clean up by removing unused interface methods (credential, shard cache, policy methods) - `shardclient/mock_lb_balancer.go`: Update type comments (nodeInfo → NodeInfo) - `shardclient/mock_lb_policy.go`: Update formatting - `shardclient/mock_shardclient_manager.go`: Fix parameter naming consistency (nodeInfo1 → nodeInfo) - task_search_test.go: Remove obsolete mock expectations for deprecated cache methods Benefits - Centralized mockery configuration for easier maintenance - Consistent with other packages (querycoordv2, etc.) - Cleaner mock interfaces by removing unused methods - Better type consistency in generated mocks Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-22 16:00:05 +08:00
congqixia	6c34386ff2	enhance: extract shard client logic into dedicated package (#45018 ) Related to #44761 Refactor proxy shard client management by creating a new internal/proxy/shardclient package. This improves code organization and modularity by: - Moving load balancing logic (LookAsideBalancer, RoundRobinBalancer) to shardclient package - Extracting shard client manager and related interfaces into separate package - Relocating shard leader management and client lifecycle code - Adding package documentation (README.md, OWNERS) - Updating proxy code to use the new shardclient package interfaces This change makes the shard client functionality more maintainable and better encapsulated, reducing coupling in the proxy layer. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-22 10:22:04 +08:00
Zhen Ye	2aa48bf4ca	fix: wrong execution order of DDL/DCL on secondary (#44886 ) issue: #44697, #44696 - The DDL executing order of secondary keep same with order of control channel timetick now. - filtering the control channel operation on shard manager of streamingnode to avoid wrong vchannel of create segment. - fix that the immutable txn message lost replicate header. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-10-21 22:38:05 +08:00
wei liu	9dddc380a4	fix: Handle empty FieldsData in reduce/rerank for requery scenario (#44917 ) issue: #44909 When requery optimization is enabled, search results contain IDs but empty FieldsData. During reduce/rerank operations, if the first shard has empty FieldsData while others have data, PrepareResultFieldData initializes an empty array, causing AppendFieldData to panic when accessing array indices. Changes: - Find first non-empty FieldsData as template in 3 functions: reduceAdvanceGroupBy, reduceSearchResultDataWithGroupBy, reduceSearchResultDataNoGroupBy - Add length check before 2 AppendFieldData calls in reduce functions to prevent panic - Improve newRerankOutputs to find first non-empty fieldData using len(FieldsData) check instead of GetSizeOfIDs - Add length check in appendResult before AppendFieldData - Add comprehensive unit tests for empty and partial empty FieldsData scenarios in both reduce and rerank functions This fix handles both pure requery (all empty) and mixed scenarios (some empty, some with data) without breaking normal search flow. The key improvement is checking FieldsData length directly rather than IDs, as requery may have IDs but empty FieldsData. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-10-21 19:42:03 +08:00
yihao.dai	0c61d969bb	fix: Fix Fix replication txn data loss during chaos (#44963 ) Only confirm CommitMsg for txn messages to prevent data loss. issue: https://github.com/milvus-io/milvus/issues/44962, https://github.com/milvus-io/milvus/issues/44123 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-10-21 14:16:04 +08:00
Zhen Ye	a3a28a4b99	fix: rerank before requery if reranker didn't use field data (#44942 ) issue: #44918 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-10-20 14:26:02 +08:00
Spade A	34f54da155	fix: reject GEOMETRY and TIMESTAMPTZ in STRUCT (#44937 ) issue: https://github.com/milvus-io/milvus/issues/44930 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-10-20 11:32:05 +08:00
cai.zhang	d6aa213799	fix: Fix return EOF when geometry enable null (#44911 ) issue: #44648 If the value is `null` during insertion, it will be omitted instead of being filled with nil. Therefore, when performing checks, there’s no need to retrieve data based on the valid offset. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-10-17 17:12:17 +08:00
Spade A	9c2aeaa258	fix: empty internal InsertMsg caues panic (#44903 ) fix: https://github.com/milvus-io/milvus/issues/44901 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-10-17 10:28:01 +08:00
Spade A	6c8e353439	feat: impl StructArray -- ban non-float-vector for now (#44875 ) ref https://github.com/milvus-io/milvus/issues/42148 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-10-17 10:26:09 +08:00
Bingyi Sun	633cae9461	enhance: add namespace for query and search request (#44343 ) issue: #44011 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-10-16 17:52:01 +08:00
wei liu	529c98520c	enhance: Add nullable support for Geometry and Timestamptz types (#44846 ) issue: #44800 This commit enhances the upsert and validation logic to properly handle nullable Geometry (WKT/WKB) and Timestamptz data types: - Add ToCompressedFormatNullable support for TimestamptzData, GeometryWktData, and GeometryData to filter out null values during data compression - Implement GenNullableFieldData for Timestamptz and Geometry types to generate nullable field data structures - Update FillWithNullValue to handle both GeometryData and GeometryWktData with null value filling logic - Add UpdateFieldData support for Timestamptz, GeometryData, and GeometryWktData field updates - Comprehensive unit tests covering all new data type handling scenarios Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-10-15 14:04:00 +08:00
Spade A	c4f3f0ce4c	feat: impl StructArray -- support more types of vector in STRUCT (#44736 ) ref: https://github.com/milvus-io/milvus/issues/42148 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>	2025-10-15 10:25:59 +08:00
cqy123456	4b84ba2189	fix:remove the limit of deduplicate case when disable autoindex (#44825 ) issue: https://github.com/milvus-io/milvus/issues/44702 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2025-10-14 16:14:00 +08:00
yihao.dai	cebe923d4a	enhance: Make GetReplicateInfo API work at the pchannel level (#44809 ) issue: https://github.com/milvus-io/milvus/issues/44123 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-10-14 15:12:00 +08:00
congqixia	f5f053f1d2	enhance: Refactor privilege management by extracting privilege cache into separate package (#44762 ) Related to #44761 This commit refactors the privilege management system in the proxy component by: 1. Separation of Concerns: Extracts privilege-related functionality from MetaCache into a dedicated `internal/proxy/privilege` package, improving code organization and maintainability. 2. New Package Structure: Creates `internal/proxy/privilege/` with: - `cache.go`: Core privilege cache implementation (PrivilegeCache) - `result_cache.go`: Privilege enforcement result caching - `model.go`: Casbin model and policy enforcement functions - `meta_cache_adapter.go`: Casbin adapter for MetaCache integration - Corresponding test files and mock implementations 3. MetaCache Simplification: Removes privilege and credential management methods from MetaCache interface and implementation: - Removed: GetCredentialInfo, RemoveCredential, UpdateCredential - Removed: GetPrivilegeInfo, GetUserRole, RefreshPolicyInfo, InitPolicyInfo - Deleted: meta_cache_adapter.go, privilege_cache.go and their tests 4. Updated References: Updates all callsites to use the new privilegeCache global: - Authentication interceptor now uses privilegeCache for password verification - Credential cache operations (InvalidateCredentialCache, UpdateCredentialCache, UpdateCredential) now use privilegeCache - Policy refresh operations (RefreshPolicyInfoCache) now use privilegeCache - Privilege interceptor uses new privilege.GetEnforcer() and privilege result cache 5. Improved API: Renames cache functions for clarity: - GetPrivilegeCache → GetResultCache - SetPrivilegeCache → SetResultCache - CleanPrivilegeCache → CleanResultCache This refactoring makes the codebase more modular, separates privilege management concerns from general metadata caching, and provides a clearer API for privilege enforcement operations. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-13 11:15:58 +08:00
congqixia	7b8ecdaad5	enhance: Add accesslog field for template value length info (#44723 ) Related to #36672 Add accesslog field displaying value length for search/query request may help developers debug related issues --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-11 18:23:57 +08:00
cai.zhang	7a93cfe890	fix: Fix bug for nullable geometry (#44732 ) issue: #44648 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-10-11 11:27:57 +08:00
Spade A	208481a070	feat: impl StructArray -- support same names in different STRUCT (#44557 ) ref: https://github.com/milvus-io/milvus/issues/42148 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-10-10 15:53:56 +08:00
congqixia	e83c7e0c92	fix: Use eventually & fix task id appear in both executing&completed (#44698 ) Related to #44620 This PR: - Use eventually instead of `time.Sleep` in accesslog writer unit test - Make sure compaction task results have only one state from executor API Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-10 10:05:56 +08:00
congqixia	efbe6669ec	enhance: Make `accesslog.$consistency_level` represent actual value used (#44706 ) Related to #44703 This PR: - Add `SetActualConsistencyLevel` to `info.AccessInfo` interface and related util method processing it - Make `$consistency_level` returning actual value if set --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-09 21:03:58 +08:00
Gao	3cc59a0d69	enhance: add storage usage for delete/upsert/restful (#44512 ) #44212 Also, record metrics only when storageUsageTracking is enabled. Use MB for scanned_remote counter and scanned_total counter metrics to avoid overflow. --------- Signed-off-by: chasingegg <chao.gao@zilliz.com>	2025-09-30 00:31:06 +08:00
aoiasd	294282f1d2	enhance: support use nullable field as bm25 function input field (#44586 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-09-29 10:25:05 +08:00
cai.zhang	19346fa389	feat: Geospatial Data Type and GIS Function support for milvus (#44547 ) issue: #43427 This pr's main goal is merge #37417 to milvus 2.5 without conflicts. # Main Goals 1. Create and describe collections with geospatial type 2. Insert geospatial data into the insert binlog 3. Load segments containing geospatial data into memory 4. Enable query and search can display geospatial data 5. Support using GIS funtions like ST_EQUALS in query 6. Support R-Tree index for geometry type # Solution 1. Add Type: Modify the Milvus core by adding a Geospatial type in both the C++ and Go code layers, defining the Geospatial data structure and the corresponding interfaces. 2. Dependency Libraries: Introduce necessary geospatial data processing libraries. In the C++ source code, use Conan package management to include the GDAL library. In the Go source code, add the go-geom library to the go.mod file. 3. Protocol Interface: Revise the Milvus protocol to provide mechanisms for Geospatial message serialization and deserialization. 4. Data Pipeline: Facilitate interaction between the client and proxy using the WKT format for geospatial data. The proxy will convert all data into WKB format for downstream processing, providing column data interfaces, segment encapsulation, segment loading, payload writing, and cache block management. 5. Query Operators: Implement simple display and support for filter queries. Initially, focus on filtering based on spatial relationships for a single column of geospatial literal values, providing parsing and execution for query expressions.Now only support brutal search 7. Client Modification: Enable the client to handle user input for geospatial data and facilitate end-to-end testing.Check the modification in pymilvus. --------- Signed-off-by: Yinwei Li <yinwei.li@zilliz.com> Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Co-authored-by: ZhuXi <150327960+Yinwei-Yu@users.noreply.github.com>	2025-09-28 19:43:05 +08:00
Bingyi Sun	4f61f4ee22	fix: Alter allow_insert_autoid via AlterCollection (#44530 ) issue: #44425 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-09-28 11:09:04 +08:00
foxspy	13c3b0b909	enhance: add autoindex configuration for the int8 vector type (#44554 ) issue: #38666 Add int8 support for autoindex to ensure it can be independently configured. At the same time, remove the restriction on int8 type for vectorDiskIndex (note that vectorDiskIndex only determines the building and loading method of the index, not the index type). Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-09-24 17:48:04 +08:00
congqixia	99598ae5ec	enhance: Add param item for hybrid search requery policy (#44466 ) Related to #39757 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-09-24 17:32:04 +08:00
junjiejiangjjj	f07979f91d	enhance: add support for controlling function output field insertion (#44162 ) #44053 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-09-24 17:26:04 +08:00
Tianx	4d5afec9a8	fix: upsert error for timestamptz (#44548 ) issue: https://github.com/milvus-io/milvus/issues/44527 Signed-off-by: xtx <xtianx@smail.nju.edu.cn>	2025-09-24 10:28:04 +08:00
Gao	d5255b5eef	fix: set report_value default value when extrainfo is not nil for compatibility (#44529 ) https://github.com/milvus-io/milvus/issues/44523 Signed-off-by: chasingegg <chao.gao@zilliz.com>	2025-09-23 22:26:05 +08:00
Bingyi Sun	96e1de4e22	feat: allow users to write pk field when autoid is enabled (#44424 ) https://github.com/milvus-io/milvus/issues/44425 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-09-23 16:10:04 +08:00
Tianx	2c0c5ef41e	feat: timestamptz expression & index & timezone (#44080 ) issue: https://github.com/milvus-io/milvus/issues/27467 >My plan is as follows. >- [x] M1 Create collection with timestamptz field >- [x] M2 Insert timestamptz field data >- [x] M3 Retrieve timestamptz field data >- [x] M4 Implement handoff >- [x] M5 Implement compare operator >- [x] M6 Implement extract operator >- [x] M8 Support database/collection level default timezone >- [x] M7 Support STL-SORT index for datatype timestamptz --- The third PR of issue: https://github.com/milvus-io/milvus/issues/27467, which completes M5, M6, M7, M8 described above. ## M8 Default Timezone We will be able to use alter_collection() and alter_database() in a future Python SDK release to modify the default timezone at the collection or database level. For insert requests, the timezone will be resolved using the following order of precedence: String Literal-> Collection Default -> Database Default. For retrieval requests, the timezone will be resolved in this order: Query Parameters -> Collection Default -> Database Default. In both cases, the final fallback timezone is UTC. ## M5: Comparison Operators We can now use the following expression format to filter on the timestamptz field: - `timestamptz_field [+/- INTERVAL 'interval_string'] {comparison_op} ISO 'iso_string' ` - The interval_string follows the ISO 8601 duration format, for example: P1Y2M3DT1H2M3S. - The iso_string follows the ISO 8601 timestamp format, for example: 2025-01-03T00:00:00+08:00. - Example expressions: "tsz + INTERVAL 'P0D' != ISO '2025-01-03T00:00:00+08:00'" or "tsz != ISO '2025-01-03T00:00:00+08:00'". ## M6: Extract We will be able to extract sepecific time filed by kwargs in a future Python SDK release. The key is `time_fields`, and value should be one or more of "year, month, day, hour, minute, second, microsecond", seperated by comma or space. Then the result of each record would be an array of int64. ## M7: Indexing Support Expressions without interval arithmetic can be accelerated using an STL-SORT index. However, expressions that include interval arithmetic cannot be indexed. This is because the result of an interval calculation depends on the specific timestamp value. For example, adding one month to a date in February results in a different number of added days than adding one month to a date in March. --- After this PR, the input / output type of timestamptz would be iso string. Timestampz would be stored as timestamptz data, which is int64_t finally. > for more information, see https://en.wikipedia.org/wiki/ISO_8601 --------- Signed-off-by: xtx <xtianx@smail.nju.edu.cn>	2025-09-23 10:24:12 +08:00
Zhen Ye	c171280f63	enhance: support replicate message in wal. (#44456 ) issue: #44123 - support replicate message in wal of milvus. - support CDC-replicate recovery from wal. - fix some CDC replicator bugs Signed-off-by: chyezh <chyezh@outlook.com>	2025-09-22 17:06:11 +08:00
Bingyi Sun	94d53a5ac6	feat: encode cluster id in auto id (#44471 ) https://github.com/milvus-io/milvus/issues/44326 prev: [physical_ts][logical_ts] after [sign_bit][cluster_id][physical_ts][logical_ts] --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-09-22 10:40:02 +08:00
Gao	d3784c6515	enhance: add storage resource usage for vector search (#44308 ) issue: #44212 Implement search/query storage usage statistics in go side(result reduce), only record storage usage in vector search C++ path. Need to be implemented in query c++ path in next prs. --------- Signed-off-by: chasingegg <chao.gao@zilliz.com> Signed-off-by: marcelo.chen <marcelo.chen@zilliz.com> Co-authored-by: marcelo.chen <marcelo.chen@zilliz.com>	2025-09-19 20:20:02 +08:00

1 2 3 4 5 ...

1906 Commits