milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2025-12-28 22:45:26 +08:00

Author	SHA1	Message	Date
Feilong Hou	a7eb327746	test: fix unstable timestamptz test cases (#46403 ) Issue: #46333 test: re-write convert timestamp logic to cover daylight saving time Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-12-17 21:13:16 +08:00
aoiasd	df80f54151	feat: support use user's file as dictionary for analyzer filter (#46145 ) relate: https://github.com/milvus-io/milvus/issues/43687 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-12-16 11:45:16 +08:00
Feilong Hou	971085b033	test: enable debug_mode to observe test case instability. (#46341 ) Issue: #46333 Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-12-15 17:55:16 +08:00
zhuwenxing	75d6f0d509	test: add ST_ISVALID geometry function test cases (#46232 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-12-11 13:47:21 +08:00
Buqian Zheng	85a7a7b1e3	fix: skip json path index if the query path includes number (#46200 ) issue: #45511 our tantivy inverted index currently does not include item index if the value is an array, thus we can't do `a[0] == 'b'` type of look up in the inverted index. for such, we need to skip the index and use brute force search. we may improve our index in the future, so this is a temp solution Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-12-10 13:59:13 +08:00
zhuwenxing	4fe41ff14d	test: add emb list recall test (#46135 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-12-08 19:21:13 +08:00
nico	43fe215787	test: update sdk version and skip some debug log (#46040 ) Signed-off-by: nico <cheng.yuan@zilliz.com>	2025-12-04 10:33:11 +08:00
wei liu	f85e86a6ec	fix: change upsert duplicate PK behavior from dedup to error (#45997 ) issue: #44320 Replace the DeduplicateFieldData function with CheckDuplicatePkExist that returns an error when duplicate primary keys are detected in the same batch, instead of silently deduplicating. Changes: - Replace DeduplicateFieldData with CheckDuplicatePkExist in util.go - Update upsertTask.PreExecute to return error on duplicate PKs - Simplify helper function from findLastOccurrenceIndices to hasDuplicates - Update unit tests to verify the new error behavior - Add Python integration tests for duplicate PK error cases Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-12-04 10:23:11 +08:00
zhuwenxing	256e073e8d	test: add more testcases for geo and struct (#45414 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-11-25 10:51:06 +08:00
Feilong Hou	228eb0f5d0	test: add more test cases and add bulk insert scenario (#45770 ) Issue: #45756 1. add bulk insert scenario 2. fix small issue in e2e cases 3. add search group by test case 4. add timestampstz to gen_all_datatype_collection_schema 5. modify partial update testcase to ensure correct result from timestamptz field On branch feature/timestamps Changes to be committed: modified: common/bulk_insert_data.py modified: common/common_func.py modified: common/common_type.py modified: milvus_client/test_milvus_client_partial_update.py modified: milvus_client/test_milvus_client_timestamptz.py modified: pytest.ini modified: testcases/test_bulk_insert.py Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-11-24 15:21:06 +08:00
Feilong Hou	0231a3edf8	test: enable all timestamptz case (#45128 ) Issue: #44518 --------- Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-11-21 11:03:06 +08:00
qixuan	3202847092	test: add field case about dynamic and compaction (#45694 ) related issue: #42126 Signed-off-by: qixuan <673771573@qq.com>	2025-11-21 10:07:05 +08:00
zhuwenxing	e0df44481d	test: refactor checker to using milvus client (#45524 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-11-20 11:59:08 +08:00
Bingyi Sun	1ba75eea62	enhance: skip test_milvus_client_search_json_path_index_default (#45604 ) To prevent this issue from blocking other PRs, we are temporarily disabling this test. A proper fix will be implemented before the 2.6.6 release. issue: https://github.com/milvus-io/milvus/issues/45511 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-11-18 10:54:09 +08:00
wei liu	7aed88113c	enhance: Deduplicate primary keys in upsert request batch (#45249 ) issue: #44320 This change adds deduplication logic to handle duplicate primary keys within a single upsert batch, keeping the last occurrence of each primary key. Key changes: - Add DeduplicateFieldData function to remove duplicate PKs from field data, supporting both Int64 and VarChar primary keys - Refactor fillFieldPropertiesBySchema into two separate functions: validateFieldDataColumns for validation and fillFieldPropertiesOnly for property filling, improving code clarity and reusability - Integrate deduplication logic in upsertTask.PreExecute to automatically deduplicate data before processing - Add comprehensive unit tests for deduplication with various PK types (Int64, VarChar) and field types (scalar, vector) - Add Python integration tests to verify end-to-end behavior --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-11-17 21:35:40 +08:00
Zhen Ye	4797bb6ab2	fix: wrong update timetick of collection meta info (#45461 ) issue: #45403, #45463 - fix the Nightly E2E failures. - fix the wrong update timetick of altering collection to fix the related load failure. Signed-off-by: chyezh <chyezh@outlook.com>	2025-11-11 16:01:36 +08:00
zhuwenxing	b1af0df9f3	test: add struct array mmap testcases (#45309 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-11-10 16:49:36 +08:00
zhenshan.cao	6327c9a514	fix: Fix bugs related to TimestampTz (#45111 ) issue: https://github.com/milvus-io/milvus/issues/44527 https://github.com/milvus-io/milvus/issues/44537 https://github.com/milvus-io/milvus/issues/44538 https://github.com/milvus-io/milvus/issues/44585 https://github.com/milvus-io/milvus/issues/44622 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2025-11-04 16:51:33 +08:00
Feilong Hou	9e4975bdfa	test: added test case for partial update on duplicate pk (#45130 ) Issue: #45129 <test>: <add new test case> <also delete duplicate test case> On branch feature/partial-update Changes to be committed: modified: milvus_client/test_milvus_client_partial_update.py modified: milvus_client/test_milvus_client_upsert.py --------- Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-11-04 15:47:32 +08:00
zhuwenxing	434e0847fd	test: remove xfail after fix (#45114 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-11-03 17:21:37 +08:00
zhuwenxing	a03c398986	test: add import case for struct array (#45146 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-11-03 17:19:39 +08:00
Feilong Hou	7aa56e1fb6	test: change test_milvus_client_search_json_path_index_all_expressions to L1 (#44986 ) Issue: #44989 On branch feature/json-shredding Changes to be committed: modified: milvus_client/test_milvus_client_query.py Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-10-23 16:14:05 +08:00
Spade A	6077178553	enhance: enable STL_SORT to support VARCHAR (#44401 ) issue: https://github.com/milvus-io/milvus/issues/44399 This PR implements STL_SORT for VARCHAR data type for both RAM and MMAP mode. The general idea is that we deduplicate field values and maintains a posting list for each unique value. The serialization format of the index is: ``` [unique_count][string_offsets][string_data][post_list_offsets][post_list_data][magic_code] string_offsets: array of offsets into string_data section string_data: str_len1, str1, str_len2, str2, ... post_list_offsets: array of offsets into post_list_data section post_list_data: post_list_len1, row_id1, row_id2, ..., post_list_len2, row_id1, row_id2, ... ``` --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-10-23 11:00:05 +08:00
zhuwenxing	b497dd0b45	test: add geometry datatype testcases (#44646 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-10-21 19:56:03 +08:00
zhuwenxing	2f4b66d9ab	test: add struct array testcases (#44940 ) /kind improvement Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-10-20 17:34:03 +08:00
nico	a4935d2eaa	test: update rba test cases 2 (#44954 ) Signed-off-by: nico <cheng.yuan@zilliz.com>	2025-10-20 16:32:03 +08:00
Feilong Hou	16ff5db79d	test: Add e2e case for timestamptz (currently skipping them) (#44871 ) Issue: #44518 On branch feature/timestamps Changes to be committed: modified: common/common_func.py new file: milvus_client/test_milvus_client_timestamptz.py --------- Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-10-20 10:04:02 +08:00
zhagnlu	b7935557e1	fix:unified json exists path semantic (#44916 ) #44927 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-10-17 16:40:02 +08:00
nico	9f2937fd0f	test: updatec rba test cases (#44863 ) Signed-off-by: nico <cheng.yuan@zilliz.com>	2025-10-17 15:14:02 +08:00
yanliang567	9e5f9277c0	test: Split insert file and add test for allow insert auto id (#44801 ) related issue: #44425 1. split insert.py into a few files: upsert.py, insert.py, partial_upsert.py ... 2. add test for allow insert auto id --------- Signed-off-by: yanliang567 <yanliang.qiao@zilliz.com>	2025-10-14 14:28:00 +08:00
zhagnlu	2f178f810f	fix:fix json_contains(path, int) bug (#44814 ) #44816 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-10-14 00:19:59 +08:00
Spade A	208481a070	feat: impl StructArray -- support same names in different STRUCT (#44557 ) ref: https://github.com/milvus-io/milvus/issues/42148 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-10-10 15:53:56 +08:00
nico	e5378a64bc	test: update test cases (#44651 ) Signed-off-by: nico <cheng.yuan@zilliz.com>	2025-10-10 11:35:56 +08:00
yanliang567	c9f01a73cc	test:Skip unstable test (#44649 ) related issue: #44620 Signed-off-by: yanliang567 <yanliang.qiao@zilliz.com>	2025-09-30 16:47:52 +08:00
congqixia	31670c5489	enhance: Use dbName in error message (#44618 ) The collection not found err could contains db id in err message, which is not meaningful to users. This patch make error message wrapping dbname instead of db id. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-09-30 12:25:05 +08:00
nico	bc170201dd	test: update test case (#44503 ) Signed-off-by: nico <cheng.yuan@zilliz.com>	2025-09-24 10:54:03 +08:00
Tianx	2c0c5ef41e	feat: timestamptz expression & index & timezone (#44080 ) issue: https://github.com/milvus-io/milvus/issues/27467 >My plan is as follows. >- [x] M1 Create collection with timestamptz field >- [x] M2 Insert timestamptz field data >- [x] M3 Retrieve timestamptz field data >- [x] M4 Implement handoff >- [x] M5 Implement compare operator >- [x] M6 Implement extract operator >- [x] M8 Support database/collection level default timezone >- [x] M7 Support STL-SORT index for datatype timestamptz --- The third PR of issue: https://github.com/milvus-io/milvus/issues/27467, which completes M5, M6, M7, M8 described above. ## M8 Default Timezone We will be able to use alter_collection() and alter_database() in a future Python SDK release to modify the default timezone at the collection or database level. For insert requests, the timezone will be resolved using the following order of precedence: String Literal-> Collection Default -> Database Default. For retrieval requests, the timezone will be resolved in this order: Query Parameters -> Collection Default -> Database Default. In both cases, the final fallback timezone is UTC. ## M5: Comparison Operators We can now use the following expression format to filter on the timestamptz field: - `timestamptz_field [+/- INTERVAL 'interval_string'] {comparison_op} ISO 'iso_string' ` - The interval_string follows the ISO 8601 duration format, for example: P1Y2M3DT1H2M3S. - The iso_string follows the ISO 8601 timestamp format, for example: 2025-01-03T00:00:00+08:00. - Example expressions: "tsz + INTERVAL 'P0D' != ISO '2025-01-03T00:00:00+08:00'" or "tsz != ISO '2025-01-03T00:00:00+08:00'". ## M6: Extract We will be able to extract sepecific time filed by kwargs in a future Python SDK release. The key is `time_fields`, and value should be one or more of "year, month, day, hour, minute, second, microsecond", seperated by comma or space. Then the result of each record would be an array of int64. ## M7: Indexing Support Expressions without interval arithmetic can be accelerated using an STL-SORT index. However, expressions that include interval arithmetic cannot be indexed. This is because the result of an interval calculation depends on the specific timestamp value. For example, adding one month to a date in February results in a different number of added days than adding one month to a date in March. --- After this PR, the input / output type of timestamptz would be iso string. Timestampz would be stored as timestamptz data, which is int64_t finally. > for more information, see https://en.wikipedia.org/wiki/ISO_8601 --------- Signed-off-by: xtx <xtianx@smail.nju.edu.cn>	2025-09-23 10:24:12 +08:00
jiaqizho	338ed2fed4	enhance: Introduce sparse filter in query (#44347 ) issue: #44373 The current commit implements sparse filtering in query tasks using the statistical information (Bloom filter/MinMax) of the Primary Key (PK). The statistical information of the PK is bound to the segment during the segment loading phase. A new filter has been added to the segment filter to enable the sparse filtering functionality. Signed-off-by: jiaqizho <jiaqi.zhou@zilliz.com>	2025-09-23 09:58:09 +08:00
Feilong Hou	f9afde23d1	test: Add New Test Cases for Partial Update (#44483 ) Issue: #43872 <fix>: <fix after accidental force pull> Changes to be committed: modified: chaos/checker.py modified: chaos/testcases/test_single_request_operation.py modified: common/common_func.py modified: common/common_type.py modified: milvus_client/test_milvus_client_insert.py This includes e2e cases and chaos checker. All the cases are currently skipped due to partial update feature not ready. 1. test_milvus_client_partial_update_insert_delete_upsert_with_flush(): insert -> delete -> flush -> query -> upsert -> flush -> query 2. test_milvus_client_partial_update_insert_upsert_delete_upsert_flush(): insert -> upsert -> delete -> upsert -> flush -> query 3. test_milvus_client_partial_update_insert_upsert_flush_delete_upsert_flush(): insert -> upsert -> flush -> delete -> upsert -> flush -> query Also update requirements.txt to use latest pymilvus version --------- Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-09-23 09:06:12 +08:00
XuanYang-cn	ad86292bed	fix: Deny RenameCollection when db enabled encryption (#44225 ) Also fix the validation of enabling encyption when create db See also: #44217, #44218 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-09-16 10:08:07 +08:00
qixuan	9228ed7b8f	test: add case about enable dynamic schema (#44355 ) related issue: #42126 Signed-off-by: qixuan <673771573@qq.com>	2025-09-13 19:55:57 +08:00
9Eurydice9	662397e487	test: migrate test_query V2 cases for milvus client (#44238 ) issue: #44090 Migrate test_query cases from TestcaseBase to TestMilvusClientV2Base --------- Signed-off-by: Orpheus Wang <orpheus.wang@zilliz.com>	2025-09-12 09:31:57 +08:00
Feilong Hou	6884cdbe90	test: add complex json expression test (#44211 ) <test>: <add test case for complex json expression On branch feature/json-shredding Changes to be committed: modified: milvus_client/expressions/README.md modified: milvus_client/expressions/test_milvus_client_scalar_filtering.py --------- Signed-off-by: Eric Hou <eric.hou@zilliz.com> Co-authored-by: Eric Hou <eric.hou@zilliz.com>	2025-09-11 19:57:58 +08:00
nico	cc7a6d3ec6	test: update nightly case (#44248 ) Signed-off-by: nico <cheng.yuan@zilliz.com>	2025-09-10 10:31:56 +08:00
9Eurydice9	bfd42c9f9e	test: migrate test_query V2 cases for milvus client (#44179 ) issue: #44090 Migrate test_query cases from TestcaseBase to TestMilvusClientV2Base --------- Signed-off-by: Orpheus Wang <orpheus.wang@zilliz.com>	2025-09-04 13:07:53 +08:00
zhuwenxing	4f1ea8d4cb	test: add cohere, voyageai and siliconflow reranker testcases and some lints (#44181 ) /kind improvement --------- Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2025-09-03 15:51:54 +08:00
Bingyi Sun	0c0630cc38	feat: support dropping index without releasing collection (#42941 ) issue: #42942 This pr includes the following changes: 1. Added checks for index checker in querycoord to generate drop index tasks 2. Added drop index interface to querynode 3. To avoid search failure after dropping the index, the querynode allows the use of lazy mode (warmup=disable) to load raw data even when indexes contain raw data. 4. In segcore, loading the index no longer deletes raw data; instead, it evicts it. 5. In expr, the index is pinned to prevent concurrent errors. --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-09-02 16:17:52 +08:00
nico	198cc62039	test: update nightly cases (#44167 ) Signed-off-by: nico <cheng.yuan@zilliz.com>	2025-09-02 14:21:52 +08:00
yanliang567	a7087b0023	test: Add more ngram tests, including mmap and utf8 characters (#44169 ) related issue: #43989 Signed-off-by: yanliang567 <yanliang.qiao@zilliz.com>	2025-09-02 14:17:52 +08:00
qixuan	e126df2330	test: add field case about reranker and analyzer (#44095 ) related issue: #42126 --------- Signed-off-by: qixuan <673771573@qq.com>	2025-09-01 17:27:52 +08:00

1 2 3

143 Commits