milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-07 19:31:51 +08:00

Author	SHA1	Message	Date
Bingyi Sun	24e24caf14	fix: Remove cached null expr result (#42818 ) issue: #42698 cached result may be changed in caller so there is no need to cache it Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-23 10:44:40 +08:00
zhagnlu	9c31a47c0f	fix:fix arith mod bug for big int (#42699 ) #42624 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-17 10:36:38 +08:00
Spade A	911a8df17c	feat: impl StructArray -- data storage support in segcore (#42406 ) Ref https://github.com/milvus-io/milvus/issues/42148 This PR mainly enables segcore to support array of vector (read and write, but not indexing). Now only float vector as the element type is supported. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>	2025-06-12 14:38:35 +08:00
Bingyi Sun	6c16d3dbee	enhance: Add bulk api for json data (#42407 ) issue: https://github.com/milvus-io/milvus/issues/42409 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-12 10:40:39 +08:00
Bingyi Sun	fbf5cb4e62	feat: Add json flat index (#39917 ) issue: https://github.com/milvus-io/milvus/issues/35528 This PR introduces a JSON flat index that allows indexing JSON fields and dynamic fields in the same way as other field types. In a previous PR (#36750), we implemented a JSON index that requires specifying a JSON path and casting a type. The only distinction lies in the json_cast_type parameter. When json_cast_type is set to JSON type, Milvus automatically creates a JSON flat index. For details on how Tantivy interprets JSON data, refer to the [tantivy documentation](https://github.com/quickwit-oss/tantivy/blob/main/doc/src/json.md#pitfalls-limitation-and-corner-cases). Limitations Array handling: Arrays do not function as nested objects. See the [limitations section](https://github.com/quickwit-oss/tantivy/blob/main/doc/src/json.md#arrays-do-not-work-like-nested-object) for more details. --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-10 19:14:35 +08:00
zhagnlu	2861096734	fix: Add explicit move semantics to get_batch_view interface (#42403 ) #42401 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-06-10 13:06:35 +08:00
Bingyi Sun	6404e02d99	fix: Check cast type is array for json contains expr (#42184 ) issue: https://github.com/milvus-io/milvus/issues/42181 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-06-09 17:04:33 +08:00
Buqian Zheng	cae0091071	feat: make SkipIndex lazyload (#41826 ) issue: https://github.com/milvus-io/milvus/issues/41435 this PR also: 1. fixed the skip index for VARCHAR. before this PR, skip index of VARCHAR uses the minmax of the entire column as the minmax of chunk 0, and provides no minmax for other chunks. 2. refactored some skip index loading related code 3. partly fixed a bug in test_expr.cpp --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-05-15 01:30:23 +08:00
cai.zhang	4ead8caaba	fix: prevent crash when contains_all/any is used with empty array (#41739 ) issue: #41348 related and optimized by #41347 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Co-authored-by: Sangho Park <hoyaspark@gmail.com>	2025-05-14 14:32:22 +08:00
Xianhui Lin	26cbc74478	fix: support infix and suffix match types in JsonStats (#41720 ) fix: support infix and suffix match types in JsonStats issue:https://github.com/milvus-io/milvus/issues/41386 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-05-09 10:42:53 +08:00
zhagnlu	e3c81ba1cc	enhance: use scan mode for like although inverted index exists (#41325 ) #41065 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-05-09 10:36:54 +08:00
zhagnlu	39e7ad33d7	enhance: add optimize for like expr (#41066 ) #41065 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-05-08 14:28:52 +08:00
foxspy	e2ddbe4962	feat: add cachinglayer to index (#41653 ) issue: #41435 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2025-05-08 10:12:54 +08:00
Bingyi Sun	4c08090687	feat: Add json index support for json contains expr (#41478 ) issue: #35528 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-05-06 11:44:52 +08:00
Buqian Zheng	3de904c7ea	feat: add cachinglayer to sealed segment (#41436 ) issue: https://github.com/milvus-io/milvus/issues/41435 --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-04-28 10:52:40 +08:00
Chun Han	12cde913b5	fix: fail to get string views due to chunk bound empty loop(#41300 ) (#41452 ) related: #41300 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-04-27 10:40:38 +08:00
Xianhui Lin	3d4889586d	fix: JsonStats filter by conjunctExpr and improve the task slot calculation logic (#41459 ) Optimized JSON filter execution by introducing ProcessJsonStatsChunkPos() for unified position calculation and GetNextBatchSize() for better batch processing. Improved JSON key generation by replacing manual path joining with milvus::Json::pointer() and adjusted slot size calculation for JSON key index jobs. Updated the task slot calculation logic in calculateStatsTaskSlot() to handle the increased resource needs of JSON key index jobs. issue: https://github.com/milvus-io/milvus/issues/41378 https://github.com/milvus-io/milvus/issues/41218 --------- Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-04-23 16:30:37 +08:00
Bingyi Sun	a953eaeaf0	enhance: support binary range expression for json path index (#41025 ) issue: #35528 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-04-15 19:32:33 +08:00
Bingyi Sun	bf617115ca	enhance: Remove single chunk segment related codes (#39249 ) https://github.com/milvus-io/milvus/issues/39112 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-04-11 18:56:29 +08:00
Spade A	9ce3e3cb44	enhance: add documents in batch for json key stats (#41228 ) issue: https://github.com/milvus-io/milvus/issues/40897 After this, the document add operations scheduling duration is decreased roughly from 6s to 0.9s for the case in the issue. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-04-11 14:08:26 +08:00
Xianhui Lin	3bc24c264f	enhance: Add json key inverted index in stats for optimization (#38039 ) Add json key inverted index in stats for optimization https://github.com/milvus-io/milvus/issues/36995 --------- Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-04-10 15:20:28 +08:00
zhagnlu	ee1faf80dd	fix:add clear bitmap for batch skip mode (#41166 ) #41086 #41150 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-04-09 13:08:27 +08:00
congqixia	e2d8adb963	fix: Use element_type for Array is null operator (#41157 ) Related to #41156 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-04-09 10:16:24 +08:00
Bingyi Sun	da21640ac3	fix: Fix the bug that null data can not be filtered by null expr (#41124 ) issue: https://github.com/milvus-io/milvus/issues/41063 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-04-08 19:12:24 +08:00
Bingyi Sun	fcb03b5bd1	feat: add json null/exists expression (#41004 ) issue: #35528 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-04-03 17:48:21 +08:00
Chun Han	afa519b4c7	fix: array is null failed(#40686 ) (#41027 ) related: #40686 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-04-02 18:20:22 +08:00
Bingyi Sun	15ec7bae4d	fix: Fix using json index when iterative_filter is specified (#40945 ) issue: #40934 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-03-31 15:26:19 +08:00
zhagnlu	87e7d6d79f	fix:fix exception when do arith expr with using index (#40794 ) #40783 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-03-27 11:10:21 +08:00
zhagnlu	7fdb2e144f	enhance:change multi or expr to in expr (#40757 ) #40752 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-03-25 11:06:18 +08:00
zhagnlu	6c55db44f1	enhance: reorder sub expr for conjunct expr (#39872 ) two point: (1) reoder conjucts expr's subexpr, postpone heavy operations sequence: int(column) -> index(column) -> string(column) -> light conjuct ...... -> json(column) -> heavy conjuct -> two_column_compare (2) support pre filter for expr execute, skip scan raw data that had been skipped because of preceding expr result. #39869 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2025-03-19 14:50:14 +08:00
Bingyi Sun	d3adab15ac	fix: Build double index for all json numeric field (#40619 ) issue: #35528 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-03-14 16:52:11 +08:00
Bingyi Sun	7040ba1c12	enhance: make json path index support term filter (#40140 ) issue: #35528 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-03-04 11:56:02 +08:00
Chun Han	259f9106ad	enhance: refine variable-length-type memory usage(#38736 ) (#39578 ) related: #38736 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-02-27 21:13:58 +08:00
Bingyi Sun	f05e9628f6	fix: Fix search failure of null expression (#40129 ) issue: #40095 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-02-25 20:43:55 +08:00
Bingyi Sun	db4769281c	fix: Fall back to a brute-force search if json index type unmatched (#40076 ) issue: https://github.com/milvus-io/milvus/issues/35528 If the query data type does not match the index type, fall back to a brute-force search --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-02-24 16:25:57 +08:00
congqixia	7ccde3300e	fix: Use `text_log` prefix for TextMatchIndex null offset file (#39935 ) Related to #39933 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-17 20:17:25 +08:00
zhagnlu	8a9f02ef71	enhance: optimize expr performace for some points (#39695 ) 1. skip get expr arguments which deserialize proto for every batch execute. 2. replace unordered_set with sort array that has better performace for small set. #39688 Co-authored-by: luzhang <luzhang@zilliz.com>	2025-02-16 20:32:14 +08:00
Bingyi Sun	b59555057d	feat: support json index (#36750 ) https://github.com/milvus-io/milvus/issues/35528 This PR adds json index support for json and dynamic fields. Now you can only do unary query like 'a["b"] > 1' using this index. We will support more filter type later. basic usage: ``` collection.create_index("json_field", {"index_type": "INVERTED", "params": {"json_cast_type": DataType.STRING, "json_path": 'json_field["a"]["b"]'}}) ``` There are some limits to use this index: 1. If a record does not have the json path you specify, it will be ignored and there will not be an error. 2. If a value of the json path fails to be cast to the type you specify, it will be ignored and there will not be an error. 3. A specific json path can have only one json index. 4. If you try to create more than one json indexes for one json field, sdk(pymilvus<=2.4.7) may return immediately because of internal implementation. This will be fixed in a later version. --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2025-02-15 14:06:15 +08:00
Spade A	f7d9587720	enhance: add tantivy collector for i64 (#39850 ) issue: #39852 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-02-14 15:50:15 +08:00
cai.zhang	9e6e477c5d	fix: Fix modulo for long type (#39722 ) issue: #39640 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-02-11 20:04:46 +08:00
Spade A	0461ddf776	fix: phrase match does not support offset input (#39338 ) fix: #39337 Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>	2025-01-16 22:05:01 +08:00
Spade A	032292a432	feat: support phrase match query (#38869 ) The relevant issue: https://github.com/milvus-io/milvus/issues/38930 --------- Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>	2025-01-12 20:24:58 +08:00
congqixia	182cac03e5	enhance: Use bitset or instead of bitwise set (#39037 ) Related to #39003 Copying bitset value bit by bit is slow and CPU heavy, this PR utilizes bitset operator "\|=" to accelerate this procedure Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-01-07 15:02:56 +08:00
smellthemoon	907fc24f85	enhance: support null expr (#38772 ) #31728 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2025-01-02 14:16:54 +08:00
Ted Xu	acc8fb7af6	enhance: eliminate compile warnings (part2) (#38535 ) See #38435 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-12-25 15:30:50 +08:00
Ted Xu	4919ccf543	enhance: eliminate compile warnings (#38420 ) See: #38435 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-12-16 09:58:43 +08:00
Gao	994fc544e7	enhance: support iterative filter execution (#37363 ) issue: #37360 --------- Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-12-11 11:32:44 +08:00
Xianhui Lin	6d0a4fdb31	fix: Fix bug for Search fails with filter expression contains underscore (#38085 ) Enhance the matching for elements within the UnaryRangeArray https://github.com/milvus-io/milvus/issues/38068 --------- Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2024-12-05 10:18:40 +08:00
Bingyi Sun	90064cd47b	fix: Fix variable redeclaration in term filter (#38045 ) https://github.com/milvus-io/milvus/issues/38046 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-12-02 15:10:38 +08:00
Bingyi Sun	d1596297d9	fix: Fix query failure with inverted index (#37686 ) https://github.com/milvus-io/milvus/issues/37649 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-11-15 10:28:31 +08:00

1 2

94 Commits