milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2025-12-07 01:28:27 +08:00

Author	SHA1	Message	Date
congqixia	07bca45376	fix: [2.6] Pass fs via `FileManagerContext` when loading index (#44734 ) Cherry-pick from master pr: #44733 Related to #44615 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-11 09:57:57 +08:00
Gao	d3784c6515	enhance: add storage resource usage for vector search (#44308 ) issue: #44212 Implement search/query storage usage statistics in go side(result reduce), only record storage usage in vector search C++ path. Need to be implemented in query c++ path in next prs. --------- Signed-off-by: chasingegg <chao.gao@zilliz.com> Signed-off-by: marcelo.chen <marcelo.chen@zilliz.com> Co-authored-by: marcelo.chen <marcelo.chen@zilliz.com>	2025-09-19 20:20:02 +08:00
Spade A	ba4cd68edb	fix: adjust params to make CPP UT run faster (#44223 ) fix: https://github.com/milvus-io/milvus/issues/44224 --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-09-06 14:13:54 +08:00
congqixia	e3b3502287	fix: Use correct regex for cppcheck (#44077 ) Related to #44076 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-08-27 20:57:50 +08:00
marcelo-cjl	e13e19cd2c	enhance: add sparse_u32_f32 data type for sparse vector (#43974 ) issue: #43973 Signed-off-by: marcelo.chen <marcelo.chen@zilliz.com>	2025-08-27 16:47:50 +08:00
Gao	e97a618630	enhance: support readAt interface for remote input stream (#43997 ) #42032 Also, fix the cacheoptfield method to work in storagev2. Also, change the sparse related interface for knowhere version bump #43974 . Also, includes https://github.com/milvus-io/milvus/pull/44046 for metric lost. --------- Signed-off-by: chasingegg <chao.gao@zilliz.com> Signed-off-by: marcelo.chen <marcelo.chen@zilliz.com> Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> Co-authored-by: marcelo.chen <marcelo.chen@zilliz.com> Co-authored-by: Congqi Xia <congqi.xia@zilliz.com>	2025-08-26 11:19:58 +08:00
Spade A	d6a428e880	feat: impl StructArray -- support create index for vector array (embedding list) and search on it (#43726 ) Ref https://github.com/milvus-io/milvus/issues/42148 This PR supports create index for vector array (now, only for `DataType.FLOAT_VECTOR`) and search on it. The index type supported in this PR is `EMB_LIST_HNSW` and the metric type is `MAX_SIM` only. The way to use it: ```python milvus_client = MilvusClient("xxx:19530") schema = milvus_client.create_schema(enable_dynamic_field=True, auto_id=True) ... struct_schema = milvus_client.create_struct_array_field_schema("struct_array_field") ... struct_schema.add_field("struct_float_vec", DataType.ARRAY_OF_VECTOR, element_type=DataType.FLOAT_VECTOR, dim=128, max_capacity=1000) ... schema.add_struct_array_field(struct_schema) index_params = milvus_client.prepare_index_params() index_params.add_index(field_name="struct_float_vec", index_type="EMB_LIST_HNSW", metric_type="MAX_SIM", index_params={"nlist": 128}) ... milvus_client.create_index(COLLECTION_NAME, schema=schema, index_params=index_params) ``` Note: This PR uses `Lims` to convey offsets of the vector array to knowhere where vectors of multiple vector arrays are concatenated and we need offsets to specify which vectors belong to which vector array. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com>	2025-08-20 10:27:46 +08:00
Chun Han	001619aef9	feat: supporting load priority for loading (#42413 ) related: #40781 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2025-06-17 15:22:38 +08:00
Cai Yudong	64feeb0e2b	enhance: Rename API GenDataset to GenFieldData in unittest (#39386 ) Issue: #38666 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2025-01-17 15:55:03 +08:00
Zhen Ye	3e788f0fbd	enhance: record memory size (uncompressed) item for index (#38770 ) issue: #38715 - Current milvus use a serialized index size(compressed) for estimate resource for loading. - Add a new field `MemSize` (before compressing) for index to estimate resource. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-01-14 10:33:06 +08:00
Buqian Zheng	5e38f01e5b	enhance: update knowhere version (#39212 ) Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2025-01-14 10:21:05 +08:00
cqy123456	8216345b07	enhance: reduce copy of bitset and id conversion of brute-force search (#37675 ) issue: https://github.com/milvus-io/milvus/issues/37798 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-11-19 15:48:40 +08:00
aoiasd	e9391acf80	fix: bm25 brute force search need index params k1 and b (#37721 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-11-18 15:44:31 +08:00
Buqian Zheng	f4a91e135b	enhance: Allow empty sparse row (#34700 ) issue: #29419 * If a sparse vector with 0 non-zero value is inserted, no ANN search on this sparse vector field will return it as a result. User may retrieve this row via scalar query or ANN search on another vector field though. * If the user uses an empty sparse vector as the query vector for a ANN search, no neighbor will be returned. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-08-16 14:14:54 +08:00
zhenshan.cao	aa247f192d	enhance: remove unused code for StorageV2 (#35132 ) issue: https://github.com/milvus-io/milvus/issues/34168 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-08-01 12:08:13 +08:00
Chun Han	f00c529aea	feat: support group_size for search_group_by(#33544 ) (#33720 ) related: #33544 mainly changes in three aspects: 1. enable setting group_size for group by function 2. separate normal reduce and group by reduce 3. eliminate unnecessary padding in search result for reducing Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-12 10:17:36 +08:00
Cai Yudong	ad90360162	enhance: Update knowhere commit (#34223 ) Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-06-27 18:20:06 +08:00
Gao	0d20303e54	fix: fix binary vector data size (#33750 ) issue: https://github.com/milvus-io/milvus/issues/22837 - fix byte size wrong for binary vectors - fix the expect/actual error msg Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-06-18 21:39:59 +08:00
Buqian Zheng	c5918ffbdb	enhance: mark sparse inverted index as mmap-able (#33281 ) issue: #29419 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-05-23 14:11:42 +08:00
Buqian Zheng	bb7765cbd6	fix: fix Indexing.Iterator ut: build index with all data at once (#32844 ) issue: #32843 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-05-10 11:31:30 +08:00
Chun Han	01c2684355	enhance: [skip e2e] disable unstable ut temporarily (#32836 ) Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-05-08 12:17:29 +08:00
Chun Han	337cc0756d	fix: lack good results for insufficient ef(#29883 ) (#32080 ) related: #29883 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-04-13 22:13:23 +08:00
cqy123456	aba4993c6c	fix: fix some fp16/bf16 code miss in segcore. (#31771 ) issue：https://github.com/milvus-io/milvus/issues/22837 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-04-07 14:13:16 +08:00
Buqian Zheng	96cfae55a5	feat: [Sparse Float Vector] segcore to support sparse vector search and get raw vector by id (#30629 ) This PR adds the ability to search/get sparse float vectors in segcore, and added unit tests by modifying lots of existing tests into parameterized ones. https://github.com/milvus-io/milvus/issues/29419 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-12 09:16:30 -07:00
Buqian Zheng	070dfc77bf	feat: [Sparse Float Vector] segcore basics and index building (#30357 ) This commit adds sparse float vector support to segcore with the following: 1. data type enum declarations 2. Adds corresponding data structures for handling sparse float vectors in various scenarios, including: * FieldData as a bridge between the binlog and the in memory data structures * mmap::Column as the in memory representation of a sparse float vector column of a sealed segment; * ConcurrentVector as the in memory representation of a sparse float vector of a growing segment which supports inserts. 3. Adds logic in payload reader/writer to serialize/deserialize from/to binlog 4. Adds the ability to allow the index node to build sparse float vector index 5. Adds the ability to allow the query node to build growing index for growing segment and temp index for sealed segment without index built This commit also includes some code cleanness, comment improvement, and some unit tests for sparse vector. https://github.com/milvus-io/milvus/issues/29419 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-11 14:45:02 +08:00
Cai Yudong	8a219e0102	feat: Support knowhere trace using OpenTelemetry (#30750 ) Issue: #21508 Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-02-28 12:29:00 +08:00
Jiquan Long	e2330f02f8	fix: pattern match use incorrect raw data (#30764 ) issue: https://github.com/milvus-io/milvus/issues/30687 We store all the varchar datas in an continuous address and use string_view to quickly find them. In this case, using string_view.data() directly will point to all rest varchar datas. --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-22 19:56:52 +08:00
MrPresent-Han	77eb6defb1	feat: support groupby on growing and non-indexed sealed segment(#30307 ) (#30644 ) related: #30308 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-02-21 14:02:53 +08:00
Jiquan Long	a587450e56	enhance: [skip-e2e] disable asan (#30498 ) fix: #30511 /kind improvement --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-04 21:25:05 +08:00
Xu Tong	e429965f32	Add float16 approve for multi-type part (#28427 ) issue：https://github.com/milvus-io/milvus/issues/22837 Add bfloat16 vector, add the index part of float16 vector. Signed-off-by: Writer-X <1256866856@qq.com>	2024-01-11 15:48:51 +08:00
congqixia	d6429933a7	enhance: make Load process traceable in querynode & segcore (#29858 ) See also #29803 This PR: - Add trace span for `LoadIndex` & `LoadFieldData` in segment loader - Add `TraceCtx` parameter for `Index.Load` in segcore - Add span for ReadFiles & Engine Load for Memory/Disk Vector index --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-10 21:58:51 +08:00
Bingyi Sun	36f69ea031	feat: integrate storagev2 in building index of segcore (#28768 ) issue: https://github.com/milvus-io/milvus/issues/28655 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2023-12-05 16:48:54 +08:00
yah01	267c67dfee	enhance: reduce 1x copy while retrieving data from growing segment (#28323 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-11-10 15:44:22 +08:00
Gao	7a65b6fb85	Limit faiss ivf index build thread num and fix ut (#27567 ) Signed-off-by: chasingegg <chao.gao@zilliz.com>	2023-10-11 10:33:33 +08:00
foxspy	5db4a0489e	dynamic index version control (#27335 ) Co-authored-by: longjiquan <jiquan.long@zilliz.com>	2023-09-25 21:39:27 +08:00
foxspy	370b6fde58	milvus support multi index engine (#27178 ) Co-authored-by: longjiquan <jiquan.long@zilliz.com>	2023-09-22 09:59:26 +08:00
yah01	3d05ddf505	Reduce cpp test time (#27043 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-09-13 15:41:18 +08:00
Enwei Jiao	c3f15c6b95	Refactor duplicate error class into one place (#26985 ) Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-09-11 20:43:17 +08:00
xige-16	5b8d716cbc	Add ut for growing segment load binlog (#26268 ) Signed-off-by: xige-16 <xi.ge@zilliz.com>	2023-08-13 20:41:31 +08:00
zhagnlu	411f9ac823	Upgrade minio-go and add region and virtual host config for segcore chunk manager (#26194 ) Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-08-11 10:37:36 +08:00
yah01	300fef446b	Enable mmap for vector index (#25877 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-08-10 13:59:15 +08:00
yah01	9618bd9b42	Set channel capacity before consuming it (#25895 ) Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-07-26 17:35:01 +08:00
Cai Yudong	9a4761dcc7	Remove binary metrics TANIMOTO/SUPERSTRUCTURE/SUBSTRUCTURE (#25708 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2023-07-19 16:16:58 +08:00
xige-16	04082b3de2	Migrate the ability to upload and download binlog to cpp (#22984 ) Signed-off-by: xige-16 <xi.ge@zilliz.com>	2023-06-25 14:38:44 +08:00
cqy123456	a519213316	Update knowhere version, update diskann api and generate cache nodes in build process (#24898 ) Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2023-06-16 14:20:39 +08:00
yihao.dai	092d743917	Add support for getting vectors by ids (#23450 ) Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2023-04-23 09:00:32 +08:00
Cai Yudong	ef63e64ded	Remove ANNOY index type (#23189 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2023-04-04 16:30:27 +08:00
xige-16	9aa99aedbb	[Cherry-Pick] Remove arrow usage in FieldData (#22726 ) Signed-off-by: xige-16 <xi.ge@zilliz.com>	2023-03-20 10:41:56 +08:00
Jiquan Long	a36fefb009	Fix cpplint (#22657 ) Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2023-03-10 09:47:54 +08:00
smellthemoon	9e0ec15436	Support range search (#21652 ) Signed-off-by: smellthemoon <xinguo.li@zilliz.com> Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: jaime <yun.zhang@zilliz.com>	2023-02-21 09:48:32 +08:00


102 Commits