milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2025-12-07 01:28:27 +08:00

Author	SHA1	Message	Date
Spade A	08142a4854	fix: fix false negative panic on missing fields (#45902 ) issue: https://github.com/milvus-io/milvus/issues/45834 Signed-off-by: SpadeA <tangchenjie1210@gmail.com>	2025-11-28 18:05:08 +08:00
aoiasd	cfeb095ad7	enhance: forbid build analyzer at proxy (#44067 ) relate: https://github.com/milvus-io/milvus/issues/43687 We used to run the temporary analyzer and validate analyzer on the proxy, but the proxy should not be a computation-heavy node. This PR moves all analyzer calculations to the streaming node. --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-10-23 10:58:12 +08:00
congqixia	8f16afd5e7	fix: Handle JSON field default values in storage layer (#44999 ) Related to #44995 Added missing case for JSON data type in GetDefaultValue function to properly retrieve default values for JSON fields. This prevents crashes when enabling dynamic fields with default values during concurrent insert operations. Changes: - Added JSON data type case in GetDefaultValue to return BytesData - Added comprehensive tests for fillMissingFields covering JSON and other data types with default values - Added tests for nullable fields, required fields validation, and edge cases Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-21 15:14:03 +08:00
cai.zhang	19346fa389	feat: Geospatial Data Type and GIS Function support for milvus (#44547 ) issue: #43427 The main goal of this PR is to merge #37417 into Milvus 2.5 without conflicts. # Main Goals 1. Create and describe collections with geospatial type 2. Insert geospatial data into the insert binlog 3. Load segments containing geospatial data into memory 4. Enable query and search to display geospatial data 5. Support using GIS functions like ST_EQUALS in query 6. Support R-Tree index for geometry type # Solution 1. Add Type: Modify the Milvus core by adding a Geospatial type in both the C++ and Go code layers, defining the Geospatial data structure and the corresponding interfaces. 2. Dependency Libraries: Introduce necessary geospatial data processing libraries. In the C++ source code, use Conan package management to include the GDAL library. In the Go source code, add the go-geom library to the go.mod file. 3. Protocol Interface: Revise the Milvus protocol to provide mechanisms for Geospatial message serialization and deserialization. 4. Data Pipeline: Facilitate interaction between the client and proxy using the WKT format for geospatial data. The proxy will convert all data into WKB format for downstream processing, providing column data interfaces, segment encapsulation, segment loading, payload writing, and cache block management. 5. Query Operators: Implement simple display and support for filter queries. Initially, focus on filtering based on spatial relationships for a single column of geospatial literal values, providing parsing and execution for query expressions. Only brute-force search is supported for now. 7. Client Modification: Enable the client to handle user input for geospatial data and facilitate end-to-end testing. Check the modification in pymilvus. --------- Signed-off-by: Yinwei Li <yinwei.li@zilliz.com> Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Co-authored-by: ZhuXi <150327960+Yinwei-Yu@users.noreply.github.com>	2025-09-28 19:43:05 +08:00
congqixia	cbed31933a	fix: [AddField] Permit missing new nullable field in InsertMsg (#42684 ) Related to #41858 #41951 #42084 When the insert msg consumer (pipeline/flowgraph) has a newer schema than the insertMsg, it has to adapt the insert msg that used the old schema (missing the newly added field) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-13 13:52:35 +08:00
congqixia	118684afbb	enhance: [storageV2] Pass nullable converting insertMsg fieldData (#42584 ) Related to #39173 `nullable` flag is crucial for serde logic of v2 writer, missing this flag causes a logic bug for v2 nullable data. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-10 10:06:34 +08:00
congqixia	cb7f2fa6fd	enhance: Use v2 package name for pkg module (#39990 ) Related to #39095 https://go.dev/doc/modules/version-numbers Update pkg version according to golang dep version convention --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-22 23:15:58 +08:00
junjiejiangjjj	16cbdfb3b1	feat: Add Text Embedding Function (#36366 ) https://github.com/milvus-io/milvus/issues/35856 Signed-off-by: junjie.jiang <junjie.jiang@zilliz.com>	2025-01-24 14:23:06 +08:00
Cai Yudong	5bf1b2b929	feat: Support Int8Vector in go (#38990 ) Issue: #38666 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2025-01-14 20:43:06 +08:00
congqixia	b0bd290a6e	enhance: Use internal json(sonic) to replace std json lib (#37708 ) Related to #35020 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-11-18 10:46:31 +08:00
aoiasd	db34572c56	feat: support load and query with bm25 metric (#36071 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-10-11 10:23:20 +08:00
congqixia	de8a266d8a	enhance: Enable linux code checker (#35084 ) See also #34483 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-30 15:53:51 +08:00
wei liu	c45f38aa61	enhance: Update protobuf-go to protobuf-go v2 (#34394 ) issue: #34252 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-29 11:31:51 +08:00
smellthemoon	2a1356985d	enhance: support null in go payload (#32296 ) #31728 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-06-19 17:08:00 +08:00
shaoting-huang	0ecd694305	enhance: legacy code clean up (#33838 ) issue: #33839 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-06-14 14:25:56 +08:00
Cai Yudong	4fc7915c70	enhance: unify data generation test APIs (#32955 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-14 14:33:33 +08:00
Buqian Zheng	8a1017a152	enhance: add helpers to parse sparse float vector in JSON (#32543 ) issue: #29419 added helper functions to parse JSON representation of sparse float vectors, will be used by both the restful server and the import utils. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-04-25 14:47:24 +08:00
cqy123456	976928ecd1	fix: fix fp16/bf16 some code missing and add more fp16/bf16 test (#31612 ) issue: #31534 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-03-28 14:11:10 +08:00
SimFG	b1a1cca10b	feat: add more operation detail info for better allocation (#30438 ) issue: #30436 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-03-28 06:33:11 +08:00
Buqian Zheng	3c80083f51	feat: [Sparse Float Vector] add sparse vector support to milvus components (#30630 ) add sparse float vector support to different milvus components, including proxy, data node to receive and write sparse float vectors to binlog, query node to handle search requests, index node to build index for sparse float column, etc. https://github.com/milvus-io/milvus/issues/29419 --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-13 14:32:54 -07:00
Xu Tong	e429965f32	Add float16 approve for multi-type part (#28427 ) issue: https://github.com/milvus-io/milvus/issues/22837 Add bfloat16 vector, add the index part of float16 vector. Signed-off-by: Writer-X <1256866856@qq.com>	2024-01-11 15:48:51 +08:00
congqixia	f18a7191f2	enhance: make `ColumnBasedInsertMsgToInsertData` check field missing (#29758 ) fix: #29757 In previous code, `ColumnBasedInsertMsgToInsertData` adds empty field if the insertMsg parameter does not have the column schema defined. This may lead to unexpected behavior of caller functions. This PR: - Add column missing check - Add column length check - Generate BlobInfo for ColumnBasedInsertMsgToInsertData result --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-09 11:50:48 +08:00
Xu Tong	9166011c4a	Add float16 vector (#25852 ) Signed-off-by: Writer-X <1256866856@qq.com>	2023-09-08 10:03:16 +08:00
bjzhjing	548c82eca5	Refactor storage.MergeInsertData() to optimize the merging process (#26839 ) Benchmark Milvus with https://github.com/qdrant/vector-db-benchmark and specify the datasets as 'deep-image-96-angular'. Meanwhile, do perf profiling during 'upload + index' stage of vector-db-benchmark and see the following hot spots. 39.59%--github.com/milvus-io/milvus/internal/storage.MergeInsertData \| \|--21.43%--github.com/milvus-io/milvus/internal/storage.MergeFieldData \| \| \| \|--17.22%--runtime.memmove \| \| \| \|--1.53%--asm_exc_page_fault \| ...... \| \|--18.16%--runtime.memmove \| \|--1.66%--asm_exc_page_fault ...... The hot code path is in storage.MergeInsertData() which updates buffer.buffer by creating a new 'InsertData' instance and merging both the old buffer.buffer and addedBuffer into it. When it calls golang runtime.memmove to move buffer.buffer which is with big size (>1M), the hot spots appear. To avoid the above overhead, update storage.MergeInsertData() by appending addedBuffer to buffer.buffer, instead of moving buffer.buffer and addedBuffer to a new 'InsertData'. This change removes the hot spots 'runtime.memmove' from perf profiling output. Additionally, the 'upload + index' time, which is one performance metric of vector-db-benchmark, is reduced around 60% with this change. Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>	2023-09-05 21:41:48 +08:00
congqixia	41af0a98fa	Use go-api/v2 for milvus-proto (#24770 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-06-09 01:28:37 +08:00
yah01	ebd0279d3f	Check error by Error() and NoError() for better report message (#24736 ) Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-06-08 15:36:36 +08:00
congqixia	73a181d226	Fix get vector it timeout and improve some string const usage (#24141 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-05-16 17:41:22 +08:00
Enwei Jiao	967a97b9bd	Support json & array types (#23408 ) Signed-off-by: yah01 <yang.cen@zilliz.com> Co-authored-by: yah01 <yang.cen@zilliz.com>	2023-04-20 11:32:31 +08:00
jaime	c9d0c157ec	Move some modules from internal to public package (#22572 ) Signed-off-by: jaime <yun.zhang@zilliz.com>	2023-04-06 19:14:32 +08:00
congqixia	732986aa04	Remove fmt.Print from internal package (#22722 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-03-14 17:36:05 +08:00
jaime	d126f06946	Decouple mq module from internal proto definition (#22536 ) Signed-off-by: jaime <yun.zhang@zilliz.com>	2023-03-04 23:21:50 +08:00
Xiaofan	949d5d078f	Fix memory calculation in dataCodec (#21800 ) Signed-off-by: xiaofan-luan <xiaofan.luan@zilliz.com>	2023-01-28 11:09:52 +08:00
SimFG	a55f739608	Separate public proto files (#19782 ) Signed-off-by: SimFG <bang.fu@zilliz.com> Signed-off-by: SimFG <bang.fu@zilliz.com>	2022-10-16 20:49:27 +08:00
SimFG	d7f38a803d	Separate some proto files (#19218 ) Signed-off-by: SimFG <bang.fu@zilliz.com> Signed-off-by: SimFG <bang.fu@zilliz.com>	2022-09-16 16:56:49 +08:00
xige-16	4de1bfe5bc	Add cpp data codec (#18538 ) Signed-off-by: xige-16 <xi.ge@zilliz.com> Co-authored-by: zhagnlu lu.zhang@zilliz.com Signed-off-by: xige-16 <xi.ge@zilliz.com>	2022-09-09 22:12:34 +08:00
congqixia	68a6587374	Set insert&stats binlog timestamp range (#19005 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2022-09-04 09:05:09 +08:00
jaime	68b1b82faf	Remove DataKV interface (#16692 ) Signed-off-by: yun.zhang <yun.zhang@zilliz.com>	2022-04-28 21:03:47 +08:00
xige-16	205c92e54b	Support insert string data (#15993 ) Signed-off-by: xige-16 <xi.ge@zilliz.com>	2022-03-25 14:27:25 +08:00
Jiquan Long	3121619758	Chunk manager support scalar data (#16010 ) Signed-off-by: dragondriver <jiquan.long@zilliz.com>	2022-03-11 14:39:59 +08:00
Jiquan Long	f71651e294	Support column-based insert data in message stream (#15802 ) Signed-off-by: dragondriver <jiquan.long@zilliz.com>	2022-03-04 15:09:56 +08:00
Cai Yudong	92c8e32ebd	Let MemoryKV.Load return error when key not exist (#15814 ) Signed-off-by: yudong.cai <yudong.cai@zilliz.com>	2022-03-02 18:51:55 +08:00
XuanYang-cn	dd860a76cf	[skip e2e]Update license for storage util (#14453 ) Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2021-12-28 20:11:55 +08:00
godchen	9d5bcd3e3a	Close event and binlog reader (#12173 ) Signed-off-by: godchen <qingxiang.chen@zilliz.com>	2021-11-22 17:27:14 +08:00
bigsheeper	93149c5ad9	Load growing segment in query node (#11664 ) Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2021-11-12 18:27:10 +08:00
godchen	a0a3a889e7	Add common endian for global use (#11092 ) Signed-off-by: godchen <qingxiang.chen@zilliz.com>	2021-11-02 18:16:32 +08:00
cai.zhang	5b42a3223c	Increase compatibility for EstimateMemorySize interface (#10603 ) Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2021-10-26 15:34:21 +08:00
Cai Yudong	a63ef91c74	Fix static-check (#9776 ) Signed-off-by: yudong.cai <yudong.cai@zilliz.com>	2021-10-13 13:22:33 +08:00
dragondriver	1f224c4b2e	Optimize the ut of storage utils (#9740 ) Signed-off-by: dragondriver <jiquan.long@zilliz.com>	2021-10-12 19:47:08 +08:00
dragondriver	f85271cf3f	Estimate memory size by descriptor event (#9688 ) Signed-off-by: dragondriver <jiquan.long@zilliz.com>	2021-10-12 17:00:34 +08:00
dragondriver	7daa319dc2	[skip ci] Rename EstimateMemorySize to GetBinlogSize (#9651 ) Signed-off-by: dragondriver <jiquan.long@zilliz.com>	2021-10-11 18:20:30 +08:00

51 Commits