milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-07 19:31:51 +08:00

Author	SHA1	Message	Date
yah01	f542bdbf3c	enhance: calc the accurate mem size of segment (#30093 ) this stats the real memory size of segment, also reduces the memory usage in mmap mode resolve #30095 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-19 12:32:53 +08:00
xige-16	fa7cf587b0	enhance: Opt metric type does not match error message (#29927 ) issue: #29791 /kind improvement Signed-off-by: xige-16 <xi.ge@zilliz.com> Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-17 20:25:03 +08:00
yah01	6c477ce3a7	enhance: optimize the loading strategy (#29910 ) as we have the pool size limit so we don't need to limit the concurrency manually Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-12 14:26:50 +08:00
Xu Tong	e429965f32	Add float16 approve for multi-type part (#28427 ) issue: https://github.com/milvus-io/milvus/issues/22837 Add bfloat16 vector, add the index part of float16 vector. Signed-off-by: Writer-X <1256866856@qq.com>	2024-01-11 15:48:51 +08:00
yah01	031243fee7	feat: support mmap for marisa trie (#29613 ) this supports mmap for marisa trie index related https://github.com/milvus-io/milvus/issues/21866 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-11 10:22:50 +08:00
congqixia	d6429933a7	enhance: make Load process traceable in querynode & segcore (#29858 ) See also #29803 This PR: - Add trace span for `LoadIndex` & `LoadFieldData` in segment loader - Add `TraceCtx` parameter for `Index.Load` in segcore - Add span for ReadFiles & Engine Load for Memory/Disk Vector index --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-10 21:58:51 +08:00
Cai Yudong	cb9d9ec0f0	enhance: Correct sampleFraction's type to float (#29810 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-01-10 13:18:50 +08:00
MrPresent-Han	9e2e7157e9	feat: support search_group_by for milvus(#25324 ) (#28983 ) related: #25324 Search GroupBy function, used to aggregate result entities based on a specific scalar column. several points to mention: 1. Temporarily, the whole groupby is implemented separately from the iterative expr framework for the first period 2. In the long term, the groupBy operation will be incorporated into the iterative expr framework: https://github.com/milvus-io/milvus/pull/28166 3. This pr includes some unrelated mocked interface regarding alterIndex due to some unworth-to-mention reasons. All these un-associated content will be removed before the final pr is merged. This version of pr is only for review 4. All other related details were commented in the files comparison Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-01-05 15:50:47 +08:00
Jiquan Long	3f46c6d459	feat: support inverted index (#28783 )

issue: https://github.com/milvus-io/milvus/issues/27704

Add inverted index for some data types in Milvus. This index type can save a lot of memory compared to loading all data into RAM and speed up the term query and range query.

Supported: `INT8`, `INT16`, `INT32`, `INT64`, `FLOAT`, `DOUBLE`, `BOOL` and `VARCHAR`.

Not supported: `ARRAY` and `JSON`.

Note:
- The inverted index for `VARCHAR` is not designed to serve full-text search now. We will treat every row as a whole keyword instead of tokenizing it into multiple terms.
- The inverted index doesn't support retrieval well, so if you create an inverted index for a field, those operations which depend on the raw data will fall back to using chunk storage, which will bring some performance loss. For example, comparisons between two columns and retrieval of output fields.

The inverted index is very easy to use.

Taking the collection below as an example:

```python
fields = [
    FieldSchema(name="pk", dtype=DataType.VARCHAR, is_primary=True, auto_id=False, max_length=100),
    FieldSchema(name="int8", dtype=DataType.INT8),
    FieldSchema(name="int16", dtype=DataType.INT16),
    FieldSchema(name="int32", dtype=DataType.INT32),
    FieldSchema(name="int64", dtype=DataType.INT64),
    FieldSchema(name="float", dtype=DataType.FLOAT),
    FieldSchema(name="double", dtype=DataType.DOUBLE),
    FieldSchema(name="bool", dtype=DataType.BOOL),
    FieldSchema(name="varchar", dtype=DataType.VARCHAR, max_length=1000),
    FieldSchema(name="random", dtype=DataType.DOUBLE),
    FieldSchema(name="embeddings", dtype=DataType.FLOAT_VECTOR, dim=dim),
]
schema = CollectionSchema(fields)
collection = Collection("demo", schema)
```

Then we can simply create an inverted index for a field via:

```python
index_type = "INVERTED"
collection.create_index("int8", {"index_type": index_type})
collection.create_index("int16", {"index_type": index_type})
collection.create_index("int32", {"index_type": index_type})
collection.create_index("int64", {"index_type": index_type})
collection.create_index("float", {"index_type": index_type})
collection.create_index("double", {"index_type": index_type})
collection.create_index("bool", {"index_type": index_type})
collection.create_index("varchar", {"index_type": index_type})
```

Then, term query and range query on the field can be sped up automatically by the inverted index:

```python
result = collection.query(expr='int64 in [1, 2, 3]', output_fields=["pk"])
result = collection.query(expr='int64 < 5', output_fields=["pk"])
result = collection.query(expr='int64 > 2997', output_fields=["pk"])
result = collection.query(expr='1 < int64 < 5', output_fields=["pk"])
```

---------

Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2023-12-31 19:50:47 +08:00
yah01	aef483806d	enhance: improve the segcore logs (#29372 ) - remove the streaming logging - refine existing logs fix #29366 --------- Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-12-23 21:52:43 +08:00
yah01	04b2518ae7	enhance: fix the incorrect init parameter (#29357 ) as the `driver_` field is not used so this doesn't matter for now Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-20 20:50:43 +08:00
yah01	8f89e9cf75	enhance: remove all unnecessary string formatting (#29323 ) done by two regex expressions: - `PanicInfo\((.+),[. \n]+fmt::format\(([.\s\S]+?)\)\)` - `AssertInfo\((.+),[. \n]+fmt::format\(([.\s\S]+?)\)\)` related: #28811 --------- Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-20 10:04:43 +08:00
zhagnlu	a602171d06	enhance: Refactor runtime and expr framework (#28166 ) #28165 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-12-18 12:04:42 +08:00
Cai Yudong	26409d801e	enhance: Remove omp from segcore (#29207 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2023-12-15 14:00:39 +08:00
Enwei Jiao	0e65e90338	enhance: Support otlp with insecure (#29115 ) issue: https://github.com/milvus-io/milvus/issues/28914 Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-12-12 11:14:37 +08:00
congqixia	dcb662d9ed	enhance: Refine C.NewSegment response and handle exception (#28952 ) See also #28795 Original `C.NewSegment` may panic if some condition is not met, this pr changes response struct to `CNewSegmentResult`, which contains `C.CStatus` and may return caught exception --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-07 13:34:35 +08:00
Bingyi Sun	36f69ea031	feat: integrate storagev2 in building index of segcore (#28768 ) issue: https://github.com/milvus-io/milvus/issues/28655 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2023-12-05 16:48:54 +08:00
yah01	342635ed61	enhance: enable assert method to format arguments (#28812 ) for now the assert method in segcore could accept a string information, too many codes don't print the value they assert. make it happy related #28811 --------- Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-12-01 18:04:33 +08:00
yah01	02c5a649cf	enhance: store system fields in segcore (#28524 ) we need the system fields info for some use cases fix: #28523 --------- Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-11-21 09:28:22 +08:00
Xu Tong	8ec85f5f4c	Add template for VectorMemIndex (#28324 ) Signed-off-by: Writer-X <1256866856@qq.com>	2023-11-11 13:20:22 +08:00
yah01	30847cad3e	Handle exception while loading (#28304 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-11-09 17:59:12 +08:00
cai.zhang	2b5f632fa4	Fix bug for constructing ArrayView with fixed-length type (#28185 ) Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-11-07 23:38:21 +08:00
cai.zhang	8011054a2a	Check length before comparing strings (#28110 ) Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-11-04 10:04:29 +08:00
cai.zhang	fc2df9514f	Refine code for fixed-length types array (#28108 ) Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-11-03 00:40:14 +08:00
yah01	dc89730a50	Support collection-level mmap control (#26901 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-11-02 23:52:16 +08:00
Enwei Jiao	8ae9c947ae	Use OpenDAL to access object store (#25642 ) Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-11-01 09:00:14 +08:00
yah01	2af46d7333	Increase the ChunkManager request timeout (#28015 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-10-31 09:06:13 +08:00
zhagnlu	6060dd7ea8	Add chunk manager request timeout (#27692 ) Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-10-23 20:08:08 +08:00
Enwei Jiao	e98e56f75d	Fix SIGSEGV if run within gdb (#27736 ) Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-10-18 02:16:11 +08:00
Enwei Jiao	b80a3e19d3	Add code for PanicInfo (#27364 ) Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-09-27 12:01:28 +08:00
foxspy	5db4a0489e	dynamic index version control (#27335 ) Co-authored-by: longjiquan <jiquan.long@zilliz.com>	2023-09-25 21:39:27 +08:00
foxspy	fa033e586a	disable growing index for flat (#27309 ) Signed-off-by: xianliang <xianliang.li@zilliz.com>	2023-09-22 14:19:24 +08:00
foxspy	370b6fde58	milvus support multi index engine (#27178 ) Co-authored-by: longjiquan <jiquan.long@zilliz.com>	2023-09-22 09:59:26 +08:00
cai.zhang	a362bb1457	Support array datatype (#26369 ) Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2023-09-19 14:23:23 +08:00
PowderLi	4feb3fa7c6	support azure (#26398 ) Signed-off-by: PowderLi <min.li@zilliz.com>	2023-09-19 10:01:23 +08:00
yihao.dai	bb6711f28c	Add ChunkCache: support get vector from storage (#26142 ) Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2023-09-15 10:21:20 +08:00
Enwei Jiao	0afdfdb9af	Remove other Exceptions, keeps SegcoreError only (#27017 ) Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-09-14 14:05:20 +08:00
Enwei Jiao	c3f15c6b95	Refactor duplicate error class into one place (#26985 ) Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-09-11 20:43:17 +08:00
Xu Tong	9166011c4a	Add float16 vector (#25852 ) Signed-off-by: Writer-X <1256866856@qq.com>	2023-09-08 10:03:16 +08:00
MrPresent-Han	a34a9d606c	fix panic due to empty traceID(#26754 ) (#26808 ) Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-09-02 16:13:01 +08:00
MrPresent-Han	7d5a4b2994	add more event for segcore search(#26277 ) (#26688 ) Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-08-30 14:15:01 +08:00
MrPresent-Han	d30a920226	add log trace for segcore(#26277 ) (#26339 ) Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-08-16 11:41:33 +08:00
yah01	c0870d3c62	Set default thread pool size to be smaller (#26328 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-08-14 16:33:35 +08:00
cai.zhang	a0198ce8ae	Support json contains feature (#25384 ) Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2023-08-11 17:09:30 +08:00
zhagnlu	411f9ac823	Upgrade minio-go and add region and virtual host config for segcore chunk manager (#26194 ) Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-08-11 10:37:36 +08:00
xige-16	1055c90456	Add default retrieve limit (#24782 ) Signed-off-by: xige-16 <xi.ge@zilliz.com>	2023-08-10 14:11:15 +08:00
Enwei Jiao	f97d5a7d08	Fix compile failed with llvm16 (#26112 ) Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-08-03 17:29:07 +08:00
MrPresent-Han	5634ba777d	add new threadpool with various priority to avoid deadlock(#25781 ) (#26028 ) Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-08-03 09:31:07 +08:00
zhagnlu	833674c1cb	add glog configurable function and redirect aws log to segcore log (#25664 ) Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-07-27 19:49:02 +08:00
Cai Yudong	9a4761dcc7	Remove binary metrics TANIMOTO/SUPERSTRUCTURE/SUBSTRUCTURE (#25708 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2023-07-19 16:16:58 +08:00


246 Commits