milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-02 17:05:33 +08:00

Author	SHA1	Message	Date
xige-16	aee19dcd6b	enhance: Opt vector dimension mismatch error message (#29928 ) issue: https://github.com/milvus-io/milvus/issues/29791 /kind improvement Signed-off-by: xige-16 <xi.ge@zilliz.com> --------- Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-19 17:52:54 +08:00
yah01	f542bdbf3c	enhance: calc the accurate mem size of segment (#30093 ) this stats the real memory size of segment, also reduces the memory usage in mmap mode resolve #30095 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-19 12:32:53 +08:00
xige-16	fa7cf587b0	enhance: Opt metric type does not match error message (#29927 ) issue: #29791 /kind improvement Signed-off-by: xige-16 <xi.ge@zilliz.com> Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-17 20:25:03 +08:00
yah01	1185e4dcd5	fix: written file size is over the int32 range and raises error (#30057 ) we sum the total data size in int32, which could lead to an overflow error related #30056 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-17 16:42:54 +08:00
Bingyi Sun	8030b90891	fix: correct file name when loading index (#29985 ) issue: #29973 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-01-16 10:24:52 +08:00
MrPresent-Han	c31e68446e	enhance: refine groupby-performance (#29933 ) related: #29844 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-01-15 14:12:52 +08:00
chyezh	def717af55	fix: SealedIndexingEntry in SealedIndexingRecord may leak without smart pointer protect. (#29932 ) may related issue: #29828 Signed-off-by: chyezh <ye.zhen@zilliz.com>	2024-01-14 10:28:51 +08:00
Bingyi Sun	e1258b8cad	feat: integrate storagev2 into loading segment (#29336 ) issue: #29335 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-01-12 18:10:51 +08:00
yah01	f2e36db488	enhance: optimize the loading index performance (#29894 ) this utilizes concurrent loading Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-12 17:44:51 +08:00
yah01	6c477ce3a7	enhance: optimize the loading strategy (#29910 ) as we have the pool size limit so we don't need to limit the concurrency manually Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-12 14:26:50 +08:00
yah01	aba2656e68	fix: missing field data after appending scalar index to loaded segment (#29912 ) related #29843 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-12 14:04:54 +08:00
Xu Tong	e429965f32	Add float16 approve for multi-type part (#28427 ) issue：https://github.com/milvus-io/milvus/issues/22837 Add bfloat16 vector, add the index part of float16 vector. Signed-off-by: Writer-X <1256866856@qq.com>	2024-01-11 15:48:51 +08:00
zhagnlu	5164d30287	fix: increase expr recursion depth to avoid parse failed (#29860 ) #29759 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-01-11 10:26:50 +08:00
yah01	031243fee7	feat: support mmap for marisa trie (#29613 ) this supports mmap for marisa trie index related https://github.com/milvus-io/milvus/issues/21866 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-11 10:22:50 +08:00
congqixia	d6429933a7	enhance: make Load process traceable in querynode & segcore (#29858 ) See also #29803 This PR: - Add trace span for `LoadIndex` & `LoadFieldData` in segment loader - Add `TraceCtx` parameter for `Index.Load` in segcore - Add span for ReadFiles & Engine Load for Memory/Disk Vector index --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-10 21:58:51 +08:00
Cai Yudong	cb9d9ec0f0	enhance: Correct sampleFraction's type to float (#29810 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-01-10 13:18:50 +08:00
zhagnlu	601a8b801b	fix: add move cursor function to physical expr (#29603 ) #29570 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-01-09 17:08:48 +08:00
zhenshan.cao	60e88fb833	fix: Restore the MVCC functionality. (#29749 ) When the TimeTravel functionality was previously removed, it inadvertently affected the MVCC functionality within the system. This PR aims to reintroduce the internal MVCC functionality as follows: 1. Add MvccTimestamp to the requests of Search/Query and the results of Search internally. 2. When the delegator receives a Query/Search request and there is no MVCC timestamp set in the request, set the delegator's current tsafe as the MVCC timestamp of the request. If the request already has an MVCC timestamp, do not modify it. 3. When the Proxy handles Search and triggers the second phase ReQuery, divide the ReQuery into different shards and pass the MVCC timestamp to the corresponding Query requests. issue: #29656 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-01-09 11:38:48 +08:00
xige-16	9702cef2b5	feat: Support multiple vector search (#29433 ) issue #25639 Signed-off-by: xige-16 <xi.ge@zilliz.com> Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-08 15:34:48 +08:00
Jiquan Long	e9f3df3626	fix: inverted index file not found (#29695 ) issue: https://github.com/milvus-io/milvus/issues/29654 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-01-07 20:26:49 +08:00
zhagnlu	d07197ab1a	enhance: add compare simd function (#29432 ) #26137 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-01-07 20:20:57 +08:00
foxspy	271edc6669	fix: throw exception when upload file failed for DiskIndex (#29627 ) related to : #29417 cardinal indexes upload index files in `Serialize` interface, and throw exception when the `Serialize` failed. Signed-off-by: xianliang <xianliang.li@zilliz.com>	2024-01-07 20:03:13 +08:00
cai.zhang	5dc300c4a9	fix: Fix bug for pk index doesn't have raw data (#29711 ) issue: #29697 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-01-07 19:36:48 +08:00
MrPresent-Han	9e2e7157e9	feat: support search_group_by for milvus(#25324 ) (#28983 ) related: #25324 Search GroupBy function, used to aggregate result entities based on a specific scalar column. several points to mention: 1. Temporarliy, the whole groupby is implemented separated from iterative expr framework for the first period 2. In the long term, the groupBy operation will be incorporated into the iterative expr framework:https://github.com/milvus-io/milvus/pull/28166 3. This pr includes some unrelated mocked interface regarding alterIndex due to some unworth-to-mention reasons. All these un-associated content will be removed before the final pr is merged. This version of pr is only for review 4. All other related details were commented in the files comparison Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-01-05 15:50:47 +08:00
cqy123456	22bb84fa9d	feat:add new gpu index:GPU_BRUTE_FORCE and limit gpu index metric type (#29590 ) issue: https://github.com/milvus-io/milvus/issues/29230 this pr do these things: 1. add gpu brute force; 2. limit gpu index only support l2 / ip; Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-01-05 15:24:48 +08:00
PowderLi	c8db36a63a	enhance: get a blob to check object storage config (#29703 ) issue: #29672 the storage account need privileges of actions `Microsoft.Storage/storageAccounts/blobServices/containers/blobs/*` at least Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-05 14:50:46 +08:00
yah01	0ae90443ba	enhance: fill missed info for segcore error (#29610 ) - fill missed error info - format the error message directly Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-04 17:54:46 +08:00
yah01	99e0f1e65a	enhance: unable to compile C++ tests (#29616 ) The tests need to call a private method, Milvus uses `#define` to replace private with public, the hack trick works but would be broken if the including order changed. This uses friend to make all things work well Signed-off-by: yah01 <yang.cen@zilliz.com> Signed-off-by: yah01 <yah2er0ne@outlook.com>	2024-01-04 13:20:46 +08:00
PowderLi	5f00bad4b8	fix: link with install path's libblob-chunk-manager (#29496 ) issue: #29494 1. link with install path's libblob-chunk-manager 2. performance of `ShouldBindWith` is better than `ShouldBindBodyWith` 3. the middleware shouldn't read the unrefreshed parameter repeatly Signed-off-by: PowderLi <min.li@zilliz.com>	2023-12-31 20:02:48 +08:00
Jiquan Long	3f46c6d459	feat: support inverted index (#28783 ) issue: https://github.com/milvus-io/milvus/issues/27704 Add inverted index for some data types in Milvus. This index type can save a lot of memory compared to loading all data into RAM and speed up the term query and range query. Supported: `INT8`, `INT16`, `INT32`, `INT64`, `FLOAT`, `DOUBLE`, `BOOL` and `VARCHAR`. Not supported: `ARRAY` and `JSON`. Note: - The inverted index for `VARCHAR` is not designed to serve full-text search now. We will treat every row as a whole keyword instead of tokenizing it into multiple terms. - The inverted index don't support retrieval well, so if you create inverted index for field, those operations which depend on the raw data will fallback to use chunk storage, which will bring some performance loss. For example, comparisons between two columns and retrieval of output fields. The inverted index is very easy to be used. Taking below collection as an example: ```python fields = [ FieldSchema(name="pk", dtype=DataType.VARCHAR, is_primary=True, auto_id=False, max_length=100), FieldSchema(name="int8", dtype=DataType.INT8), FieldSchema(name="int16", dtype=DataType.INT16), FieldSchema(name="int32", dtype=DataType.INT32), FieldSchema(name="int64", dtype=DataType.INT64), FieldSchema(name="float", dtype=DataType.FLOAT), FieldSchema(name="double", dtype=DataType.DOUBLE), FieldSchema(name="bool", dtype=DataType.BOOL), FieldSchema(name="varchar", dtype=DataType.VARCHAR, max_length=1000), FieldSchema(name="random", dtype=DataType.DOUBLE), FieldSchema(name="embeddings", dtype=DataType.FLOAT_VECTOR, dim=dim), ] schema = CollectionSchema(fields) collection = Collection("demo", schema) ``` Then we can simply create inverted index for field via: ```python index_type = "INVERTED" collection.create_index("int8", {"index_type": index_type}) collection.create_index("int16", {"index_type": index_type}) collection.create_index("int32", {"index_type": index_type}) collection.create_index("int64", {"index_type": index_type}) collection.create_index("float", {"index_type": index_type}) collection.create_index("double", {"index_type": index_type}) collection.create_index("bool", {"index_type": index_type}) collection.create_index("varchar", {"index_type": index_type}) ``` Then, term query and range query on the field can be speed up automatically by the inverted index: ```python result = collection.query(expr='int64 in [1, 2, 3]', output_fields=["pk"]) result = collection.query(expr='int64 < 5', output_fields=["pk"]) result = collection.query(expr='int64 > 2997', output_fields=["pk"]) result = collection.query(expr='1 < int64 < 5', output_fields=["pk"]) ``` --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2023-12-31 19:50:47 +08:00
zhagnlu	79c417b14e	fix: pass active count to query context instead of timestamp (#29541 ) #29319 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-12-31 16:08:48 +08:00
Jiquan Long	6f4791da0b	fix: panic in concurrent insert/query scenario (#29408 ) issue: https://github.com/milvus-io/milvus/issues/29405 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2023-12-26 15:10:48 +08:00
yah01	b8318fcd7d	enhance: improve the handling for segcore error (#29471 ) - fix lost exception details in segcore - improve the logs of handling errors from segcore Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-26 14:06:46 +08:00
cqy123456	4c979538a4	enhance: update cagra index params in config and add params check (#29045 ) issue:https://github.com/milvus-io/milvus/issues/29230 this pr do two things about cagra index: a.milvus yaml config support gpu memory settings b.add cagra-params check Signed-off-by: cqy123456 <qianya.cheng@zilliz.com> Co-authored-by: yusheng.ma <yusheng.ma@zilliz.com>	2023-12-26 11:04:47 +08:00
yah01	aef483806d	enhance: improve the segcore logs (#29372 ) - remove the streaming logging - refine existing logs fix #29366 --------- Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-12-23 21:52:43 +08:00
yah01	1b7f1d7067	enhance: mmap data corrupted after seal the column (#29422 ) this bug was introduced in recent changes Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-23 15:20:43 +08:00
zhagnlu	1cbe3cd5fc	fix: fix memory leak when cancel segcore task (#29431 ) #29430 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-12-22 20:28:43 +08:00
zhagnlu	a6eb7e5f9a	enhance: skip segment when using pk in (..) expr (#29394 ) #29293 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-12-21 20:06:42 +08:00
yah01	7a2374e698	enhance: reduce the memory usage of variable length data (#29387 ) add all loading data into a buffer and then copy them into the a fit-in-size memory --------- Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-21 18:02:42 +08:00
chyezh	be87c18b44	fix: fixup data race at generate binlog index (#29370 ) issue: #29339 Signed-off-by: chyezh <ye.zhen@zilliz.com>	2023-12-21 14:58:49 +08:00
yah01	04b2518ae7	enhance: fix the incorrect init parameter (#29357 ) as the `driver_` field is not used so this doesn't matter for now Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-20 20:50:43 +08:00
Gao	9b52cb6417	enhance: improve reducing results when many segments are filtered (#29073 ) Do not fill the invalid ids for the empty results, it will incur useless memory overhead and reduce overhead when nq and topk is large. --------- Signed-off-by: chasingegg <chao.gao@zilliz.com>	2023-12-20 12:56:42 +08:00
yah01	8f89e9cf75	enhance: remove all unnecessary string formatting (#29323 ) done by two regex expressions: - `PanicInfo\((.+),[. \n]+fmt::format\(([.\s\S]+?)\)\)` - `AssertInfo\((.+),[. \n]+fmt::format\(([.\s\S]+?)\)\)` related: #28811 --------- Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-20 10:04:43 +08:00
Bingyi Sun	89b208d27a	enhance: Fix format message (#29159 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2023-12-20 09:30:44 +08:00
MrPresent-Han	bfca0a7926	fix: refine skipIndex to resolve cyclic dependcy(#29132 ) (#29189 ) related: #29132 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-12-19 10:26:40 +08:00
zhagnlu	a602171d06	enhance: Refactor runtime and expr framework (#28166 ) #28165 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-12-18 12:04:42 +08:00
Cai Yudong	26409d801e	enhance: Remove omp from segcore (#29207 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2023-12-15 14:00:39 +08:00
cai.zhang	49b8657f95	enhance: Support implicit type conversion for parquet (#29046 ) issue: #29019 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2023-12-12 16:14:44 +08:00
Enwei Jiao	0e65e90338	enhance: Support otlp with insecure (#29115 ) issue: https://github.com/milvus-io/milvus/issues/28914 Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-12-12 11:14:37 +08:00
MrPresent-Han	464bc9e8f4	fix: fix reduce precision for search(#27325 ) (#29031 ) related: #27325 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-12-08 10:04:37 +08:00

1 2 3 4 5 ...

1130 Commits