milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2025-12-30 15:35:33 +08:00

Author	SHA1	Message	Date
Chun Han	f7af323d1e	fix: sync partitiion stats blocking balance task(#33741 ) (#33742 ) related: #33741 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-11 14:21:56 +08:00
yihao.dai	b1d46eb34b	fix: Fix multiple vector fields import (#33723 ) 1. Fix dim mismatch with multi-vector fields and JSON import 2. Enhance: do not display file ID in GetImportResponse. issue: https://github.com/milvus-io/milvus/issues/33681, https://github.com/milvus-io/milvus/issues/33682 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-10 21:57:54 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
yihao.dai	3540eee977	enhance: Support L0 import (#33514 ) issue: https://github.com/milvus-io/milvus/issues/33157 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-07 14:17:20 +08:00
yihao.dai	35532a3e7d	fix: Fill stats log id and check validity (#33477 ) 1. Fill log ID of stats log from import 2. Add a check to validate the log ID before writing to meta issue: https://github.com/milvus-io/milvus/issues/33476 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-05 11:17:56 +08:00
wei liu	c6a1c49e02	enhance: Use Blocked Bloom Filter instead of basic bloom fitler impl. (#33405 ) issue: #32995 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with old version bf impl, but if fall back to old milvus version, it may causes bloom filter deserialize failed. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. - Block BF construct time {"time": "54.128131ms"} - Block BF size {"size": 3021578} - Block BF Test cost {"time": "55.407352ms"} - Basic BF construct time {"time": "210.262183ms"} - Basic BF size {"size": 2396308} - Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. - Block BF TestLocation cost {"time": "529.97183ms"} - Basic BF TestLocation cost {"time": "3.197430181s"} --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-31 17:49:45 +08:00
wei liu	b13932bb55	enhance: Enable database level replica num and resource groups for loading collection (#33052 ) issue: #30040 This PR introduce two database level props: 1. database.replica.number 2. database.resource_groups User can set those two database props by AlterDatabase API, then can load collection without specified replica_num and resource groups. then it will use database level load param when try to load collections. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-29 10:59:43 +08:00
Cai Yudong	4004e4c545	enhance: Optimize bulk insert unittest (#33224 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-24 10:23:41 +08:00
yihao.dai	7730b910b9	enhance: Decouple compaction from shard (#33138 ) Decouple compaction from shard, remove dependencies on shards (e.g. SyncSegments, injection). issue: https://github.com/milvus-io/milvus/issues/32809 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-24 09:07:41 +08:00
yihao.dai	9ff023ee35	fix: Fix filtering by partition key fails for importing data (#33274 ) Before executing the import, partition IDs should be reordered according to partition names. Otherwise, the data might be hashed to the wrong partition during import. This PR corrects this error. issue: https://github.com/milvus-io/milvus/issues/33237 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-23 11:13:40 +08:00
Cai Yudong	b560602885	enhance: Store SparseFloatVector into parquet as JSON string (#33101 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-17 15:01:37 +08:00
Cai Yudong	4ef163fb70	enhance: Support readable JSON file import for Float16/BFloat16/SparseFloat (#33064 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-16 14:47:35 +08:00
Cai Yudong	4fc7915c70	enhance: unify data generation test APIs (#32955 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-14 14:33:33 +08:00
SimFG	4031abd2fa	enhance: change default partition num to 16 when using partition key (#32950 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-13 14:19:31 +08:00
wei liu	e2332bdc17	enhance: Enable channel exclusive balance policy (#32911 ) issue: #32910 * split replica's node list to channels when create replicas * balance nodes among channels when node change happens * implement channel level balance, let balance happens in channel level Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-10 17:27:31 +08:00
Cai Yudong	dc89c6f810	enhance: remove duplicated data generation APIs for bulk insert test (#32889 ) Issue: #22837 including following changes: 1. Add API CreateInsertData() and BuildArrayData() in internal/util/testutil 2. Remove duplicated test APIs from importutilv2 unittest and bulk insert integration test Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-10 15:27:31 +08:00
Cai Yudong	bcdbd1966e	feat: Support sparse float vector bulk insert for binlog/json/parquet (#32649 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-07 18:43:30 +08:00
yihao.dai	53874ce245	fix: Fix cannot specify partition name in binlog import (#32730 ) issue: https://github.com/milvus-io/milvus/issues/32807 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-07 17:19:30 +08:00
yiwangdr	b1eacb2ae8	feat: datacoord/node watch based on rpc (#32036 ) issue: https://github.com/milvus-io/milvus/issues/25309 Signed-off-by: yiwangdr <yiwangdr@gmail.com>	2024-05-07 15:49:30 +08:00
yihao.dai	4de063ae14	fix: Make the dynamic column optional in parquet import (#32738 ) issue: https://github.com/milvus-io/milvus/issues/32729 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-07 11:21:29 +08:00
Buqian Zheng	37a99ca23e	fix: remove flaky sparse integration test (#32767 ) issue: https://github.com/milvus-io/milvus/issues/32766 this test is outdated, thus removing it instead of fixing it. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-05-06 19:19:29 +08:00
congqixia	ecd8e52b53	fix: Use default integration case timeout for `TestBinlogImport` (#32701 ) See also #32700 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-29 19:07:27 +08:00
yihao.dai	1594122c0a	enhance: Make the dynamic field file optional during numpy import (#32596 ) 1. Make the dynamic field file optional during numpy import 2. Add integration importing test with dynamic 3. Disallow file of pk when autoID=true during numpy import issue: https://github.com/milvus-io/milvus/issues/32542 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-28 19:39:25 +08:00
congqixia	8c4fc1e61c	enhance: Close singleton etcd client in integration teardown (#32664 ) Found lots of `failed to updateTimeTick` with error `skip ChannelTimeTickMsg from un-recognized session 1` The reason was etcd client became singleton and used last root path in multiple cases are run in one suite. This PR add close singleton client invocation to fix this problem. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-28 18:17:26 +08:00
congqixia	0ff7a46e95	fix: [skip e2e] Disable compaction for balance integration test (#32603 ) See also #31468 Balance test suite may assert segment number based on test setup. However the compaction may reduce the number and cause test cases unstable. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-25 16:55:23 +08:00
Buqian Zheng	8a1017a152	enhance: add helpers to parse sparse float vector in JSON (#32543 ) issue: #29419 added helper functions to parse JSON representation of sparse float vectors, will be used by both the restful server and the import utils. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-04-25 14:47:24 +08:00
Cai Yudong	16b8b7b35d	enhance: Add get_vector unittest for float16 & bfloat16 (#32153 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-04-23 16:15:23 +08:00
Cai Yudong	5fc439c600	feat: Bulk insert support fp16/bf16 (#32157 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-04-22 10:05:22 +08:00
smellthemoon	d1ef0a81ee	enhance: [skip e2e] change some long logs (#32309 ) <img width="1042" alt="image" src="https://github.com/milvus-io/milvus/assets/64083300/8daa9ab9-1988-4398-a92a-7d2dac2cd8cd"> change this log Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-04-17 10:25:19 +08:00
yihao.dai	558feed5ed	fix: Use pk from binlog during import (#32118 ) During binlog import, even if the primary key's autoID is set to true, the primary key from the binlog should be used instead of being reassigned. issue: https://github.com/milvus-io/milvus/discussions/31943, https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-16 14:51:20 +08:00
congqixia	b87f41128b	fix: [skip e2e] Make channel balance test accept flushing segments (#32229 ) See also #30973 Make the case stable since the segment state may be flushing when suite tries to check segment state. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-15 11:27:18 +08:00
chyezh	48fe977a9d	enhance: declarative resource group api (#31930 ) issue: #30647 - Add declarative resource group api - Add config for resource group management - Resource group recovery enhancement --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-15 08:13:19 +08:00
Patrick Weizhi Xu	52ae47c850	enhance: gather materialized view search info once per request (#31996 ) issue: #29892 This PR: 1. Move the process of gathering materialized search info to when the search plan is created, before it goes to each segment, to avoid repeated work and access the plan node under multi-threaded circumstances. 2. Enforce the supported MV type to `VARCHAR` 3. Add integration test Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-04-11 15:21:19 +08:00
yihao.dai	273df98e20	enhance: Add binlog import intergration test (#32112 ) issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-11 10:31:18 +08:00
Buqian Zheng	2fdf1a6e76	feat: [Sparse Float Vector] added some integration tests (#31062 ) add some integration tests for sparse float vector support https://github.com/milvus-io/milvus/issues/29419 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-04-10 19:57:18 +08:00
SimFG	90bed1caf9	enhance: add the related data size for the read apis (#31816 ) issue: #30436 origin pr: #30438 related pr: #31772 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-04-10 15:07:17 +08:00
wei liu	c5a9cae44e	enhance: [skip e2e]remove useless suspend/resume gc operation in integration test (#31954 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-09 19:55:17 +08:00
yiwangdr	1cd15d9322	test: support segment release in integration test (#31190 ) issue: #29507 Notice that api_testonly.go files should be guarded by compiler tag `test`, so that production build rules don't compile them and these APIs don't get misused. Signed-off-by: yiwangdr <yiwangdr@gmail.com>	2024-04-09 11:39:17 +08:00
congqixia	1c2ae59ece	fix: [skip e2e] Dedup available ports and retry for integration setup (#31902 ) See also #31901 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-08 10:35:17 +08:00
SimFG	ac26908cc4	enhance: Remove the storage info report (#31772 ) issue: #30436 origin pr: #30438 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-04-01 20:50:59 -07:00
XuanYang-cn	5d97693bcd	enhance: [skip e2e]Add it for channel balance (#30973 ) See also: #30993 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-04-02 10:25:15 +08:00
SimFG	b1a1cca10b	feat: add more operation detail info for better allocation (#30438 ) issue: #30436 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-03-28 06:33:11 +08:00
wei liu	92971707de	enhance: Add restful api for devops to execute rolling upgrade (#29998 ) issue: #29261 This PR Add restful api for devops to execute rolling upgrade, including suspend/resume balance and manual transfer segments/channels. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-27 16:15:19 +08:00
wei liu	5d752498e7	fix: Skip release duplicate l0 segment (#31540 ) issue: #31480 #31481 release duplicate l0 segment task, which execute on old delegator may cause segment lack, and execute on new delegator may break new delegator's leader view. This PR skip release duplicate l0 segment by segment_checker, cause l0 segment will be released with unsub channel --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-27 12:53:10 +08:00
congqixia	8e5865f630	enhance: Save collection targets by batches (#31616 ) See also #28491 #31240 When colleciton number is large, querycoord saves collection target one by one, which is slow and may block querycoord exits. In local run, 500 collections scenario may lead to about 40 seconds saving collection targets. This PR changes the `SaveCollectionTarget` interface into batch one and organizes the collection in 16 per bundle batches to accelerate this procedure. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-27 00:09:08 +08:00
yihao.dai	0fe5e90e8b	enhance: Remove import v1 (#31403 ) Remove all code and logic related to import v1. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-22 15:29:09 +08:00
wei liu	03eaa5d478	fix: Load segment task promote failed (#31430 ) issue: #30816 pr #31319 introduce the logic that segment checker need to load level zero segment which only exist in current target. This PR fix load segment task promote failed when segment only belongs to current target --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-21 18:09:07 +08:00
yihao.dai	c408a32db6	feat: Add disk quota checks for import V2 (#31131 ) Return quota error when the files to be imported exceed the disk quota. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-15 14:43:03 +08:00
wei liu	d79aa58b37	enhance: Speed up target recovery after query coord restart (#31240 ) issue: #28491 after querycoord restart, it will pull a new target, which include channel and segment list. when segments loaded on querynode has reached the target, the collection could provide search/query. but if segment list changes by time, ater querycoord pull a new target, it will takes a few minutes to catch up the target's segment distribution. and before that, query/search will fail due to lack of segments. This PR save the current loaded target to meta storein querycoord's stop progress, and recover it when query coord starts, to speed up the target recovery time. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-15 14:19:03 +08:00
wei liu	147a3b8bdc	fix: Grpcclient return unrecoverable error (#31256 ) issue: #31222 grpcclient's `call` func return a unrecoverable error, then the caller's retry policy also breaks due to this unrecoverable error. This PR introduce `retry.Handle`, the new func use `func() (bool, error)` as input parameters, which return `shouldRetry` directly, to avoid grpcclient return a unrecoverable error Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-03-15 10:03:05 +08:00

1 2 3

134 Commits