milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2025-12-30 23:45:28 +08:00

Author	SHA1	Message	Date
wei liu	32e55a02ea	fix: Fix privilege group hasn't been register for validate (#35937 ) issue: #35471 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-05 15:35:04 +08:00
jaime	919e96ac22	enhance: add IT for rate limit using db properties (#35930 ) issue: #35929 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-09-04 14:37:04 +08:00
cai.zhang	2c9bb4dfa3	feat: Support stats task to sort segment by PK (#35054 ) issue: #33744 This PR includes the following changes: 1. Added a new task type to the task scheduler in datacoord: stats task, which sorts segments by primary key. 2. Implemented segment sorting in indexnode. 3. Added a new field `FieldStatsLog` to SegmentInfo to store token index information. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-09-02 14:19:03 +08:00
smellthemoon	a3f2f044d6	fix: not set nullable when stream writer write headers (#35799 ) #35802 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-08-29 20:59:00 +08:00
Zhen Ye	99dff06391	enhance: using streaming service in insert/upsert/flush/delete/querynode (#35406 ) issue: #33285 - using streaming service in insert/upsert/flush/delete/querynode - fixup flusher bugs and refactor the flush operation - enable streaming service for dml and ddl - pass the e2e when enabling streaming service - pass the integration tst when enabling streaming service --------- Signed-off-by: chyezh <chyezh@outlook.com> Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-08-29 10:03:08 +08:00
XuanYang-cn	0e7877d413	fix: [skip e2e]unstable l0 it (#35612 ) See also: #35617 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-08-26 18:53:04 +08:00
wei liu	5c245d51c4	enhance: Refresh proxy cache after restore rbac meta (#35635 ) issue: #35443 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-22 19:09:01 +08:00
OxalisCu	ed4eaffc9d	enhance: add csv support for bulkinsert (#34938 ) See this issue for details: #34937 --------- Signed-off-by: OxalisCu <2127298698@qq.com>	2024-08-21 17:47:01 +08:00
smellthemoon	ba6db117e3	enhance: add some integration test in null (#35599 ) #31728 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-08-21 17:44:56 +08:00
smellthemoon	80a7c78f28	enhance: import supports null in parquet and json formats (#35558 ) #31728 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-08-20 16:50:55 +08:00
Chun Han	031ee6f155	enhance: support httpv1/v2 throttle and add it for httpV2(#35350 ) (#35470 ) related: #35350 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-08-20 16:16:55 +08:00
wei liu	e09dc3be58	enhance: Mark query node as read only after suspend (#35492 ) issue: #34985 #35493 after querynode has been suspended, it's not allow to load segment/channel on it, which means the node is read only. to be compatible with resource group design, after query node has been suspend, we remove it from it's original resource group, make it a read only query node in replica. then two things will happens: 1. it's original resource group will be lacking of query nodes, query coord will assign new node to it. 2. querycoord will try to move out all segments/channels after querynode has been suspended Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-20 14:02:54 +08:00
XuanYang-cn	967f38672a	enhance: Add integration tests for l0 (#35429 ) See also: #34796 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-08-19 10:56:54 +08:00
Buqian Zheng	f4a91e135b	enhance: Allow empty sparse row (#34700 ) issue: #29419 * If a sparse vector with 0 non-zero value is inserted, no ANN search on this sparse vector field will return it as a result. User may retrieve this row via scalar query or ANN search on another vector field though. * If the user uses an empty sparse vector as the query vector for a ANN search, no neighbor will be returned. Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-08-16 14:14:54 +08:00
wei liu	1d49358f82	enhance: Add BackupRBAC/RestoreRBAC API to enable rbac backup (#35444 ) issue: #35443 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-16 10:10:53 +08:00
wei liu	344dc6a9f8	enhance: enable to set load config in cluster level (#35169 ) issue: #35170 This PR enable to set load configs in cluster level, such as replicas and resource groups. then when load collections will use the load config. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-07 12:38:21 +08:00
yellow-shine	241c71fdde	enhance: use docker compose instead of docker-compose (#35208 ) https://github.com/milvus-io/milvus/issues/35209 --------- Signed-off-by: Yellow Shine <sammy.huang@zilliz.com>	2024-08-02 19:32:32 +08:00
cai.zhang	196a7986b3	enhance: Change the fixed value to a ratio for clustering segment size (#35076 ) issue: #34495 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-08-01 22:04:14 +08:00
congqixia	a642a26ed4	enhance: Resolve ChunkFileWriter lint issue (#35166 ) See also #34483 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-01 16:52:13 +08:00
wayblink	5bbb1c201c	enhance:support l2 single compaction (#34935 ) #34928 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-08-01 14:36:13 +08:00
congqixia	dfda9f0478	enhance: Add depguard rules to ban deprecated proto lib (#35140 ) See also #34394 #34252 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-01 10:01:49 +08:00
smellthemoon	6106a48acb	fix: upsert result use the previous pk (#34672 ) #34668 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-07-31 15:25:51 +08:00
wei liu	c45f38aa61	enhance: Update protobuf-go to protobuf-go v2 (#34394 ) issue: #34252 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-29 11:31:51 +08:00
cai.zhang	2372452fac	enhance: Optimized the GC logic to ensure that memory is released in time (#34949 ) issue: #34703 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-28 23:53:47 +08:00
congqixia	783f9d9c33	fix: Unify hook singleton implementation in proxy (#34887 ) Related to #34885 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-26 18:07:53 +08:00
cai.zhang	4c45bc412f	enhance: Add integration test for clustering compaction (#34881 ) issue: #34792 --------- Signed-off-by: cai.zhang <cai.zhang@zilliz.com> Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-23 10:13:43 +08:00
wayblink	c339df26fc	enhance: refine clustering compaction basic it (#34793 ) #34792 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-07-22 11:27:51 +08:00
yihao.dai	b22e549844	enhance: Rename config of sealing by growing segmetns size (#34787 ) /kind improvement --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-19 20:27:41 +08:00
yihao.dai	4939f82d4f	enhance: Seal by total growing segments size (#34692 ) Seals the largest growing segment if the total size of growing segments of each shard exceeds the size threshold(default 4GB). Introducing this policy can help keep the size of growing segments within a suitable level, alleviating the pressure on the delegator. issue: https://github.com/milvus-io/milvus/issues/34554 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-17 21:45:41 +08:00
wei liu	acb33bba4d	enhance: Preserve fixed-size memory in delegator node for growing segment. (#34596 ) issue: #34595 When consuming insert data on the delegator node, QueryCoord will move out some sealed segments to manage its memory usage. After the growing segment gets flushed, some sealed segments from other workers will be moved back to the delegator node. To avoid the frequent movement of segments, we estimate the maximum growing row count and preserve a fixed-size memory in the delegator node. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-15 20:51:46 +08:00
XuanYang-cn	eb472b7f08	enhance: [skip e2e]Enable compaction it test (#34526 ) Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-07-15 20:45:39 +08:00
smellthemoon	07b94b4615	enhance: support upsert autoid==true (#30342 ) related with: #29258 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-07-11 16:53:35 +08:00
wei liu	9b37d3f517	enhance: Enable setting the replica number and resource group during collection creation (#34403 ) issue: #30040 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-10 10:20:13 +08:00
congqixia	3333160b8d	enhance: Fix lint issues from recent PRs (#34482 ) See also #34483 Some lint issues are introduced due to lack of static check run. This PR fixes these problems. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-09 10:06:24 +08:00
jaime	d6afb31b94	enhance: make subfunctions of datanode component modular (#33992 ) issue: #33994 also remove deprecated channel manager based on the etcd implementation Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-07-01 14:46:07 +08:00
congqixia	144ee269f2	fix: [skip e2e] Skip unstable integration test for master (#33824 ) See also #33716 #33823 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-13 16:53:55 +08:00
Cai Yudong	9d4535ce0b	enhance: Handle Float16Vector/BFloat16Vector numpy bulk insert as same as BinaryVector (#33760 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-06-12 17:17:55 +08:00
Chun Han	f7af323d1e	fix: sync partitiion stats blocking balance task(#33741 ) (#33742 ) related: #33741 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-11 14:21:56 +08:00
yihao.dai	b1d46eb34b	fix: Fix multiple vector fields import (#33723 ) 1. Fix dim mismatch with multi-vector fields and JSON import 2. Enhance: do not display file ID in GetImportResponse. issue: https://github.com/milvus-io/milvus/issues/33681, https://github.com/milvus-io/milvus/issues/33682 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-10 21:57:54 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
yihao.dai	3540eee977	enhance: Support L0 import (#33514 ) issue: https://github.com/milvus-io/milvus/issues/33157 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-07 14:17:20 +08:00
yihao.dai	35532a3e7d	fix: Fill stats log id and check validity (#33477 ) 1. Fill log ID of stats log from import 2. Add a check to validate the log ID before writing to meta issue: https://github.com/milvus-io/milvus/issues/33476 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-05 11:17:56 +08:00
wei liu	c6a1c49e02	enhance: Use Blocked Bloom Filter instead of basic bloom fitler impl. (#33405 ) issue: #32995 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with old version bf impl, but if fall back to old milvus version, it may causes bloom filter deserialize failed. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. - Block BF construct time {"time": "54.128131ms"} - Block BF size {"size": 3021578} - Block BF Test cost {"time": "55.407352ms"} - Basic BF construct time {"time": "210.262183ms"} - Basic BF size {"size": 2396308} - Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. - Block BF TestLocation cost {"time": "529.97183ms"} - Basic BF TestLocation cost {"time": "3.197430181s"} --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-31 17:49:45 +08:00
wei liu	b13932bb55	enhance: Enable database level replica num and resource groups for loading collection (#33052 ) issue: #30040 This PR introduce two database level props: 1. database.replica.number 2. database.resource_groups User can set those two database props by AlterDatabase API, then can load collection without specified replica_num and resource groups. then it will use database level load param when try to load collections. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-29 10:59:43 +08:00
Cai Yudong	4004e4c545	enhance: Optimize bulk insert unittest (#33224 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-24 10:23:41 +08:00
yihao.dai	7730b910b9	enhance: Decouple compaction from shard (#33138 ) Decouple compaction from shard, remove dependencies on shards (e.g. SyncSegments, injection). issue: https://github.com/milvus-io/milvus/issues/32809 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-24 09:07:41 +08:00
yihao.dai	9ff023ee35	fix: Fix filtering by partition key fails for importing data (#33274 ) Before executing the import, partition IDs should be reordered according to partition names. Otherwise, the data might be hashed to the wrong partition during import. This PR corrects this error. issue: https://github.com/milvus-io/milvus/issues/33237 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-23 11:13:40 +08:00
Cai Yudong	b560602885	enhance: Store SparseFloatVector into parquet as JSON string (#33101 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-17 15:01:37 +08:00
Cai Yudong	4ef163fb70	enhance: Support readable JSON file import for Float16/BFloat16/SparseFloat (#33064 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-16 14:47:35 +08:00
Cai Yudong	4fc7915c70	enhance: unify data generation test APIs (#32955 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-14 14:33:33 +08:00

1 2 3 4

171 Commits