If the collection TTL property is malformed (e.g., a non-numeric value),
compaction tasks fail silently and get stuck. This change:
- Adds centralized GetCollectionTTL/GetCollectionTTLFromMap functions in
pkg/common to parse TTL values with proper error handling (see the
sketch below)
- Validates the TTL property in createCollectionTask and
alterCollectionTask PreExecute to reject invalid values early
- Refactors the datacoord compaction policies to use the new common
functions
- Removes the duplicated getCollectionTTL from datacoord/util.go
issue: #46716
- Core invariant: collection.ttl.seconds must be a parseable int64,
validated at collection create/alter time, so that malformed TTLs never
reach the compaction/execution code paths.
- Bug fix (resolves #46716): malformed/non-numeric TTLs could silently
cause compaction tasks to fail or stall; fixed by adding the centralized
parsing helpers pkg/common.GetCollectionTTL and GetCollectionTTLFromMap
and validating the TTL in createCollectionTask.PreExecute and
alterCollectionTask.PreExecute (callers pass a default of -1, and parse
failures return parameter-invalid errors).
- Simplification / removed redundancy: eliminated the duplicated
getCollectionTTL in internal/datacoord/util.go and replaced ad-hoc TTL
parsing across datacoord (compaction policies, import_util, compaction
triggers) and the proxy util with the common helpers, centralizing the
error handling and defaulting logic.
- No data loss or behavior regression: valid-TTL parsing semantics are
unchanged (the helpers use identical int64 parsing and the default
fallback from paramtable/CommonCfg). Validation occurs in PreExecute, so
existing valid collections proceed unchanged while malformed values are
rejected early; compaction code paths now receive only validated TTL
values (or explicit defaults), preventing silent skips without altering
valid execution flows.
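A minimal sketch of the shape such a helper could take, using the
collection.ttl.seconds key and -1 default mentioned above; the actual
pkg/common signatures and defaulting details may differ:

```go
package common

import (
	"fmt"
	"strconv"
)

// CollectionTTLConfigKey is the collection property holding the TTL.
const CollectionTTLConfigKey = "collection.ttl.seconds"

// GetCollectionTTLFromMap parses the TTL from collection properties,
// returning defaultTTL (e.g. -1) when the property is absent and an
// explicit error when it is present but malformed. Sketch only: the
// real helper may differ in signature and defaulting behavior.
func GetCollectionTTLFromMap(props map[string]string, defaultTTL int64) (int64, error) {
	v, ok := props[CollectionTTLConfigKey]
	if !ok {
		return defaultTTL, nil
	}
	ttl, err := strconv.ParseInt(v, 10, 64)
	if err != nil {
		// Malformed values surface as errors instead of being silently skipped.
		return 0, fmt.Errorf("invalid %s value %q: %w", CollectionTTLConfigKey, v, err)
	}
	return ttl, nil
}
```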
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
issue: #43897
also for issue: #46166
add an ack_sync_up flag to the broadcast message header, which indicates
whether the broadcast operation needs to be synced up between the
streaming node and the coordinator.
If ack_sync_up is false, the broadcast operation is acked once the
recovery storage sees the message on the current vchannel, so the fast
ack path can be applied to speed up the broadcast operation.
If ack_sync_up is true, the broadcast operation is acked only after the
checkpoint of the current vchannel reaches the current message. The fast
ack path cannot be applied in this case, because the ack needs to be
synced up with the streaming node.
e.g., if the truncate-collection operation wants its ack-once callback
invoked only after all segments are flushed on the current vchannel, it
should set ack_sync_up to true (see the sketch below).
TODO: the current implementation does not guarantee the ack-sync-up
semantics; it only guarantees that the FastAck path will not be applied.
The full ack-sync-up semantics will be implemented in 3.0. This is used
only by the truncate API for now.
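A minimal sketch of how the flag could gate the fast-ack path; the
header type and function here are illustrative, not the actual
streaming-service proto definitions:

```go
package broadcast

// BroadcastHeader is an illustrative stand-in for the real broadcast
// message header; only the field relevant here is shown.
type BroadcastHeader struct {
	BroadcastID uint64
	// AckSyncUp: when true, ack only after the vchannel checkpoint has
	// reached this message; when false, ack as soon as the recovery
	// storage observes the message on the vchannel.
	AckSyncUp bool
}

// canFastAck reports whether the fast-ack optimization may be applied
// to this broadcast message.
func canFastAck(h *BroadcastHeader) bool {
	// Fast ack is only safe when no coordinator/streaming-node sync is
	// required for the ack.
	return !h.AckSyncUp
}
```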
---------
Signed-off-by: chyezh <chyezh@outlook.com>
issue: #41611
- After the streaming architecture is enabled, the channel manager of
datacoord is a redundant component.
---------
Signed-off-by: chyezh <chyezh@outlook.com>
After this PR is merged, insert, upsert, index building, query, and
search are supported on the added field.
These operations are only available on the added field once the
add-field request completes; adding a field is a synchronous operation.
Compaction will be supported in the next PR.
issue: #39718
---------
Signed-off-by: lixinguo <xinguo.li@zilliz.com>
Co-authored-by: lixinguo <xinguo.li@zilliz.com>
issue: #35563
1. Use an internal health checker to monitor the cluster's health state,
storing the latest state on the coordinator node. The CheckHealth
request retrieves the cluster's health from this latest state on the
proxy side, which enhances cluster stability (see the sketch below).
2. Each health check assesses all collections and channels, with
detailed failure messages temporarily saved in the latest state.
3. Use the CheckHealth request instead of the heavy GetMetrics request
on the querynode and datanode.
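A minimal sketch of the checker pattern in item 1, with hypothetical
names (the actual Milvus implementation differs in detail): a background
loop caches the latest result, and CheckHealth just reads the cache
instead of triggering a heavy on-demand scan.

```go
package healthcheck

import (
	"sync"
	"time"
)

// Result is an illustrative health snapshot; the real state also
// carries per-collection/per-channel failure details.
type Result struct {
	IsHealthy bool
	Reasons   []string
}

// Checker periodically evaluates cluster health and stores the latest
// result on the coordinator side.
type Checker struct {
	mu     sync.RWMutex
	latest Result
}

// Start launches the background loop; check assesses all collections
// and channels on each tick.
func (c *Checker) Start(interval time.Duration, check func() Result) {
	go func() {
		ticker := time.NewTicker(interval)
		defer ticker.Stop()
		for range ticker.C {
			r := check()
			c.mu.Lock()
			c.latest = r
			c.mu.Unlock()
		}
	}()
}

// Latest serves CheckHealth requests from the cached state.
func (c *Checker) Latest() Result {
	c.mu.RLock()
	defer c.mu.RUnlock()
	return c.latest
}
```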
Signed-off-by: jaime <yun.zhang@zilliz.com>
Native support for Google Cloud Storage using the Google Cloud Storage
libraries. Authentication is performed using GCS service account
credentials JSON.
Currently, Milvus supports Google Cloud Storage using S3-compatible APIs
via the AWS SDK. This approach has the following limitations:
1. Overhead: Translating requests between S3-compatible APIs and GCS can
introduce additional overhead.
2. Compatibility Limitations: Some features of the original S3 API may
not fully translate or work as expected with GCS.
To address these limitations, this enhancement is needed.
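For illustration, a minimal example of native GCS access with the
official Go client, authenticating with a service-account credentials
JSON file; the bucket, object, and credentials path are placeholders,
and this is not the Milvus chunk-manager code itself:

```go
package main

import (
	"context"
	"fmt"
	"io"
	"log"

	"cloud.google.com/go/storage"
	"google.golang.org/api/option"
)

func main() {
	ctx := context.Background()
	// Authenticate with GCS service-account credentials JSON.
	client, err := storage.NewClient(ctx,
		option.WithCredentialsFile("/path/to/service-account.json"))
	if err != nil {
		log.Fatal(err)
	}
	defer client.Close()

	// Read an object through the native GCS API, with no
	// S3-compatibility translation layer in between.
	rc, err := client.Bucket("my-bucket").Object("my-object").NewReader(ctx)
	if err != nil {
		log.Fatal(err)
	}
	defer rc.Close()

	data, err := io.ReadAll(rc)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("read %d bytes\n", len(data))
}
```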
Related Issue: #36212
issue: #33744
This PR includes the following changes:
1. Added a new task type to the task scheduler in datacoord: stats task,
which sorts segments by primary key.
2. Implemented segment sorting in indexnode (see the sketch below).
3. Added a new field `FieldStatsLog` to SegmentInfo to store token index
information.
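A minimal sketch of the sort step in item 2, assuming an int64 primary
key and illustrative types; the real indexnode implementation works over
binlog columns rather than in-memory row structs:

```go
package stats

import "sort"

// Row is an illustrative record; real segment data lives in binlog
// columns, not in-memory structs.
type Row struct {
	PK     int64 // primary key
	Fields map[string]any
}

// sortSegmentByPK orders a segment's rows by primary key so downstream
// reads can rely on PK order.
func sortSegmentByPK(rows []Row) {
	sort.Slice(rows, func(i, j int) bool {
		return rows[i].PK < rows[j].PK
	})
}
```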
---------
Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
issue: #34545
Print a warning log instead of failing the health check when orphan
channel checkpoint meta is found during a health check request.
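A minimal sketch of the changed behavior, with hypothetical names
(`validateChannelCheckpoints`, `channelExists`) standing in for the real
datacoord meta code:

```go
package datacoord

import "log"

// validateChannelCheckpoints warns about orphan channel checkpoint meta
// instead of failing the health check.
func validateChannelCheckpoints(checkpoints map[string]uint64, channelExists func(string) bool) {
	for ch := range checkpoints {
		if !channelExists(ch) {
			// Previously this condition failed the health check.
			log.Printf("WARN: orphan channel checkpoint meta found, channel=%s", ch)
		}
	}
}
```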
Signed-off-by: jaime <yun.zhang@zilliz.com>
issue: #33005
1. Add a `MemorySize` field for insert binlogs.
2. `LogSize` means the size of the binlog file in object storage.
3. `MemorySize` means the size of the data in memory.
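Illustrative shape only (the real field lives in the proto-generated
Binlog message, which carries many more fields):

```go
package datapb

// Binlog sketches the two size fields side by side.
type Binlog struct {
	LogSize    int64 // size of the binlog file as stored in the object store
	MemorySize int64 // size of the decoded data once loaded in memory
}
```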
---------
Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
Signed-off-by: cai.zhang <cai.zhang@zilliz.com>
Related to https://github.com/milvus-io/milvus/issues/32165
1. Node-ID-based channel store access should use map access instead of
iteration (see the sketch below).
2. The join-ish function calls become slow as the number of
collections/segments grows (e.g., 10k). For example,
getNumRowsOfCollectionUnsafe is O(num_segments), and
GetAllCollectionNumRows is O(num_collections * num_segments).
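A minimal sketch of point 1, with illustrative types (the real
ChannelStore differs): keying the store by node ID makes per-node
lookups O(1) instead of a scan over every entry.

```go
package datacoord

// NodeChannelInfo is an illustrative per-node record.
type NodeChannelInfo struct {
	NodeID   int64
	Channels []string
}

// ChannelStore keys its entries by node ID.
type ChannelStore struct {
	channels map[int64]*NodeChannelInfo
}

// GetNode uses direct map access instead of iterating all nodes.
func (s *ChannelStore) GetNode(nodeID int64) *NodeChannelInfo {
	return s.channels[nodeID]
}
```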
Signed-off-by: yiwangdr <yiwangdr@gmail.com>
issue: #29892
This PR:
1. Moves the gathering of materialized search info to the point where
the search plan is created, before the plan is handed to each segment,
to avoid repeated work and concurrent access to the plan node from
multiple threads (see the sketch below).
2. Restricts the supported MV type to `VARCHAR`.
3. Adds an integration test.
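A minimal sketch of the compute-once pattern in item 1, with
illustrative types (the actual plan structures live in the query path,
not in this form):

```go
package search

// MaterializedViewSearchInfo is an illustrative stand-in for the
// materialized-view search info gathered from the request.
type MaterializedViewSearchInfo struct {
	PartitionKeyField string
	InvolvedValues    []string
}

// SearchPlan carries the info computed once at creation time; segments
// then read it without mutating the shared plan node.
type SearchPlan struct {
	MVInfo *MaterializedViewSearchInfo
}

// NewSearchPlan gathers the MV search info exactly once, instead of
// recomputing (and writing to) the plan node per segment.
func NewSearchPlan(gather func() *MaterializedViewSearchInfo) *SearchPlan {
	return &SearchPlan{MVInfo: gather()}
}
```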
Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>
issue: #31662 #31409
During FilterIndexedSegment in GetRecoveryInfo, the index meta's read
lock is acquired for every segment. When a collection has thousands of
segments, this may block for more than 10 seconds or even longer,
because AddSegmentIndex may also be triggered frequently and tries to
take the write lock.
This PR avoids acquiring the index meta's lock for each segment.
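A minimal sketch of the pattern, with illustrative types: take the read
lock once, snapshot the needed state, then filter segments without
holding the lock, so writers such as AddSegmentIndex are not starved.

```go
package datacoord

import "sync"

type indexMeta struct {
	mu sync.RWMutex
	// segmentID -> has a finished index
	indexed map[int64]bool
}

// snapshotIndexed copies the needed state under a single RLock.
func (m *indexMeta) snapshotIndexed() map[int64]bool {
	m.mu.RLock()
	defer m.mu.RUnlock()
	out := make(map[int64]bool, len(m.indexed))
	for id, ok := range m.indexed {
		out[id] = ok
	}
	return out
}

// filterIndexedSegments no longer takes the lock per segment.
func filterIndexedSegments(m *indexMeta, segmentIDs []int64) []int64 {
	indexed := m.snapshotIndexed() // one lock acquisition in total
	var res []int64
	for _, id := range segmentIDs {
		if indexed[id] {
			res = append(res, id)
		}
	}
	return res
}
```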
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
Patch the search cache param from the index configs when the search
cache size key cannot be found in the index meta.
issue: #30113
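A minimal sketch of the fallback, with a hypothetical key name and
function signature:

```go
package queryutil

// patchSearchCacheParam fills in the search cache size from the index
// configs when the index meta lacks the key; the key name here is
// illustrative.
func patchSearchCacheParam(indexParams map[string]string, configValue string) {
	const searchCacheBudgetKey = "search_cache_budget_gb"
	if _, ok := indexParams[searchCacheBudgetKey]; !ok && configValue != "" {
		indexParams[searchCacheBudgetKey] = configValue
	}
}
```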
Signed-off-by: xianliang.li <xianliang.li@zilliz.com>
A compaction plan result used to contain one segment per plan. Since L0
compaction can write to multiple segments, this PR expands the number of
segments in plan results and refactors some names for readability (see
the sketch below).
Name refactoring:
- CompactionStateResult -> CompactionPlanResult
- CompactionResult -> CompactionSegment
See also: #27606
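Illustrative shapes only (the real messages are proto-generated and
carry more fields); the point is that a plan result now holds a slice of
segments:

```go
package datapb

// CompactionSegment describes one output segment of a compaction
// (formerly CompactionResult).
type CompactionSegment struct {
	SegmentID int64
	NumOfRows int64
	// binlog/deltalog/statslog paths elided
}

// CompactionPlanResult (formerly CompactionStateResult) can now carry
// multiple output segments, as L0 compaction requires.
type CompactionPlanResult struct {
	PlanID   int64
	Segments []*CompactionSegment
}
```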
Signed-off-by: yangxuan <xuan.yang@zilliz.com>