milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-07 19:31:51 +08:00

Author	SHA1	Message	Date
wei liu	975c91df16	feat: Add comprehensive snapshot functionality for collections (#44361 ) issue: #44358 Implement complete snapshot management system including creation, deletion, listing, description, and restoration capabilities across all system components. Key features: - Create snapshots for entire collections - Drop snapshots by name with proper cleanup - List snapshots with collection filtering - Describe snapshot details and metadata Components added/modified: - Client SDK with full snapshot API support and options - DataCoord snapshot service with metadata management - Proxy layer with task-based snapshot operations - Protocol buffer definitions for snapshot RPCs - Comprehensive unit tests with mockey framework - Integration tests for end-to-end validation Technical implementation: - Snapshot metadata storage in etcd with proper indexing - File-based snapshot data persistence in object storage - Garbage collection integration for snapshot cleanup - Error handling and validation across all operations - Thread-safe operations with proper locking mechanisms <!-- This is an auto-generated comment: release notes by coderabbit.ai --> - Core invariant/assumption: snapshots are immutable point‑in‑time captures identified by (collection, snapshot name/ID); etcd snapshot metadata is authoritative for lifecycle (PENDING → COMMITTED → DELETING) and per‑segment manifests live in object storage (Avro / StorageV2). GC and restore logic must see snapshotRefIndex loaded (snapshotMeta.IsRefIndexLoaded) before reclaiming or relying on segment/index files. - New capability added: full end‑to‑end snapshot subsystem — client SDK APIs (Create/Drop/List/Describe/Restore + restore job queries), DataCoord SnapshotWriter/Reader (Avro + StorageV2 manifests), snapshotMeta in meta, SnapshotManager orchestration (create/drop/describe/list/restore), copy‑segment restore tasks/inspector/checker, proxy & RPC surface, GC integration, and docs/tests — enabling point‑in‑time collection snapshots persisted to object storage and restorations orchestrated across components. - Logic removed/simplified and why: duplicated recursive compaction/delta‑log traversal and ad‑hoc lookup code were consolidated behind two focused APIs/owners (Handler.GetDeltaLogFromCompactTo for delta traversal and SnapshotManager/SnapshotReader for snapshot I/O). MixCoord/coordinator broker paths were converted to thin RPC proxies. This eliminates multiple implementations of the same traversal/lookup, reducing divergence and simplifying responsibility boundaries. - Why this does NOT introduce data loss or regressions: snapshot create/drop use explicit two‑phase semantics (PENDING → COMMIT/DELETING) with SnapshotWriter writing manifests and metadata before commit; GC uses snapshotRefIndex guards and IsRefIndexLoaded/GetSnapshotBySegment/GetSnapshotByIndex checks to avoid removing referenced files; restore flow pre‑allocates job IDs, validates resources (partitions/indexes), performs rollback on failure (rollbackRestoreSnapshot), and converts/updates segment/index metadata only after successful copy tasks. Extensive unit and integration tests exercise pending/deleting/GC/restore/error paths to ensure idempotence and protection against premature deletion. <!-- end of auto-generated comment: release notes by coderabbit.ai --> --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2026-01-06 10:15:24 +08:00
sijie-ni-0214	f51de1a8ab	feat: support TruncateCollection api to clear collection data (#46167 ) issue: https://github.com/milvus-io/milvus/issues/46166 --------- Signed-off-by: sijie-ni-0214 <sijie.ni@zilliz.com>	2025-12-12 10:31:14 +08:00
aoiasd	354ab2f55e	enhance: sync file resource to querynode and datanode (#44480 ) relate:https://github.com/milvus-io/milvus/issues/43687 Support use file resource with sync mode. Auto download or remove file resource to local when user add or remove file resource. Sync file resource to node when find new node session. --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-12-04 16:23:11 +08:00
congqixia	569a5b40d2	enhance: [StorageV2] add manifest path support for FFI integration (#44991 ) Related to #44956 Add manifest_path field throughout the data path to support LOON Storage V2 manifest tracking. The manifest stores metadata for segment data files and enables the unified Storage V2 FFI interface. Changes include: - Add manifest_path field to SegmentInfo and SaveBinlogPathsRequest proto messages - Add UpdateManifest operator to datacoord meta operations - Update metacache, sync manager, and meta writer to propagate manifest paths - Include manifest_path in segment load info for query coordinator This is part of the Storage V2 FFI interface integration. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-27 19:24:10 +08:00
wei liu	d84c4c580a	enhance: [DataCoord] Remove full-collection index work from metrics (#43859 ) issue: #43858 - Remove full-collection index handling in getCollectionMetrics - Avoid heavy metadata scans and RPC calls during metrics - Reduce latency and CPU/memory usage on large datasets - No functional change to metrics semantics Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-08-29 12:05:50 +08:00
aoiasd	eca51ed2c6	enhance: add file resource api (#43766 ) relate: https://github.com/milvus-io/milvus/issues/43687 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-08-08 14:17:41 +08:00
Zhen Ye	cd38d65417	fix: make savebinlogpath idompotent at binlog level (#43615 ) issue: #43574 - update all binlog every time when calling udpate savebinlogpath. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-07-29 19:47:36 +08:00
cai.zhang	74c08069ef	fix: Set result storage version for sort compaction (#43521 ) issue: #43520 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-23 19:04:53 +08:00
cai.zhang	c54a04c71c	fix: L2 segments remain as L2 even after sort compaction (#43237 ) issue: #43186 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2025-07-11 11:30:48 +08:00
Xianhui Lin	a72492169f	feat: add NotifyDropPartition in mixcoord for droppartition in dc (#42029 ) add NotifyDropPartition in mixcoord for droppartition in dc issue:https://github.com/milvus-io/milvus/issues/41976 https://github.com/milvus-io/milvus/issues/41542 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-05-23 18:32:26 +08:00
yihao.dai	f65e6b7c6e	enhance: Optimize datacoord meta mutex (#40552 ) Use a separate collection mutex. issue: https://github.com/milvus-io/milvus/issues/40551 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-03-25 13:46:25 +08:00
XuanYang-cn	4bebca6416	enhance: Replace currRows with NumOfRows (#40074 ) See also: #40068 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2025-03-10 12:16:03 +08:00
congqixia	cb7f2fa6fd	enhance: Use v2 package name for pkg module (#39990 ) Related to #39095 https://go.dev/doc/modules/version-numbers Update pkg version according to golang dep version convention --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-22 23:15:58 +08:00
yihao.dai	38f813bed3	enhance: Read metadata concurrently to accelerate recovery (#38403 ) Read metadata such as segments, binlogs, and partitions concurrently at the collection level. issue: https://github.com/milvus-io/milvus/issues/37630 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-01-23 14:27:27 +08:00
aoiasd	9cb4c4e8ac	fix: bm25 import segment without bm25 stats meta (#38855 ) relate: https://github.com/milvus-io/milvus/issues/38854 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-01-21 11:09:04 +08:00
yihao.dai	272d95ad79	enhance: Reduce mutex contention in datacoord meta (#38219 ) 1. Using secondary index to avoid retrieving all segments at `GetSegmentsChanPart`. 2. Perform batch SetAllocations to reduce the number of times the meta lock is acquired. issue: https://github.com/milvus-io/milvus/issues/37630 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-01-15 01:15:02 +08:00
Zhen Ye	bb8d1ab3bf	enhance: make new go package to manage proto (#39114 ) issue: #39095 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-01-10 10:49:01 +08:00
tinswzy	7944538ade	enhance: Add ctx param to KV operation interfaces (#38154 ) issue: #35917 Refine KV operation interfaces by adding a ctx param Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2024-12-05 15:16:41 +08:00
tinswzy	1dbb6cd7cb	enhance: refine the datacoord meta related interfaces (#37957 ) issue: #35917 This PR refines the meta-related APIs in datacoord to allow the ctx to be passed down to the catalog operation interfaces Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2024-11-26 19:46:34 +08:00
jaime	7bbfe86bcd	enhance: add list index and segment index retrieval API for WebUI (#37861 ) issue: #36621 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-11-22 16:58:34 +08:00
XuanYang-cn	5e6c3df253	fix: l0RowCount metrics value always empty (#37306 ) See also: #36953 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-11-11 17:00:27 +08:00
Zhen Ye	49657c4690	enhance: add create segment message, enable empty segment flush (#37407 ) issue: #37172 - add redo interceptor to implement append context refresh. (make new timetick) - add create segment handler for flusher. - make empty segment flushable and directly change it into dropped. - add create segment message into wal when creating new growing segment. - make the insert operation into following seq: createSegment -> insert -> insert -> flushSegment. - make manual flush into following seq: flushTs -> flushsegment -> flushsegment -> manualflush. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-11-08 10:16:34 +08:00
jaime	f348bd9441	feat: add segment,pipeline, replica and resourcegroup api for WebUI (#37344 ) issue: #36621 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-11-07 11:52:25 +08:00
XuanYang-cn	51ed2a61c8	fix: Correct dropped segment num metrics (#37410 ) See also: #31891 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-11-07 11:16:24 +08:00
cai.zhang	ecb2b242e2	enhance: Add sorted for segment info (#36469 ) issue: #33744 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-09-30 10:01:16 +08:00
congqixia	d2c774fb6d	fix: Return all compactTo segments after support split (#36361 ) Related to #36360 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-20 14:11:11 +08:00
aoiasd	139787371e	feat: support embedding bm25 sparse vector and flush bm25 stats log (#36036 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-09-19 10:57:12 +08:00
jaime	22cce44afc	fix: metrics stored_index_files_size is never cleared (#36160 ) issue: #36159 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-09-13 20:09:15 +08:00
cai.zhang	2c9bb4dfa3	feat: Support stats task to sort segment by PK (#35054 ) issue: #33744 This PR includes the following changes: 1. Added a new task type to the task scheduler in datacoord: stats task, which sorts segments by primary key. 2. Implemented segment sorting in indexnode. 3. Added a new field `FieldStatsLog` to SegmentInfo to store token index information. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-09-02 14:19:03 +08:00
congqixia	c992a61a23	enhance: Separate allocator pkg in datacoord (#35622 ) Related to #28861 Move allocator interface and implementation into separate package. Also update some unittest logic. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-22 10:06:56 +08:00
wei liu	c45f38aa61	enhance: Update protobuf-go to protobuf-go v2 (#34394 ) issue: #34252 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-29 11:31:51 +08:00
congqixia	67324eb809	enhance: Add l0 segment entry num quota (#34733 ) See also #34670 This PR add quota configuration for l0 segment entry number per collection. If l0 compaction cannot keep up the insertion/upsertion rate, this feature could back press the related rate. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-17 17:35:41 +08:00
jaime	9630974fbb	enhance: move rocksmq from internal to pkg module (#33881 ) issue: #33956 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-06-25 21:18:15 +08:00
zhenshan.cao	d18c49013b	enhance: Refine compaction (#33982 ) issue : https://github.com/milvus-io/milvus/issues/32939 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-06-25 10:08:03 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
smellthemoon	c61fb1eff5	enhance: do check when add not empty logpath (#33640 ) meta only store logid Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-06-07 10:19:51 +08:00
cai.zhang	27cc9f2630	enhance: Support analyze data (#33651 ) issue: #30633 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Co-authored-by: chasingegg <chao.gao@zilliz.com>	2024-06-06 17:37:51 +08:00
yihao.dai	35532a3e7d	fix: Fill stats log id and check validity (#33477 ) 1. Fill log ID of stats log from import 2. Add a check to validate the log ID before writing to meta issue: https://github.com/milvus-io/milvus/issues/33476 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-05 11:17:56 +08:00
zhenshan.cao	ac4f3997ce	enhance: Reconstructing Compaction to possess persistence capability (#33265 ) issue #33586 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-06-05 10:17:50 +08:00
jaime	3d29907b6e	enhance: decrease cpu overhead during filter segments on datacoord (#33130 ) issue: #33129 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-05-28 19:17:43 +08:00
SimFG	2453181218	fix: not found database name in the datacoord meta object (#33411 ) - issue: #33410 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-28 10:09:48 +08:00
jaime	ba625835bc	enhance: Add metrics for segment index files size (#32979 ) issue:#32980 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-05-13 17:59:32 +08:00
SimFG	c012e6786f	feat: support rate limiter based on db and partition levels (#31070 ) issue: https://github.com/milvus-io/milvus/issues/30577 co-author: @jaime0815 --------- Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com> Signed-off-by: SimFG <bang.fu@zilliz.com> Co-authored-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-04-12 16:01:19 +08:00
jaime	d4fd6c7283	enhance: add db label on binlog size metrics (#32003 ) Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-04-10 21:01:20 +08:00
yihao.dai	0fe5e90e8b	enhance: Remove import v1 (#31403 ) Remove all code and logic related to import v1. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-22 15:29:09 +08:00
aoiasd	0c153a5820	enhance: Rename update segment operator (#31121 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-03-20 17:53:14 +08:00
yihao.dai	c411cb4a49	enhance: Prevent the backlog of channelCP update tasks, perform batch updates of channelCPs (#30941 ) This PR includes the following adjustments: 1. To prevent channelCP update task backlog, only one task with the same vchannel is retained in the updater. Additionally, the lastUpdateTime is refreshed after the flowgraph submits the update task, rather than in the callBack function. 2. Batch updates of multiple vchannel checkpoints are performed in the UpdateChannelCheckpoint RPC (default batch size is 128). Additionally, the lock for channelCPs in DataCoord meta has been switched from key lock to global lock. 3. The concurrency of UpdateChannelCheckpoint RPCs in the datanode has been reduced from 1000 to 10. issue: https://github.com/milvus-io/milvus/issues/30004 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com> Co-authored-by: jaime <yun.zhang@zilliz.com> Co-authored-by: congqixia <congqi.xia@zilliz.com>	2024-03-07 20:39:02 +08:00
chyezh	8f7019468f	fix: starve lock caused by slow GetCompactionTo method when too much segments (#30963 ) issue: #30823 Signed-off-by: chyezh <chyezh@outlook.com>	2024-03-05 10:04:59 +08:00
jaime	4b0c3dd377	enhance: index meta use independent rather than global meta lock (#30869 ) issue: https://github.com/milvus-io/milvus/issues/30837 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-03-04 16:56:59 +08:00
yihao.dai	a434d33e75	feat: Add import scheduler and manager (#29367 ) This PR introduces novel managerial roles for importv2: 1. ImportMeta: To manage all the import tasks; 2. ImportScheduler: To process tasks and modify their states; 3. ImportChecker: To ascertain the completion of all tasks and instigate relevant operations. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-01 18:31:02 +08:00

1 2 3

138 Commits