The import is dependent on syncTask, which in turn relies on the
allocator. This PR pre-allocates the necessary IDs for the import syncTask.
issue: https://github.com/milvus-io/milvus/issues/33957
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
Check if the segment exists during FlushSegments and add some key logs
in the write path.
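A minimal sketch of that guard, with hypothetical `metaCache` / `flushSegments` names rather than the exact datanode API:

```go
// Hypothetical sketch: skip and log segments that no longer exist in the
// local meta cache instead of failing the whole FlushSegments call.
package flushguard

import "log"

type metaCache interface {
    HasSegment(segmentID int64) bool
}

func flushSegments(cache metaCache, segmentIDs []int64) []int64 {
    flushable := make([]int64, 0, len(segmentIDs))
    for _, id := range segmentIDs {
        if !cache.HasSegment(id) {
            // Key log in the write path: the segment may have been
            // compacted or dropped before the flush request arrived.
            log.Printf("skip flush, segment not found, segmentID=%d", id)
            continue
        }
        flushable = append(flushable, id)
    }
    return flushable
}
```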
issue: https://github.com/milvus-io/milvus/issues/34255
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
Related to #33716
This PR adds a context parameter to the SyncTask.Run execution functions to
make them cancellable from the caller.
This makes it possible to cancel tasks when the datanode / data sync service
is being shut down.
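A minimal sketch of the cancellable shape, assuming a simplified `SyncTask` rather than the real task type:

```go
package synctask

import (
    "context"
    "fmt"
)

// SyncTask is a stand-in for the real task type.
type SyncTask struct {
    segmentID int64
}

// Run checks the caller's context so the task can be aborted while the
// datanode / data sync service shuts down.
func (t *SyncTask) Run(ctx context.Context) error {
    select {
    case <-ctx.Done():
        return fmt.Errorf("sync task for segment %d cancelled: %w", t.segmentID, ctx.Err())
    default:
    }
    // ... serialize and upload binlogs here, re-checking ctx between steps ...
    return nil
}
```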
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
1. Remove the `compactTo` field in `SegmentInfo`.
2. Remove the target-segment-not-match check and its retry logic in
`SyncManager`.
issue: https://github.com/milvus-io/milvus/issues/32809
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
See also #33561
This PR:
- Uses zero copy when buffering insert messages
- Makes `storage.InsertCodec` support serializing multiple insert data
chunks into the same batch of binlog files
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Related to #27675
Store a pk-to-minimal-timestamp mapping in `inData` instead of a bloom
filter to check whether a delete entry hits the current insert batch.
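A rough sketch of the idea, with a hypothetical `inData` layout (not the exact datanode struct):

```go
package inbuffer

// inData keeps, per primary key, the minimal insert timestamp seen in the
// currently buffered batch (instead of a per-batch bloom filter).
type inData struct {
    pkMinTs map[int64]uint64
}

func newInData() *inData {
    return &inData{pkMinTs: make(map[int64]uint64)}
}

// observeInsert records the smallest timestamp per pk while buffering inserts.
func (d *inData) observeInsert(pk int64, ts uint64) {
    if cur, ok := d.pkMinTs[pk]; !ok || ts < cur {
        d.pkMinTs[pk] = ts
    }
}

// pkExists reports whether a delete entry can hit this batch: the pk must be
// present and the delete timestamp must be newer than its earliest insert.
func (d *inData) pkExists(pk int64, deleteTs uint64) bool {
    minTs, ok := d.pkMinTs[pk]
    return ok && deleteTs > minTs
}
```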
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
issue: #32995
To speed up the construction and querying of Bloom filters, we chose a
blocked Bloom filter instead of a basic Bloom filter implementation.
WARN: This PR is compatible with the old BF implementation, but if you fall
back to an older Milvus version, bloom filter deserialization may fail.
In single Bloom filter test cases with a capacity of 1,000,000 and a
false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times
faster than the basic Bloom filter in both querying and construction, at
the cost of a 30% increase in memory usage.
- Block BF construct time {"time": "54.128131ms"}
- Block BF size {"size": 3021578}
- Block BF Test cost {"time": "55.407352ms"}
- Basic BF construct time {"time": "210.262183ms"}
- Basic BF size {"size": 2396308}
- Basic BF Test cost {"time": "192.596229ms"}
In multi Bloom filter test cases with a capacity of 100,000, an FPR of
0.001, and 100 Bloom filters, we reuse the primary key locations for all
Bloom filters to avoid repeated hash computations. As a result, the
blocked Bloom filter is also 5 times faster than the basic Bloom filter
in querying.
- Block BF TestLocation cost {"time": "529.97183ms"}
- Basic BF TestLocation cost {"time": "3.197430181s"}
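A sketch of the location-reuse trick for the basic implementation, assuming `github.com/bits-and-blooms/bloom/v3` and that every filter was built with the same `k`; the blocked variant follows the same pattern with a blocked-bloom library:

```go
// Compute the hash locations of a key once, then probe every filter with
// them instead of re-hashing the key for each filter.
package main

import (
    "fmt"

    "github.com/bits-and-blooms/bloom/v3"
)

func main() {
    const numFilters = 100
    filters := make([]*bloom.BloomFilter, numFilters)
    for i := range filters {
        filters[i] = bloom.NewWithEstimates(100_000, 0.001)
    }

    pk := []byte("primary-key-42")
    filters[0].Add(pk)

    // One hash computation, numFilters probes.
    locations := bloom.Locations(pk, filters[0].K())
    hits := 0
    for _, f := range filters {
        if f.TestLocations(locations) {
            hits++
        }
    }
    fmt.Println("filters possibly containing pk:", hits)
}
```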
---------
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
See also #32165
There were some frequent scans in metacache:
- Listing all segments whose start positions are not synced
- Listing compacted segments
Those scans cost a lot of CPU time when the number of flushed segments is
large, while `Flushed` segments can be skipped in both scenarios.
This PR (see the sketch after this list):
- Adds a segment state shortcut in metacache
- Lists start positions only for states before `Flushed`
- Sets compacted segments' state to `Dropped` and uses the `Dropped` state
while scanning them
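A hypothetical sketch of such a state shortcut (illustrative names, not the real metacache types):

```go
// Keep segment IDs indexed by state so scans can skip Flushed/Dropped
// segments without walking the whole segment map.
package metacache

type SegmentState int

const (
    Growing SegmentState = iota
    Sealed
    Flushing
    Flushed
    Dropped
)

type shortcut struct {
    byState map[SegmentState]map[int64]struct{}
}

// segmentsBeforeFlushed lists only segments whose state precedes Flushed,
// e.g. for the "start position not synced" scan.
func (s *shortcut) segmentsBeforeFlushed() []int64 {
    var ids []int64
    for state, set := range s.byState {
        if state >= Flushed {
            continue
        }
        for id := range set {
            ids = append(ids, id)
        }
    }
    return ids
}
```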
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #33266
Each `WriteBuffer` has the same channel/collection id attributes, so sharing
a single logger suffices and reduces logger allocation & frequent name
composition.
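A sketch of building the scoped logger once, illustrated with zap directly (the real code may go through the project's log wrapper):

```go
package writebuffer

import "go.uber.org/zap"

type writeBufferBase struct {
    channelName  string
    collectionID int64
    logger       *zap.Logger
}

func newWriteBufferBase(channel string, collectionID int64) *writeBufferBase {
    return &writeBufferBase{
        channelName:  channel,
        collectionID: collectionID,
        // Compose the channel/collection fields once instead of on every
        // log call inside the write buffer.
        logger: zap.L().With(
            zap.String("channel", channel),
            zap.Int64("collectionID", collectionID),
        ),
    }
}
```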
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Related to #27675
Use `struct{}` instead of `error` as the sync task future result type to
reduce the result size and prevent logic errors.
Also change some unused parameters to `_` to suppress lint warnings.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675, #26177
Make the memory check evict memory buffers until the memory water level is safe.
Also make `EvictBuffer` wait until the sync task is done.
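A hypothetical sketch of the eviction loop, with illustrative `buffer` / `evictUntilSafe` names:

```go
package bufmanager

import "context"

type buffer interface {
    MemorySize() int64
    // EvictBuffer triggers a sync of the buffered data and blocks until the
    // sync task is done, so the released memory is actually accounted for.
    EvictBuffer(ctx context.Context) error
}

// evictUntilSafe keeps syncing the largest buffers until memory usage drops
// below the configured high water level.
func evictUntilSafe(ctx context.Context, buffers []buffer, usedMemory func() int64, highWaterLevel int64) error {
    for usedMemory() > highWaterLevel && len(buffers) > 0 {
        // Pick the largest buffer to free the most memory per sync.
        largest := 0
        for i, b := range buffers {
            if b.MemorySize() > buffers[largest].MemorySize() {
                largest = i
            }
        }
        if err := buffers[largest].EvictBuffer(ctx); err != nil {
            return err
        }
        buffers = append(buffers[:largest], buffers[largest+1:]...)
    }
    return nil
}
```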
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR decouples importing segments from the flush process by:
1. Excluding the importing segment from the flush policy; this approach
avoids notifying the datanode to flush the importing segment, which may
not exist.
2. When RootCoord calls Flush, DataCoord directly sets the importing
segment state to `Flushed`.
issue: https://github.com/milvus-io/milvus/issues/30359
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
See also #27675
This PR adds back the MemoryHighSyncPolicy implementation. It also changes
MinSegmentSize & CheckInterval into configurable param items.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #30111
By design, segments can be marked "Flushed" only by the `FlushSegments` grpc
call from datacoord. There are two possible reasons one segment got flushed
multiple times:
- The segment is in flushing state during multiple epochs in the flowgraph
- The segment is flushed by both flushTs & FlushSegments
So this PR fixes it by (see the sketch after this list):
- Removing the state change logic from the FlushTs policy
- Changing segment flush into a three-stage flow, Sealed->Flushing->Flushed,
preventing multiple Flushed=true operations.
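A minimal sketch of the three-stage transition under the stated assumption that `Flushed` is reachable only from `Flushing`:

```go
package segmentstate

import "fmt"

type State int

const (
    Growing State = iota
    Sealed
    Flushing
    Flushed
)

// allowed records the single legal successor of each state, so a second
// "mark flushed" request cannot transition an already Flushed segment.
var allowed = map[State]State{
    Growing:  Sealed,
    Sealed:   Flushing,
    Flushing: Flushed,
}

func transition(cur, next State) (State, error) {
    if allowed[cur] != next {
        return cur, fmt.Errorf("illegal transition %v -> %v", cur, next)
    }
    return next, nil
}
```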
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also: #30121, #27675
This PR changes the delete buffering logic (see the sketch after this list):
- The write buffer buffers inserts first
- Then the delete messages are evaluated against:
  - Whether the PK matches the previous Bloom filter, whose ts is always smaller
  - Whether the PK matches buffered insert data with a smaller timestamp
- Then the segment bloom filter is updated with the newly buffered pk rows
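A hypothetical sketch of that evaluation order (illustrative interfaces, not the actual write buffer types):

```go
package writebuffer

type pkStats interface {
    PkExists(pk int64) bool    // historical data, ts always smaller
    UpdatePKRange(pks []int64) // fold newly buffered pks in
}

type insertBatch interface {
    MinTimestampOf(pk int64) (uint64, bool)
    PKs() []int64
}

// bufferDeletes evaluates deletes against historical data first, then the
// in-flight insert batch, and only afterwards updates the bloom filter.
func bufferDeletes(stats pkStats, batch insertBatch, deletePKs []int64, deleteTss []uint64) []int64 {
    hits := make([]int64, 0, len(deletePKs))
    for i, pk := range deletePKs {
        if stats.PkExists(pk) {
            hits = append(hits, pk)
            continue
        }
        if minTs, ok := batch.MinTimestampOf(pk); ok && minTs < deleteTss[i] {
            hits = append(hits, pk)
        }
    }
    // Update the segment bloom filter only after the deletes were evaluated.
    stats.UpdatePKRange(batch.PKs())
    return hits
}
```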
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also: #27675
The bloom filter set initialized new BFs with a fixed, configured `n`. This
value is always larger than the actual batch size, so the generated BFs use
more memory than needed.
This PR makes the write buffer initialize BFs with a batch size estimated
from the schema & configuration values.
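A rough sketch of sizing the BF from the estimated batch size, assuming `github.com/bits-and-blooms/bloom/v3` for the basic filter and hypothetical helper names:

```go
package pkstats

import "github.com/bits-and-blooms/bloom/v3"

// estimatedBatchPKCount derives the expected rows per sync batch from the
// buffer size limit and the per-row width estimated from the schema.
func estimatedBatchPKCount(bufferSizeBytes, estimatedRowSizeBytes int64) uint {
    if estimatedRowSizeBytes <= 0 {
        return 0
    }
    return uint(bufferSizeBytes / estimatedRowSizeBytes)
}

func newBatchBloomFilter(bufferSizeBytes, rowSizeBytes int64, maxFalsePositive float64) *bloom.BloomFilter {
    n := estimatedBatchPKCount(bufferSizeBytes, rowSizeBytes)
    if n == 0 {
        n = 1024 // conservative fallback, arbitrary for this sketch
    }
    return bloom.NewWithEstimates(n, maxFalsePositive)
}
```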
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
Fix a logic problem introduced by #29413: the serializer tries to merge the
statslog list while level zero segments do not have statslogs, which results
in an error. `writeBufferBase` ignores this error, but it shall only ignore
`ErrSegmentNotFound`.
This PR adds a segment level check before merging the statslog list, and
adds an error type check for getSyncTask failures.
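A hypothetical sketch of the two guards (illustrative names and an assumed string level field, not the real serializer code):

```go
package serializer

import "errors"

var ErrSegmentNotFound = errors.New("segment not found")

type segment struct {
    level string // e.g. "L0", "L1"
}

// mergedStatslog skips the merge entirely for level zero segments, which
// carry no statslog.
func mergedStatslog(seg segment, merge func() error) error {
    if seg.level == "L0" {
        return nil
    }
    return merge()
}

// handleGetSyncTaskErr only swallows the "segment not found" case; any other
// failure is surfaced to the caller.
func handleGetSyncTaskErr(err error) error {
    if errors.Is(err, ErrSegmentNotFound) {
        return nil
    }
    return err
}
```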
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
For now, level zero segments shall always be synced as `Flushed` ones.
This PR fixes the case where level zero segments selected by policies other
than the flush ts policy would be synced in growing state.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
Since serializing the segment buffer is not related to the sync manager, it
can be done before submitting the task into the sync manager. This way the
pk statistic files are more accurate and complex logic inside the sync
manager is reduced.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Related to #27675
The timestamp from/to fields are not filled in the new implementation of the
writebuffer & sync manager.
This PR fills these fields for better log information.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Related to #28736, #28748
See also #27675
Previous PR: #28646
This PR fixes the `SegmentNotFound` issue when compaction happens multiple
times and the buffer of a first-generation segment is synced due to the
stale policy.
Now the `CompactSegments` API of metacache shall update the compactTo
field of segmentInfo if the compactTo segment is also compacted, to keep
the bloodline clean.
Also, add the `CompactedSegment` SyncPolicy to sync the compacted
segments asap and keep metacache clean.
Now `SyncPolicy` is an interface instead of a function type so that
when it selects some segments to sync, we could log the reason and the
target segments.
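A rough shape of such an interface, with illustrative names rather than the exact datanode definitions:

```go
package syncpolicy

import "log"

type SegmentInfo struct {
    ID int64
}

// SyncPolicy selects segments to sync and can explain why, so the caller
// can log the reason together with the selected segments.
type SyncPolicy interface {
    SelectSegments(buffered []*SegmentInfo, ts uint64) []int64
    Reason() string
}

func selectAndLog(policies []SyncPolicy, buffered []*SegmentInfo, ts uint64) []int64 {
    var result []int64
    for _, p := range policies {
        segments := p.SelectSegments(buffered, ts)
        if len(segments) > 0 {
            log.Printf("sync policy selected segments, reason=%q, segments=%v", p.Reason(), segments)
            result = append(result, segments...)
        }
    }
    return result
}
```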
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
Compacted segment info shall be removed after all buffers belonging to it
are synced.
This PR adds the cleanup function after the triggerSyncTask logic (see the
sketch after this list):
- The buffer is stable and protected by a mutex
- Cleanup fetches compacted & non-syncing segments
- Segment info is removed only when no buffer for it is maintained in the
manager
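A hypothetical sketch of that cleanup step, with illustrative interfaces:

```go
package writebuffer

type metaCache interface {
    GetCompactedSegmentIDs() []int64
    RemoveSegments(segmentIDs ...int64)
}

type bufferSet interface {
    HasBuffer(segmentID int64) bool
}

// cleanupCompactedSegments runs after triggerSyncTask, under the same buffer
// mutex: compacted segments are dropped from the meta cache only when no
// in-memory buffer still references them.
func cleanupCompactedSegments(cache metaCache, buffers bufferSet) {
    var removable []int64
    for _, id := range cache.GetCompactedSegmentIDs() {
        if buffers.HasBuffer(id) {
            continue
        }
        removable = append(removable, id)
    }
    if len(removable) > 0 {
        cache.RemoveSegments(removable...)
    }
}
```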
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
- Fix the issue that LevelZero segments cannot be flushed
- Add a level option for syncTask
- Invoke `AddSegment` when a new LevelZero segment is allocated
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR removes all the commented code and files from PR #28320.
For naming issues:
- Rename `MinCheckpoint` to `EarliestPosition`, see the #28320 comment
- Rename `writebuffer.Manager` to `BufferManager`, see the #27874
comment
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
This PR makes the previously merged datanode refactoring go online:
- Use the write node to replace the insert/delete nodes
- Use the write buffer manager to control all buffers
- Use the sync manager to control sync tasks instead of the flush manager
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>