milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-07 19:31:51 +08:00

Author	SHA1	Message	Date
yihao.dai	b18ebd9468	enhance: Remove legacy cdc/replication (#46603 ) issue: https://github.com/milvus-io/milvus/issues/44123 <!-- This is an auto-generated comment: release notes by coderabbit.ai --> - Core invariant: legacy in-cluster CDC/replication plumbing (ReplicateMsg types, ReplicateID-based guards and flags) is obsolete — the system relies on standard msgstream positions, subPos/end-ts semantics and timetick ordering as the single source of truth for message ordering and skipping, so replication-specific channels/types/guards can be removed safely. - Removed/simplified logic (what and why): removed replication feature flags and params (ReplicateMsgChannel, TTMsgEnabled, CollectionReplicateEnable), ReplicateMsg type and its tests, ReplicateID constants/helpers and MergeProperties hooks, ReplicateConfig and its propagation (streamPipeline, StreamConfig, dispatcher, target), replicate-aware dispatcher/pipeline branches, and replicate-mode pre-checks/timestamp-allocation in proxy tasks — these implemented a redundant alternate “replicate-mode” pathway that duplicated position/end-ts and timetick logic. - Why this does NOT cause data loss or regression (concrete code paths): no persistence or core write paths were removed — proxy PreExecute flows (internal/proxy/task_*.go) still perform the same schema/ID/size validations and then follow the normal non-replicate execution path; dispatcher and pipeline continue to use position/subPos and pullback/end-ts in Seek/grouping (pkg/mq/msgdispatcher/dispatcher.go, internal/util/pipeline/stream_pipeline.go), so skipping and ordering behavior remains unchanged; timetick emission in rootcoord (sendMinDdlTsAsTt) is now ungated (no silent suppression), preserving or increasing timetick delivery rather than removing it. - PR type and net effect: Enhancement/Refactor — removes deprecated replication API surface (types, helpers, config, tests) and replication branches, simplifies public APIs and constructor signatures, and reduces surface area for future maintenance while keeping DML/DDL persistence, ordering, and seek semantics intact. <!-- end of auto-generated comment: release notes by coderabbit.ai --> --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-12-30 14:53:21 +08:00
yihao.dai	b7761d67a3	enhance: Enhance logs for proxy and rootcoord meta table (#46652 ) issue: https://github.com/milvus-io/milvus/issues/46651 <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Enhancement: Add Context-Aware Logging for Proxy and RootCoord Meta Table Operations Core Invariant: All changes maintain existing cache behavior and state transition logic by purely enhancing observability through context-aware logging without modifying control flow, return values, or data structures. Logic Simplified Without Regression: - Removed internal helper method `getFullCollectionInfo` from MetaCache by inlining its logic directly into GetCollectionInfo, eliminating an unnecessary abstraction layer while preserving the exact same cache-hit/miss and fetch-or-update paths - This consolidation has no impact on behavior because the helper was only called from one location and the inlined logic executes identically Enhanced Logging for Observability (No Behavior Changes): - Added context-aware logging (log.Ctx(ctx)) to cache miss scenarios and timestamp comparisons in proxy MetaCache, enabling request tracing without altering cache lookup logic - Expanded RootCoord MetaTable's internal helper method signatures to propagate context for contextual logging across collection lifecycle events (begin truncate, update state, remove names/aliases, delete from collections map), while keeping all call sites and state transitions unchanged - Enhanced DescribeCollection logging in proxy to capture request scope (role, database, collection IDs, timestamp) and response schema at operation boundaries Type: Enhancement focused on improved observability. All modifications are strictly additive logging; no data structures, caching strategies, or core logic paths were altered. <!-- end of auto-generated comment: release notes by coderabbit.ai --> Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2025-12-30 14:41:20 +08:00
aoiasd	90809d1d86	fix: highlight with multi analyzer failed (#46527 ) relate: https://github.com/milvus-io/milvus/issues/46498 <!-- This is an auto-generated comment: release notes by coderabbit.ai --> - Core invariant: text fields configured with multi_analyzer_params must include a "by_field" string that names another field containing per-row analyzer choices; schemaInfo.GetMultiAnalyzerNameFieldID caches and returns the dependent field ID (or 0 if none) and relies on that mapping to make per-row analyzer names available to the highlighter. - What changed / simplified: the highlighter is now schema-aware — addTaskWithSearchText accepts *schemaInfo and uses GetMultiAnalyzerNameFieldID to resolve the analyzer-name field; resolution and caching moved into schemaInfo.multiAnalyzerFieldMap (meta_cache.go), eliminating ad-hoc/typeutil-only lookups and duplicated logic; GetMultiAnalyzerParams now gates on EnableAnalyzer(), centralizing analyzer enablement checks. - Why this fixes the bug (root cause): fixes #46498 — previously the highlighter failed when the analyzer-by-field was not in output_fields. The change (1) populates task.AnalyzerNames (defaulting missing names to "default") when multi-analyzer is configured and (2) appends the analyzer-name field ID to LexicalHighlighter.extraFields so FieldIDs includes it; the operator then requests the analyzer-name column at search time, ensuring per-row analyzer selection is available for highlighting. - No data-loss or regression: when no multi-analyzer is configured GetMultiAnalyzerNameFieldID returns 0 and behavior is unchanged; the patch only adds the analyzer-name field to requested output IDs (no mutation of stored data). Error handling on malformed params is preserved (errors are returned instead of silently changing data), and single-analyzer behavior remains untouched. <!-- end of auto-generated comment: release notes by coderabbit.ai --> Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2025-12-30 11:55:21 +08:00
Zhen Ye	1cd0ef943e	fix: use latest timetick to expire cache (#45717 ) issue: #45697 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-11-20 21:39:04 +08:00
Zhen Ye	87f9a79a6a	fix: inconsistent proxy cache when multiple DDL is executing with DML (#45698 ) issue: #45697 Signed-off-by: chyezh <chyezh@outlook.com>	2025-11-20 02:53:06 +08:00
congqixia	6c34386ff2	enhance: extract shard client logic into dedicated package (#45018 ) Related to #44761 Refactor proxy shard client management by creating a new internal/proxy/shardclient package. This improves code organization and modularity by: - Moving load balancing logic (LookAsideBalancer, RoundRobinBalancer) to shardclient package - Extracting shard client manager and related interfaces into separate package - Relocating shard leader management and client lifecycle code - Adding package documentation (README.md, OWNERS) - Updating proxy code to use the new shardclient package interfaces This change makes the shard client functionality more maintainable and better encapsulated, reducing coupling in the proxy layer. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-22 10:22:04 +08:00
congqixia	f5f053f1d2	enhance: Refactor privilege management by extracting privilege cache into separate package (#44762 ) Related to #44761 This commit refactors the privilege management system in the proxy component by: 1. Separation of Concerns: Extracts privilege-related functionality from MetaCache into a dedicated `internal/proxy/privilege` package, improving code organization and maintainability. 2. New Package Structure: Creates `internal/proxy/privilege/` with: - `cache.go`: Core privilege cache implementation (PrivilegeCache) - `result_cache.go`: Privilege enforcement result caching - `model.go`: Casbin model and policy enforcement functions - `meta_cache_adapter.go`: Casbin adapter for MetaCache integration - Corresponding test files and mock implementations 3. MetaCache Simplification: Removes privilege and credential management methods from MetaCache interface and implementation: - Removed: GetCredentialInfo, RemoveCredential, UpdateCredential - Removed: GetPrivilegeInfo, GetUserRole, RefreshPolicyInfo, InitPolicyInfo - Deleted: meta_cache_adapter.go, privilege_cache.go and their tests 4. Updated References: Updates all callsites to use the new privilegeCache global: - Authentication interceptor now uses privilegeCache for password verification - Credential cache operations (InvalidateCredentialCache, UpdateCredentialCache, UpdateCredential) now use privilegeCache - Policy refresh operations (RefreshPolicyInfoCache) now use privilegeCache - Privilege interceptor uses new privilege.GetEnforcer() and privilege result cache 5. Improved API: Renames cache functions for clarity: - GetPrivilegeCache → GetResultCache - SetPrivilegeCache → SetResultCache - CleanPrivilegeCache → CleanResultCache This refactoring makes the codebase more modular, separates privilege management concerns from general metadata caching, and provides a clearer API for privilege enforcement operations. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-10-13 11:15:58 +08:00
SimFG	9ffcc55b55	fix: Clean privilege cache after loading policy in InitPolicyInfo (#43642 ) - issue: #43641 Signed-off-by: SimFG <bang.fu@zilliz.com>	2025-07-30 16:57:37 +08:00
Spade A	faeb7fd410	feat: impl StructArray -- create schema, insert, and retrieve data (#42855 ) Ref https://github.com/milvus-io/milvus/issues/42148 https://github.com/milvus-io/milvus/pull/42406 impls the segcore part of storage for handling with VectorArray. This PR: 1. impls the go part of storage for VectorArray 2. impls the collection creation with StructArrayField and VectorArray 3. insert and retrieve data from the collection. --------- Signed-off-by: SpadeA <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <tangchenjie1210@gmail.com> Signed-off-by: SpadeA-Tang <u6748471@anu.edu.au>	2025-07-27 01:30:55 +08:00
Ted Xu	07894b37b6	enhance: returning collection metadata from cache (#42823 ) See #43187 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2025-07-14 10:54:50 +08:00
congqixia	74ea57bac1	enhance: Remove unused load field check from proxy (#42816 ) Related to #42489 Since load list works as hint after cachelayer implemented, the related check logic could be removed to keep code logic clean. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-06-19 19:34:47 +08:00
wei liu	0b4a17c22b	fix: Fix exclude nodes clearing logic position in load balancer retry (#42577 ) issue: #42561 Move the exclude nodes clearing logic from ExecuteWithRetry to selectNode after shard leader cache refresh to ensure proper retry behavior: - Remove premature exclude clearing in ExecuteWithRetry that happened before shard leader cache update - Add exclude clearing logic in selectNode after refreshing shard leader cache when all replicas are excluded - Ensure multiple retries can properly update shard leader cache and clear exclude list when needed - Add comprehensive tests for edge cases including empty shard leaders and mixed serviceable node scenarios --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-06-17 08:15:24 +08:00
wei liu	54619eaa2c	feat: Implement partial result support on node down (#42009 ) issue: https://github.com/milvus-io/milvus/issues/41690 This commit implements partial search result functionality when query nodes go down, improving system availability during node failures. The changes include: - Enhanced load balancing in proxy (lb_policy.go) to handle node failures with retry support - Added partial search result capability in querynode delegator and distribution logic - Implemented tests for various partial result scenarios when nodes go down - Added metrics to track partial search results in querynode_metrics.go - Updated parameter configuration to support partial result required data ratio - Replaced old partial_search_test.go with more comprehensive partial_result_on_node_down_test.go - Updated proto definitions and improved retry logic These changes improve query resilience by returning partial results to users when some query nodes are unavailable, ensuring that queries don't completely fail when a portion of data remains accessible. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2025-05-28 00:12:28 +08:00
Xianhui Lin	6a0e182e13	enhance: support TTL expiration with queries returning no results (#42086 ) support TTL expiration with queries returning no results issue:https://github.com/milvus-io/milvus/issues/41959 Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-05-27 18:28:27 +08:00
congqixia	dbe54c2df8	enhance: [AddField] Resolve conflicts & make WAL ts collection updatets (#41476 ) Related to #39718 This PR: - Use WAL broadcast timestamp as Collection update timestamp - Remove request_fields size assertion - Remove proxy schema cache loaded field check & skip related cases - other minor issues --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-04-24 12:06:39 +08:00
Xianhui Lin	f9febe3bae	enhance: Merge RootCoord, DataCoord And QueryCoord into MixCoord (#41006 ) Merge RootCoord, DataCoord And QueryCoord into MixCoord Make Session into one issue : https://github.com/milvus-io/milvus/issues/37764 --------- Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-04-11 16:36:30 +08:00
smellthemoon	cb1e86e17c	enhance: support add field (#39800 ) after the pr merged, we can support to insert, upsert, build index, query, search in the added field. can only do the above operates in added field after add field request complete, which is a sync operate. compact will be supported in the next pr. #39718 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2025-04-02 14:24:31 +08:00
SimFG	a3755cf409	fix: improve error handling and unit tests for InitMetaCache function (#40322 ) - issue: #40320 Signed-off-by: SimFG <bang.fu@zilliz.com>	2025-03-05 11:08:13 +08:00
congqixia	cb7f2fa6fd	enhance: Use v2 package name for pkg module (#39990 ) Related to #39095 https://go.dev/doc/modules/version-numbers Update pkg version according to golang dep version convention --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2025-02-22 23:15:58 +08:00
Xianhui Lin	82f9689711	enhance: Add schema update time verification for insert and upsert to use cache (#39096 ) enhance: Add schema update time verification for insert and upsert to use cache issue: https://github.com/milvus-io/milvus/issues/39093 --------- Signed-off-by: Xianhui.Lin <xianhui.lin@zilliz.com>	2025-02-07 14:10:45 +08:00
Zhen Ye	bb8d1ab3bf	enhance: make new go package to manage proto (#39114 ) issue: #39095 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2025-01-10 10:49:01 +08:00
SimFG	2afe2eaf3e	feat: support to replicate collection when the services contains the system tt msg (#37559 ) - issue: #37105 --------- Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-12-17 09:08:46 +08:00
tinswzy	27229f7907	enhance: refine exists log print with ctx (#38080 ) issue: #35917 Refines exists log print with ctx Signed-off-by: tinswzy <zhenyuan.wei@zilliz.com>	2024-12-14 22:36:44 +08:00
cai.zhang	73aa95f596	fix: Add version to the proxy cache to resolve concurrency issues (#38067 ) issue: #36989 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-12-04 18:06:39 +08:00
SimFG	302650ae0e	fix: use the default partition for the limit quota when the request partition name is empty (#38005 ) - issue: #37685 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-11-27 11:00:36 +08:00
SimFG	971b4f17ae	fix: add the db information in the dml message (#37969 ) - issue: #37966 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-11-27 10:02:35 +08:00
SimFG	7c5a8012cf	enhance: remove the collectionBasicInfo class in the proxy metacache (#37874 ) /kind improvement - issue: #37928 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-11-22 16:10:33 +08:00
SimFG	923a661dfe	enhance: filter the fields instead of create a new response obj (#37845 ) /kind improvement Here you only need to filter out the system fields, and you don’t need to recreate a response, because recreating the response will cause this part to be easily missed when adding fields later. Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-11-22 11:42:32 +08:00
wei liu	965bda6e60	enhance: Add channel name to shard leader log in meta cache (#37856 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-21 19:24:31 +08:00
cai.zhang	c07f056b17	fix: Use the ID to retrieve the real name when collectionName is empty (#37859 ) issue: #36989 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-11-21 14:28:32 +08:00
Buqian Zheng	511edd29fd	enhance: disallow get raw vector data of a BM25 Function output field (#37800 ) issue: https://github.com/milvus-io/milvus/issues/35853 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-11-20 14:22:30 +08:00
Xiaofan	33bfb25c73	enhance: refine meta cache log and logic (#37318 ) related to #36989 add more logs in proxy meta cache and make it clearer Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-11-19 10:50:33 +08:00
wei liu	2a4c00de9d	enhance: Decouple shard client manager from shard cache (#37371 ) issue: #37115 the old implementation update shard cache and shard client manager at same time, which causes lots of conor case due to concurrent issue without lock. This PR decouple shard client manager from shard cache, so only shard cache will be updated if delegator changes. and make sure shard client manager will always return the right client, and create a new client if not exist. in case of client leak, shard client manager will purge client in async for every 10 minutes. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-12 10:30:28 +08:00
sthuang	70605cf5b3	enhance: Support custom privilege group for RBAC (#37087 ) issue: #37031 --------- Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-11-09 08:44:28 +08:00
wei liu	b83b376cfc	fix: Search/Query may failed during updating delegator cache. (#37116 ) issue: #37115 casue init query node client is too heavy, so we remove updateShardClient from leader mutex, which cause much more concurrent cornor cases. This PR delay query node client's init operation until `getClient` is called, then use leader mutex to protect updating shard client progress to avoid concurrent issues. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-11-05 10:52:23 +08:00
Xiaofan	f13faa37aa	fix: make sure alias is cached (#36807 ) fix #36806 Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-10-31 01:05:03 -07:00
SimFG	bb3ef5349f	enhance: update the expr version to support automatic conversion of variable types (#36832 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-10-15 10:53:22 +08:00
jaime	5713620825	enhance: skip alter operation when no change are detected (#36785 ) issue: #36784 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-12 11:25:20 +08:00
wei liu	bd658a6510	enhance: Enable dynamic update replica selection policy (#35860 ) issue: #35859 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-13 17:05:15 +08:00
aoiasd	da227ff9a1	feat: Support create collection with functions (#35973 ) relate: https://github.com/milvus-io/milvus/issues/35853 Support create collection with functions. Prepare for support bm25 function. --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-09-12 10:43:06 +08:00
jaime	91d23ecbe1	fix: memory leak in proxy meta cache (#36075 ) issue: #36074 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-09-08 17:33:05 +08:00
congqixia	66ed289a85	enhance: Fix typo of clustering key not loaded msg (#35948 ) Related to #35415 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-05 10:49:03 +08:00
congqixia	9d80137698	fix: Check clustering key skip load behavior (#35865 ) feature issue: #35415 See also #35861 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-02 11:17:02 +08:00
SimFG	311f860676	enhance: support to drop the role which is related the privilege list (#35727 ) - issue: #35545 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-08-30 15:17:00 +08:00
congqixia	2fbc628994	feat: Support field partial load collection (#35416 ) Related to #35415 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-20 16:49:02 +08:00
congqixia	a2b517523d	enhance: Add in-memory cache for casbin enforcer result (#35271 ) See also #35270 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-05 18:48:15 +08:00
balloon1995	7306d2d115	fix: fix metaCache cleanup issue when listPolicy failed (#34449 ) issue: #34667 --------- Signed-off-by: balloon1995 <hszoe1995@outlook.com> Co-authored-by: congqixia <congqi.xia@zilliz.com>	2024-07-16 10:03:38 +08:00
Patrick Weizhi Xu	104d0966b7	feat: support partition key isolation (#34336 ) issue: #34332 --------- Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-07-11 19:01:35 +08:00
jaime	60be454db0	enhance: add disk quota and max collections into db properties (#34368 ) issue: #34385 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-07-05 18:22:17 +08:00
wei liu	8a9a42198d	fix: Proxy crash due to shard leader cache data race (#32971 ) issue: #32970 cause InvalidateShardLeaderCache use wrong lock, which may cause data race in meta cache, then proxy may crash This PR fixed that use leaderMut when try to access shard leader cache. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-11 14:32:12 +08:00

1 2 3 4

172 Commits