milvus/internal/core/src/index/ScalarIndex.h
Jiquan Long e549148a19
enhance: full-support for wildcard pattern matching (#30288)
issue: #29988 
This pr adds full-support for wildcard pattern matching from end to end.
Before this pr, the users can only use prefix match in their expression,
for example, "like 'prefix%'". With this pr, more flexible syntax can be
combined.

To do so, this pr makes these changes:
- 1. support regex query both on index and raw data;
- 2. translate the pattern matching to regex query, so that it can be
handled by the regex query logic;
- 3. loose the limit of the expression parsing, which allows general
pattern matching syntax;

With the support of regex query in segcore backend, we can also add
mysql-like `REGEXP` syntax later easily.

---------

Signed-off-by: longjiquan <jiquan.long@zilliz.com>
2024-02-01 12:37:04 +08:00

89 lines
2.3 KiB
C++

// Licensed to the LF AI & Data foundation under one
// or more contributor license agreements. See the NOTICE file
// distributed with this work for additional information
// regarding copyright ownership. The ASF licenses this file
// to you under the Apache License, Version 2.0 (the
// "License"); you may not use this file except in compliance
// with the License. You may obtain a copy of the License at
//
// http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.
#pragma once
#include <boost/dynamic_bitset.hpp>
#include <map>
#include <memory>
#include <string>
#include "common/Types.h"
#include "common/EasyAssert.h"
#include "index/Index.h"
#include "fmt/format.h"
namespace milvus::index {
template <typename T>
class ScalarIndex : public IndexBase {
public:
void
BuildWithRawData(size_t n,
const void* values,
const Config& config = {}) override;
void
BuildWithDataset(const DatasetPtr& dataset,
const Config& config = {}) override {
PanicInfo(Unsupported,
"scalar index don't support build index with dataset");
};
public:
virtual void
Build(size_t n, const T* values) = 0;
virtual const TargetBitmap
In(size_t n, const T* values) = 0;
virtual const TargetBitmap
NotIn(size_t n, const T* values) = 0;
virtual const TargetBitmap
Range(T value, OpType op) = 0;
virtual const TargetBitmap
Range(T lower_bound_value,
bool lb_inclusive,
T upper_bound_value,
bool ub_inclusive) = 0;
virtual T
Reverse_Lookup(size_t offset) const = 0;
virtual const TargetBitmap
Query(const DatasetPtr& dataset);
virtual int64_t
Size() = 0;
virtual bool
SupportRegexQuery() const {
return false;
}
virtual const TargetBitmap
RegexQuery(const std::string& pattern) {
PanicInfo(Unsupported, "regex query is not supported");
}
};
template <typename T>
using ScalarIndexPtr = std::unique_ptr<ScalarIndex<T>>;
} // namespace milvus::index