mirror of
https://gitee.com/milvus-io/milvus.git
synced 2025-12-28 22:45:26 +08:00
related: #39173

Core Features
* Parquet File Analysis: Analyze Milvus binlog Parquet files with metadata extraction
* MinIO Integration: Direct connection to MinIO storage for remote file analysis
* Vector Data Deserialization: Specialized handling of Milvus vector data in binlog files
* Interactive CLI: Command-line interface with interactive exploration

Analysis Capabilities
* Metadata & Vector Analysis: Extract schema info, row counts, and vector statistics
* Data Export: Export data to JSON format with configurable limits
* Query Functionality: Search for specific records by ID
* Batch Processing: Analyze multiple Parquet files simultaneously

User Experience
* Verbose Output: Detailed logging for debugging
* Error Handling: Robust error handling for file access and parsing
* Flexible Output: Support for single file and batch analysis formats

---------

Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>
Co-authored-by: nico <109071306+NicoYuan1986@users.noreply.github.com>
11 lines
370 B
Python
"""
|
|
Parquet Analyzer Package
|
|
A toolkit for analyzing parquet files, including metadata parsing and vector deserialization functionality
|
|
"""
|
|
|
|
from .meta_parser import ParquetMetaParser
|
|
from .vector_deserializer import VectorDeserializer
|
|
from .analyzer import ParquetAnalyzer
|
|
|
|
__version__ = "1.0.0"
|
|
__all__ = ["ParquetMetaParser", "VectorDeserializer", "ParquetAnalyzer"] |
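
The `__all__` list above controls which names a wildcard import (`from package import *`) picks up. The sketch below illustrates that behavior with a throwaway stand-in module. The module name `demo_pkg`, the empty placeholder classes, and the `helper` function are assumptions made for illustration and are not part of the real package.

```python
import sys
import types

# Build a throwaway stand-in module. "demo_pkg" is a hypothetical name and the
# classes are empty placeholders, not the real parser/deserializer/analyzer.
demo = types.ModuleType("demo_pkg")
exec(
    "class ParquetMetaParser: pass\n"
    "class VectorDeserializer: pass\n"
    "class ParquetAnalyzer: pass\n"
    "def helper(): pass\n"  # hypothetical name that is NOT listed in __all__
    "__version__ = '1.0.0'\n"
    "__all__ = ['ParquetMetaParser', 'VectorDeserializer', 'ParquetAnalyzer']\n",
    demo.__dict__,
)
sys.modules["demo_pkg"] = demo  # make the stand-in importable by name

# Run a wildcard import into a fresh namespace and inspect what arrived.
namespace = {}
exec("from demo_pkg import *", namespace)
exported = sorted(name for name in namespace if not name.startswith("__"))

print(exported)
print("helper" in namespace)       # names outside __all__ are not star-imported
print(demo.__version__)            # still reachable as a module attribute
```

Only the three names listed in `__all__` reach the importing namespace; anything else, including `__version__`, has to be accessed explicitly through the module.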