milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2026-01-07 19:31:51 +08:00

test: replace parquet with jsonl for EventRecords and RequestRecords in checker (#46671)

/kind improvement

<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
- Core invariant: tests' persistence of EventRecords and RequestRecords
must be append-safe under concurrent writers; this PR replaces Parquet
with JSONL and uses per-file locks and explicit buffer flushes to
guarantee atomic, append-safe writes (EventRecords uses event_lock +
append per line; RequestRecords buffers under request_lock and flushes
to file when the buffer reaches its threshold or on sink()).

- Logic removed/simplified and rationale: DataFrame-based parquet
append/read logic (pyarrow/fastparquet) and implicit parquet buffering
were removed in favor of simple line-oriented JSON writes and explicit
buffer management. The complex Parquet append/merge paths were redundant
because parquet append under concurrent test-writer patterns caused
corruption; JSONL removes the append-mode complexity and the
parquet-specific buffering/serialization code.

- Why no data loss or behavior regression (concrete code paths):
EventRecords.insert writes a complete JSON object per event under
event_lock to /tmp/ci_logs/event_records_*.jsonl and get_records_df
reads every JSON line under the same lock (or returns an empty DataFrame
with the same schema on FileNotFound/Error), preserving all fields
event_name/event_status/event_ts. RequestRecords.insert appends to an
in-memory buffer under request_lock and triggers _flush_buffer() when
len(buffer) >= 100; _flush_buffer() writes each buffered JSON line to
/tmp/ci_logs/request_records_*.jsonl and clears the buffer; sink() calls
_flush_buffer() under request_lock before get_records_df() reads the
file — ensuring all buffered records are persisted before reads. Both
read paths handle FileNotFoundError and exceptions by returning empty
DataFrames with identical column schemas, so external callers see the
same API and no silent record loss.

- Enhancement summary (concrete): Replaces flaky Parquet append/read
with JSONL + explicit locking and deterministic flush semantics,
removing the root cause of parquet append corruption in tests while
keeping the original DataFrame-based analysis consumers unchanged
(get_records_df returns equivalent schemas).
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
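
The buffering and locking pattern described in the notes above can be sketched as follows. This is a minimal illustration and not the checker's actual code: the class name `JsonlRecords`, the `get_records` method (which returns a plain list instead of a DataFrame), and the constructor parameters are hypothetical. Only the lock-protected buffer, the threshold of 100, the `_flush_buffer()`/`sink()` names, and the empty result on a missing file come from the description.

```python
import json
import threading
from pathlib import Path


class JsonlRecords:
    """Hypothetical stand-in for the checker's record classes."""

    def __init__(self, path, flush_threshold=100):
        self.path = Path(path)
        self.flush_threshold = flush_threshold
        self.lock = threading.Lock()
        self.buffer = []

    def _flush_buffer(self):
        # The caller must already hold self.lock.
        if not self.buffer:
            return
        self.path.parent.mkdir(parents=True, exist_ok=True)
        # Append mode: one complete JSON object per line.
        with self.path.open("a", encoding="utf-8") as f:
            for record in self.buffer:
                f.write(json.dumps(record) + "\n")
        self.buffer.clear()

    def insert(self, record):
        with self.lock:
            self.buffer.append(record)
            if len(self.buffer) >= self.flush_threshold:
                self._flush_buffer()

    def sink(self):
        # Persist whatever is still buffered before anyone reads the file.
        with self.lock:
            self._flush_buffer()

    def get_records(self):
        # Read under the same lock so a reader never sees a half-written flush.
        with self.lock:
            try:
                with self.path.open(encoding="utf-8") as f:
                    return [json.loads(line) for line in f if line.strip()]
            except FileNotFoundError:
                return []
```

Calling `sink()` before `get_records()` mirrors the ordering the notes rely on: records still in memory are written out first, so the read sees every inserted record.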

Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>

2025-12-30 14:13:21 +08:00

_helm

…

benchmark

…

docker

…

go_client

feat: [GoSDK] add QueryIterator support for Go client (#46633)

2025-12-27 01:43:20 +08:00

integration

…

java_client

…

python_client

test: replace parquet with jsonl for EventRecords and RequestRecords in checker (#46671)

2025-12-30 14:13:21 +08:00

restful_client

…

restful_client_v2

…

scripts

…

OWNERS

…

README_CN.md

…

README.md

…

README.md

Tests

E2E Test

Configuration Requirements

Operating System

Operating System	Version
Amazon Linux	2023 or above
Ubuntu	20.04 or above
Mac	10.14 or above

Hardware

Hardware Type	Recommended Configuration
CPU	x86_64 architecture: Intel CPU Sandy Bridge or above, with CPU instruction sets SSE4_2, AVX, AVX2, AVX512; or arm64 Linux/MacOS
Memory	16 GB or more

Software

Software Name	Version
Docker	19.05 or above
Docker Compose	1.25.5 or above
jq	1.3 or above
kubectl	1.14 or above
helm	3.0 or above
kind	0.10.0 or above
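
As a quick sanity check before running the tests, the tools in the table above can be looked up on `PATH`. This is a minimal sketch: the function name `missing_tools` is hypothetical, it does not verify versions, and it assumes Docker Compose is available as a `docker` subcommand, so only the `docker` binary is checked for it.

```python
import shutil

# Binary names taken from the Software table. Docker Compose is assumed to be
# provided through the docker binary, so it has no separate entry here.
REQUIRED_TOOLS = ["docker", "jq", "kubectl", "helm", "kind"]


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the tools that cannot be found on PATH (versions are not checked)."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing tools:", ", ".join(missing))
    else:
        print("All required tools were found on PATH.")
```

Version requirements still need to be confirmed by hand, for example with each tool's own version command.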

Installing Dependencies

Troubleshooting Docker and Docker Compose

Confirm that the Docker daemon is running:

$ docker info

Ensure that Docker is installed. Refer to the official installation instructions for Docker CE/EE.
Start the Docker Daemon if it is not already started.
To run Docker without root privileges, create a user group named docker, then add your user to the group with sudo usermod -aG docker $USER. Log out and log back in for the changes to take effect. For more information, see the official Docker documentation for Managing Docker as a Non-Root User.

Check the version of Docker-Compose

$ docker compose version

docker compose version 1.25.5, build 8a1c60f6
docker-py version: 4.1.0
CPython version: 3.7.5
OpenSSL version: OpenSSL 1.1.1f  31 Mar 2020

To install Docker Compose, see Install Docker Compose.

Run E2E Tests

$ cd tests/scripts
$ ./e2e-k8s.sh

Getting help

You can get help with the following command:
$ ./e2e-k8s.sh --help
