-
Notifications
You must be signed in to change notification settings - Fork 6.6k
Add queue observability for liaison internal pipelines #13775
Copy link
Copy link
Open
Labels
databaseBanyanDB - SkyWalking native databaseBanyanDB - SkyWalking native database
Milestone
Description
Problem
Liaison queue behavior is not easy to observe end-to-end during chunked sync pressure and failures.
Current context
- Queue-sub already exposes chunk-ordering/error metrics in
banyand/queue/sub/server.go. - Liaison wires
sub.NewServerWithPorts(..., "liaison-server", ...)inpkg/cmdsetup/liaison.go. - Existing dashboards aggregate some queue errors (
banyandb_queue_sub_total_msg_sent_err) indocs/operation/observability.mdanddocs/operation/grafana-cluster.json.
Proposal
- Add/extend liaison-focused queue metrics (depth, retries, per-topic throughput/latency, failed-part counters).
- Add dashboard panels and alert suggestions for liaison queue health.
- Document metric meanings and troubleshooting paths.
Acceptance criteria
- New metrics are exported and documented.
- Dashboards include liaison queue saturation/failure visibility.
- Integration/e2e validation demonstrates metric changes under injected failures.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
databaseBanyanDB - SkyWalking native databaseBanyanDB - SkyWalking native database