roachtest: upload runner-level logs to Datadog by williamchoe3 · Pull Request #166175 · cockroachdb/cockroach

williamchoe3 · 2026-03-19T15:14:01Z

Extends roachtest's datadog integration by adding support for uploading test runner level logs (test_runner-*.log, w*.log) to Datadog. Previously, these were ignored.

trunk-io · 2026-03-19T15:14:06Z

Merging to master in this repository is managed by Trunk.

To merge this pull request, check the box to the left or comment /trunk merge below.

blathers-crl · 2026-03-19T15:14:07Z

Your pull request contains more than 1000 changes. It is strongly encouraged to split big PRs into smaller chunks.

_{🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.}

cockroach-teamcity · 2026-03-19T15:14:23Z

This change is

Epic: None Release note: None Co-Authored-By: roachdev-claude <roachdev-claude-bot@cockroachlabs.com>

williamchoe3 · 2026-03-19T15:25:12Z

The first 2 commits are from #166059, which this PR depends on. Once that PR merges, this branch will be rebased onto master and the diff will only show the third commit.

williamchoe3 · 2026-03-19T15:34:05Z

Smoke: https://teamcity.cockroachdb.com/buildConfiguration/Cockroach_Nightlies_RoachtestNightlyGceBazel/?branch=166175&mode=builds

Query: service:roachtest @log_file:test_runner*
Noticed there are some blank log attributes, added a filter to not have those, running another smoke

https://teamcity.cockroachdb.com/buildConfiguration/Cockroach_Nightlies_RoachtestNightlyGceBazel/21274245?hideProblemsFromDependencies=false&hideTestsFromDependencies=false

works, make one more change, running a final smoke

https://teamcity.cockroachdb.com/buildConfiguration/Cockroach_Nightlies_RoachtestNightlyGceBazel/21274440?hideProblemsFromDependencies=false&hideTestsFromDependencies=false

williamchoe3 · 2026-03-19T15:45:07Z

PR #166175 Review: roachtest: upload runner-level logs to Datadog

Reviewing commit 3 only (f34b01f — "roachtest: upload runner-level logs to Datadog"). Commits 1-2 were reviewed separately.

Overview

Extends the datadog log upload system with a new "runner logs" category for test_runner-*.log and w*.log files from _runner-logs/. The changes are well-structured: readTCBuildProperties is cleanly extracted, the new NewRunnerLogMetadata/ShouldUploadRunnerLogs/MaybeUploadRunnerLogs follow the established pattern, and renames (ShouldUploadLogsToDatadog → ShouldUploadTestLogsToDatadog, discoverLogFiles → discoverTestLogFiles) disambiguate the two categories. Good test coverage across all new functions.

BLOCKERS

None.

MEDIUM

M1. MaybeUploadRunnerLogs duplicates the upload loop from MaybeUploadTestLogs

The two functions share ~30 lines of identical logic: Datadog context creation, API client setup, the file iteration loop (stat → copy metadata → select parser → parse/upload → log timing), and the summary log. The only differences are the discovery function called and the child logger name.

This is manageable for two categories, but the package doc comment establishes categories as a first-class concept ("Current categories: Test logs, Runner logs"), suggesting more may follow. If a third category arrives, the duplication becomes a real maintenance burden.

Consider extracting the shared upload loop into a helper:

func uploadLogFiles(
    ctx context.Context, l *logger.Logger, logsAPI *datadogV2.LogsApi,
    files []string, baseDir string, cfg LogMetadata,
) error { ... }

Not blocking since two copies is tolerable, but worth noting before a third appears.

M2. Runner metadata produces empty-value tags in Datadog

NewRunnerLogMetadata leaves test-specific fields empty (TestName, Owner, Cloud, Platform), but makeTags() unconditionally includes all fields:

func (m LogMetadata) makeTags() map[string]string {
    tagMap := map[string]string{
        testNameTagName: m.TestName,  // ""
        ownerTagName:    m.Owner,     // ""
        cloudTagName:    m.Cloud,     // ""
        platformTagName: m.Platform,  // ""
        ...
    }
}

This produces Datadog tags like name:, owner:, cloud:, platform: with empty values. In the Datadog UI, these appear in tag facet dropdowns as blank entries, making it harder to filter and distinguish runner logs from test logs.

Options:

Skip empty values in makeTags() (changes behavior for test logs too — probably fine since empty tags aren't useful there either).
Add a category tag (category:runner vs category:test) to explicitly distinguish log types, regardless of empty fields.
Accept the empty tags for now.

NITS

N1. Typo in selectParser comment: "it's" → "its"

// ... and it's parser does not match any lines ...

Should be "its parser" (possessive, not contraction).

N2. "uploaded X log events" printed even after upload error

if uploadErr != nil {
    l.Printf("error uploading %s to Datadog (%d entries uploaded): %v", ...)
}
l.Printf("uploaded %d log events from %s in %s", ...)

After a failed upload, the log shows both an error and a success-like message ("uploaded 0 log events"). Same pattern exists in MaybeUploadTestLogs, so not a regression, but worth wrapping the success message in an else:

if uploadErr != nil {
    l.Printf("error uploading %s ...", ...)
} else {
    l.Printf("uploaded %d log events from %s in %s", ...)
}

N3. shouldUploadRunnerFile matches more than intended

if strings.HasSuffix(name, ".log") && name[0] == 'w' && name[1] >= '0' && name[1] <= '9' {

This matches any file starting with w + digit + anything + .log, e.g., w0extra.log or widget.log (if it existed). Since the _runner-logs/ directory is controlled by roachtest, this won't cause issues in practice. But a stricter check would be:

// Match w<digits>.log exactly.
if strings.HasSuffix(name, ".log") && len(name) > 4 {
    stem := strings.TrimSuffix(name, ".log")
    if stem[0] == 'w' {
        if _, err := strconv.Atoi(stem[1:]); err == nil {
            return true
        }
    }
}

Not worth the complexity unless false matches become a problem.

N4. Blank line removed in NewLogMetadata

The diff removes a blank line after the function signature opening brace in NewLogMetadata. Minor formatting change mixed in with the refactoring — fine, just noting for commit hygiene.

williamchoe3 · 2026-03-19T15:48:55Z

M1 ... This is manageable for two categories, but the package doc comment establishes categories as a first-class concept ("Current categories: Test logs, Runner logs"), suggesting more may follow. If a third category arrives, the duplication becomes a real maintenance burden.

Added a comment on the top of file to explain the boilerplate, I agree if this needs to be extended for a 3rd category of logs for roachtest, I would look to implementing an interface that clearly defines what is needed by a runnerLogsUploaderImpl or testLogsUploaderImpl, but at this point I'm not sure what that 3rd category would be so I'm leaving it as is for now.

M2, N1: addressed
N2, N3: I think these are fine as is. For N2, since we're best effort, we can both fail and succeed so a strict if else wouldn't make as much sense. For N3, I don't think we need to be that cautious.

williamchoe3 · 2026-03-19T20:02:18Z

link

Add support for uploading test runner logs (test_runner-*.log, w*.log) to Datadog. Epic: None Release note: None Co-Authored-By: roachdev-claude <roachdev-claude-bot@cockroachlabs.com>

williamchoe3 and others added 2 commits March 19, 2026 11:18

roachtest: improve datadog parsing test coverage

8855983

Epic: None Release note: None Co-Authored-By: roachdev-claude <roachdev-claude-bot@cockroachlabs.com>

CODEOWNERS: add entry for roachtest's datadog package

89c0fc4

Epic: None Release note: None Co-Authored-By: roachdev-claude <roachdev-claude-bot@cockroachlabs.com>

williamchoe3 force-pushed the datadog-runner-logs branch from 79337eb to f34b01f Compare March 19, 2026 15:22

williamchoe3 force-pushed the datadog-runner-logs branch 5 times, most recently from 1d65ecd to 61aa4cd Compare March 19, 2026 20:01

williamchoe3 marked this pull request as ready for review March 19, 2026 20:04

williamchoe3 requested review from a team as code owners March 19, 2026 20:04

williamchoe3 requested review from DarrylWong and srosenberg and removed request for a team and DarrylWong March 19, 2026 20:04

roachtest: upload runner-level logs to Datadog

b9ddc82

Add support for uploading test runner logs (test_runner-*.log, w*.log) to Datadog. Epic: None Release note: None Co-Authored-By: roachdev-claude <roachdev-claude-bot@cockroachlabs.com>

williamchoe3 force-pushed the datadog-runner-logs branch from 61aa4cd to b9ddc82 Compare March 20, 2026 19:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

roachtest: upload runner-level logs to Datadog#166175

roachtest: upload runner-level logs to Datadog#166175
williamchoe3 wants to merge 3 commits intocockroachdb:masterfrom
williamchoe3:datadog-runner-logs

williamchoe3 commented Mar 19, 2026 •

edited

Loading

Uh oh!

trunk-io bot commented Mar 19, 2026

Uh oh!

blathers-crl bot commented Mar 19, 2026

Uh oh!

cockroach-teamcity commented Mar 19, 2026

Uh oh!

williamchoe3 commented Mar 19, 2026

Uh oh!

williamchoe3 commented Mar 19, 2026 •

edited

Loading

Uh oh!

williamchoe3 commented Mar 19, 2026

Uh oh!

williamchoe3 commented Mar 19, 2026 •

edited

Loading

Uh oh!

williamchoe3 commented Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

williamchoe3 commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

trunk-io bot commented Mar 19, 2026

Uh oh!

blathers-crl bot commented Mar 19, 2026

Uh oh!

cockroach-teamcity commented Mar 19, 2026

Uh oh!

williamchoe3 commented Mar 19, 2026

Uh oh!

williamchoe3 commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

williamchoe3 commented Mar 19, 2026

PR #166175 Review: roachtest: upload runner-level logs to Datadog

Overview

BLOCKERS

MEDIUM

NITS

Uh oh!

williamchoe3 commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

williamchoe3 commented Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

williamchoe3 commented Mar 19, 2026 •

edited

Loading

williamchoe3 commented Mar 19, 2026 •

edited

Loading

williamchoe3 commented Mar 19, 2026 •

edited

Loading