mantle/system/nproc: account for page cache in cgroup available memory#4494
dustymabe merged 2 commits into coreos:main
Conversation
Code Review
The pull request effectively addresses the issue of cgroup available memory calculation by accounting for page cache, which significantly improves the accuracy of GetCurrentMemAvailableMiB(). The new getCgroupMemoryStatField helper function is a well-encapsulated addition for parsing cgroup memory statistics. The changes in harness.go to introduce conditional warning logs for memory reservation waits also enhance the clarity and debuggability of the test harness.
Force-pushed 845f338 to f83ee02
wait... hmm, I thought that golangci-lint was managed by repo-templates for this repo. It seems to be out of sync with the go.mod build version.
Force-pushed f83ee02 to 95410f8
It will be soon. I'm working to get a few PRs merged before I pivot back to #4468, where there will be a lot of cleanups done for golangci-lint. I don't want to do those cleanups yet: I want to get #3989 merged first so I can backport to older branches with less pain. For now I just dropped the existing linters in e117447 and rebased this PR.
prestist
left a comment
LGTM - the page cache fix makes sense. Curious though: how bad was the underestimation? Was the page cache eating up a big chunk of memory.current and causing tests to sit around waiting for memory?
Yeah. Basically there were processes (like …). Look at the logs from the x86_64 kola run in https://jenkins-fedora-coreos-pipeline.apps.ocp.fedoraproject.org/blue/organizations/jenkins/bump-lockfile/detail/bump-lockfile/567/pipeline/542 and you can see it.
Force-pushed 5e57106 to 95410f8
The cgroup available memory calculation used memory.current (total
cgroup usage) directly, which includes page cache (file-backed memory).
Since inactive page cache is reclaimable by the kernel under memory
pressure, it should not count as unavailable. This caused
GetCurrentMemAvailableMiB() to significantly underestimate available
memory, making QEMU instance scheduling overly conservative.
Read the "inactive_file" field from /sys/fs/cgroup/memory.stat, which
reports the page cache size that can be reclaimed easily in bytes, and
subtract it from current usage before computing available memory. The
effective formula becomes:
available = limit - (current - inactive_file)
This mirrors how /proc/meminfo computes MemAvailable by considering
reclaimable caches.
A new helper getCgroupMemoryStatField() is added for parsing individual
fields from memory.stat, returning 0 gracefully if the file or field is
absent.
Written-by: <anthropic/claude-opus-4.6>
Let's pass in a boolean and also warn on the first wait and then periodically after that.
Force-pushed 95410f8 to afcec9a
OK, I had to adjust the strategy here to use … Luckily CI on the previous version was failing, which made me look into the calculation more.
The cgroup available memory calculation used memory.current (total
cgroup usage) directly, which includes page cache (file-backed memory).
Since page cache is reclaimable by the kernel under memory pressure, it
should not count as unavailable. This caused GetCurrentMemAvailableMiB()
to significantly underestimate available memory, making QEMU instance
scheduling overly conservative.
Read the "file" field from /sys/fs/cgroup/memory.stat, which reports
the page cache size in bytes, and subtract it from current usage before
computing available memory. The effective formula becomes:
available = limit - (current - page_cache)
This mirrors how /proc/meminfo computes MemAvailable by considering
reclaimable caches.
A new helper getCgroupMemoryStatField() is added for parsing individual
fields from memory.stat, returning 0 gracefully if the file or field is
absent.
Written-by: <anthropic/claude-opus-4.6>
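The corrected formula above (available = limit - (current - page_cache)) can be sketched as a small helper. All names here are illustrative, not the actual mantle code; inputs are in bytes and the result is in MiB, matching what GetCurrentMemAvailableMiB() is described as returning.

```go
package main

import "fmt"

// availableMiB computes available = limit - (current - pageCache),
// clamped so that reclaimable page cache larger than current usage,
// or usage above the limit, never underflows the unsigned math.
func availableMiB(limit, current, pageCache uint64) uint64 {
	var used uint64
	if pageCache < current {
		used = current - pageCache // reclaimable cache doesn't count as used
	} // else: all usage is reclaimable cache, so effective usage is 0
	if used >= limit {
		return 0
	}
	return (limit - used) / (1024 * 1024)
}

func main() {
	// 8 GiB limit, 6 GiB current usage of which 4 GiB is page cache:
	// effective usage is 2 GiB, so 6144 MiB is reported available.
	fmt.Println(availableMiB(8<<30, 6<<30, 4<<30)) // 6144
}
```

The clamping is the important detail: without it, a cgroup whose usage is mostly cache (pageCache close to or above current) could wrap around and report a huge bogus availability.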
Also a second commit in there to warn when any test initially gets delayed due to memory availability.