Add cache implementation to compute cores #44

Yellown42 · 2025-12-23T22:16:09Z

TL;DR

Adds a simple, educational direct-mapped L1 cache between the LSU and memory controller.
Reduces memory traffic for repeated accesses while keeping the design minimal and easy to understand.
Includes a test demonstrating cache hits vs misses.

Implementation Details

New Files

src/cache.sv: Direct-mapped cache module (64 cache lines, write-through policy)
src/lsu_cached.sv: Cache-enabled LSU that wraps the cache module
test/test_cache.py: Test demonstrating cache effectiveness

Modified Files

src/core.sv: Updated to use lsu_cached instead of lsu
Makefile: Updated to compile cache modules

Architecture

Direct-mapped cache: Each memory address maps to exactly one cache line
64 cache lines: Sufficient for demonstrating cache benefits
Write-through policy: Writes update both cache and memory for simplicity
Per-thread caches: Each thread has its own cache instance

Design Decisions

Direct-mapped: Simplest cache design, easy to understand for educational purposes
Write-through: Keeps cache and memory consistent without complex coherency protocols
Small size (64 lines): Sufficient for tiny-gpu's demonstration scale
Per-thread caches: Avoids cache coherency issues between threads

Testing

Run the cache test:

make test_cache

The test verifies:

Correct data output (each thread outputs 30 = 10+10+10)
Cache functionality (repeated reads of same address)
Compatibility with existing tests (test_matadd, test_matmul)

Backward Compatibility

To use the original non-cached LSU, simply change lsu_cached back to lsu in src/core.sv.

Future Enhancements

Possible extensions to this implementation:

Set-associative cache (reduce conflicts)
Write-back policy (reduce write traffic)
Cache statistics counters for performance analysis
Shared L2 cache across cores
Cache coherency protocols for multi-core scenarios

- Implement direct-mapped cache (64 lines, write-through policy) - Add cache.sv module for data caching between LSU and memory controller - Add lsu_cached.sv as cache-enabled LSU variant - Update core.sv to use lsu_cached - Update Makefile to compile cache modules - Add test_cache.py to demonstrate cache effectiveness with data reuse patterns The cache stores recently accessed data to reduce global memory traffic. Each thread has its own cache instance, providing significant performance improvements for workloads with temporal locality (repeated data access).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add cache implementation to compute cores #44

Add cache implementation to compute cores #44

Yellown42 commented Dec 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add cache implementation to compute cores #44

Are you sure you want to change the base?

Add cache implementation to compute cores #44

Conversation

Yellown42 commented Dec 23, 2025

TL;DR

Implementation Details

New Files

Modified Files

Architecture

Design Decisions

Testing

Backward Compatibility

Future Enhancements

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants