-
Notifications
You must be signed in to change notification settings - Fork 421
vegeta intrinsics and sparse sgemm implemented #305
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
78185ee
887bbf7
4653469
eee389f
06eb858
c757d32
b4d0a4a
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||||
|---|---|---|---|---|---|---|
|
|
@@ -287,34 +287,34 @@ inline __attribute__((const)) int vx_shfl_idx(size_t value, int bval, int cval, | |||||
|
|
||||||
| // TILE LOAD T: Load 1KB from ptr[TILE] to tile register index 'dst_treg' | ||||||
| // Each load uses I-type encoding: rd=dst tile index, rs1=src_gpr, imm=ptr immediate | ||||||
| inline void vx_lt(int dst_treg, int src_gpr, size_t ptr_imm) { | ||||||
| inline void vx_lt(int dst_treg, size_t src_gpr, size_t ptr_imm) { | ||||||
| __asm__ volatile (".insn i %0, 0, x%1, %2, %3" | ||||||
| :: "i"(RISCV_CUSTOM1), "i"(dst_treg), "r"(src_gpr), "i"(ptr_imm) : "memory"); | ||||||
| } | ||||||
|
|
||||||
| // TILE LOAD U: Load 1KB from ptr[TILE] to ureg index 'dst_ureg' | ||||||
|
||||||
| // TILE LOAD U: Load 1KB from ptr[TILE] to ureg index 'dst_ureg' | |
| // TILE LOAD U: Load 4KB from ptr[TILE] to ureg index 'dst_ureg' |
Copilot
AI
Dec 2, 2025
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment says "Load 1KB" but V-regs are 8KB (8192 bytes) as defined in common.h. Update the comment to reflect the correct size: "Load 8KB from ptr[TILE] to vreg index 'dst_vreg'".
| // TILE LOAD V: Load 1KB from ptr[TILE] to vreg index 'dst_vreg' | |
| // TILE LOAD V: Load 8KB from ptr[TILE] to vreg index 'dst_vreg' |
Copilot
AI
Dec 2, 2025
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment says "Load 1KB" but M-regs are 256 bytes as defined in common.h. Update the comment to reflect the correct size: "Load 256B from ptr[TILE] to mreg index 'dst_mreg'".
| // TILE LOAD M: Load 1KB from ptr[TILE] to mreg index 'dst_mreg' | |
| // TILE LOAD M: Load 256B from ptr[TILE] to mreg index 'dst_mreg' |
Copilot
AI
Dec 2, 2025
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment says "Store 1KB" but T-regs are 2KB (2048 bytes) as defined in common.h. Update the comment to reflect the correct size: "Store 2KB from treg index 'src_treg' to ptr[TILE]".
| // TILE STORE T: Store 1KB from treg index 'src_treg' to ptr[TILE] | |
| // TILE STORE T: Store 2KB from treg index 'src_treg' to ptr[TILE] |
| Original file line number | Diff line number | Diff line change | ||||
|---|---|---|---|---|---|---|
|
|
@@ -501,6 +501,9 @@ Word Emulator::get_csr(uint32_t addr, uint32_t wid, uint32_t tid) { | |||||
| #endif | ||||||
| #ifdef EXT_VPU_ENABLE | ||||||
| CSR_READ_64(VX_CSR_MPM_SCRB_TCU, core_perf.scrb_vpu); | ||||||
| #endif | ||||||
| #ifdef EXT_VEGETA_ENABLE | ||||||
| CSR_READ_64(VX_CSR_MPM_SCRB_TCU, core_perf.scrb_vegeta); | ||||||
|
||||||
| CSR_READ_64(VX_CSR_MPM_SCRB_TCU, core_perf.scrb_vegeta); | |
| CSR_READ_64(VX_CSR_MPM_SCRB_VEGETA, core_perf.scrb_vegeta); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment says "Load 1KB" but T-regs are 2KB (2048 bytes) as defined in common.h. Update the comment to reflect the correct size: "Load 2KB from ptr[TILE] to tile register index 'dst_treg'".