Skip to content

Conversation

@yhmtsai
Copy link
Contributor

@yhmtsai yhmtsai commented Aug 21, 2025

Summary:
This PR implements the simple operation like scale, frob_norm, inf_norm,

Details:

  • only do it for single and double version
  • inf_norm uses very slow implementation to get norm row by row, but I do not need to implement additional kernel
  • I do not find the test on CPU. Could someone remind me?
  • Additionally, I add the exception for cublas error

Merge Checklist:

  • Passing CI
  • Update documentation or README.md
  • Additional Test/example added (if applicable) and passing
  • At least one reviewer approval
  • (optional) Clang sanitizer scan run and triaged
  • Clang formatter applied (verified as part of passing CI)

@yhmtsai yhmtsai self-assigned this Aug 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants