Needed: - Split the implementation of `shl` and `shr` into signed and unsigned shift variants as done with `sal` and `sar`. - Add benchmarks.