Fix precise floats by cadmus-to · Pull Request #152 · GobySoft/dccl

cadmus-to · 2026-01-22T02:17:14Z

Summary

This PR improves the numerical stability and precision of float and double encoding/decoding in DefaultNumericFieldCodec, particularly for higher precision decimal fields (e.g. precision ≥ 6) and or values close to quantisation boundaries.
The core change is to avoid lossy floating‑point arithmetic during quantisation by performing the core mathematics in an integer domain derived directly from the IEEE‑754 representation of floats. This makes the encoding behave consistently with the intended round(value/resolution) − round(min/resolution) semantics even at the limits of float precision.

An initial attempt at extracting the significand and exponent into standard integer types improved the situation, but still ran out of precision for some edge cases. That led to the current approach, which uses a wider integer representation (std::bitset necessary for 128 bits when working with doubles) for the significand to perform the core arithmetic exactly.

Implementation overview

Encoding changes
The floating types are decomposed into std::bitset significand and integer exponent based on the IEEE‑754 layout.
Additional arithmetic was added to work with the std::bitset/integer representations including:

increment
negation
sum, subtract, multiply, divide
rounding

A std::bitset with twice the width of the native significand is used to safely carry intermediate results. This is implemented for floats so that we test the same code path as the double case.

For non‑floating types, the existing logic is preserved.

Decoding changes
Decoding mirrors the inverse integer arithmetic to reconstruct the value.
Final values are explicitly quantised at the end to ensure consistency with declared resolution.

Tests

This PR adds a dedicated test suite covering:

Simple float precision round‑trip cases
Negative precision handling
A sweep across a range of precisions (0 through 6) close to float’s decimal limits
Boundary values around representable float precision

All existing tests pass, and all new tests pass with the updated implementation.

Additional comments

Encoding changes
Although decode/encode round‑trips preserve values within the expected tolerance, the bit‑level encoding has changed for some float values compared to previous versions. This means encoded payloads for some float fields may differ from earlier DCCL versions.

Should this logic be gated behind a new codec / protocol version (e.g. v5)? Should I leave this for you to handle @tsaubergine ?

Code cleanup / style
Are there helpers or utilities you would prefer refactored, renamed, or relocated?
Are there unused functions that should be removed or consolidated?

Cases are manually verified to be expressible in float using an online tool

non-float representations are encoded with round((val - min)/res) float representations follow the same mathematics, but floats are converted to integer significant and exponent representations for lossless computation. std::bitset is also used to implement "wider" integer types. This is generalised so that the implementation for floats will cover for doubles too (would use 128 bit integers)

tsaubergine · 2026-01-22T03:56:38Z

Thanks for this - I'll take a deeper look soon.

This will probably need to be part of Codec 5, but I'll see what I think. For DCCL5 I'd be happy to push the minimum C++ version to 17.

cadmus-to · 2026-01-22T04:12:34Z

Oh I didn't see your message. I updated it so that it works with C++14 anyway 😂

cadmus-to · 2026-01-22T04:21:40Z

Okay I think I did all I can for the CI tests. I'm not really sure why the amd64-noble-build is failing. The error is in a generated file?

tsaubergine · 2026-01-26T02:40:46Z

Hi, thanks for your work here - it looks good. I have no idea why the amd64-noble-build build failure was happening but it's gone now and I can't reproduce. OS X needs more work to support the new abseil library in Protubuf > 3.21.

This will become part of DCCL 5, which I've started tracking at DCCL 5

cadmus-to · 2026-01-26T11:28:21Z

No problem!

This will become part of DCCL 5, which I've started tracking at DCCL 5
The link doesn't seem accessible publically yet.

Also do we need to move the version of the code as V5 codec or should this be overwriting the previous implementation?

tsaubergine · 2026-01-26T20:25:26Z

No problem!

This will become part of DCCL 5, which I've started tracking at DCCL 5
The link doesn't seem accessible publically yet.

Also do we need to move the version of the code as V5 codec or should this be overwriting the previous implementation?

Yes we will make this the v5 DefaultNumericFieldCodec. I think I'd like to move these support functions out of common.h into something more specifically named, perhaps numeric.h and numeric.cpp?

Also I fixed the visibility of the DCCL5 project.

cadmus-to · 2026-01-26T22:45:04Z

Apologies about the force push, I had to update the emails for a few of my commits.

Yes we will make this the v5 DefaultNumericFieldCodec. I think I'd like to move these support functions out of common.h into something more specifically named, perhaps numeric.h and numeric.cpp?

I moved the new functions from this PR to numeric.*, did you want any others moved over there? There's also a bunch of template functions that aren't in the .cpp yet. Did you want to use explicit template specialization so we can put the template implementations in the .cpp file?

…this likely changes the wire format in some cases. Update unit test for v5

Co-authored-by: tsaubergine <732276+tsaubergine@users.noreply.github.com>

Copilot

Pull request overview

This PR introduces a new numerically robust float/double quantization path for DCCL v5 default numeric fields (aimed at eliminating edge-case precision loss near quantization boundaries) and adds a dedicated test suite to validate float precision behavior across multiple precisions.

Changes:

Added src/numeric.h/.cpp with IEEE-754 decomposition helpers and bitset-based integer-domain arithmetic for float/double encode/decode.
Replaced the v5 DefaultNumericFieldCodec alias with an explicit v5 implementation that uses the new dccl::encode/decode helpers.
Added a new dccl_float_precision CTest target (proto + test) covering float precision sweeps, boundary cases, and negative precision.

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
`src/codecs5/field_codec_default.h`	Implements v5 numeric codec using new numeric helpers for improved float/double stability.
`src/numeric.h`	Adds bitset arithmetic + template encode/decode helpers used by v5 numeric codec.
`src/numeric.cpp`	Adds IEEE-754 decomposition/composition implementation backing `numeric.h` declarations.
`src/common.h`	Adds `<bitset>` include (supporting new numeric utilities).
`src/CMakeLists.txt`	Adds `numeric.cpp` to the library build.
`src/test/CMakeLists.txt`	Adds the new float precision test subdirectory.
`src/test/dccl_float_precision/test.proto`	New proto messages for float precision / negative precision / precision sweep testing.
`src/test/dccl_float_precision/test.cpp`	New executable test validating v5 float encoding/decoding behavior and tolerances.
`src/test/dccl_float_precision/CMakeLists.txt`	Builds and registers the new `dccl_test_float_precision` test.
`AUTHORS`	Adds contributor entry.
`CMakeLists.txt`	Minor whitespace-only change.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

…memcpy to read float into uint instead of reinterpret_cast; fix typo with float overload of numeric_limits in double function

tsaubergine · 2026-03-19T00:40:20Z

@cadmus-to Thank you for your hard work here - this is great. I can't convince myself that it won't change the wire format in some edge cases so this will be the default v5 numeric codec in the forthcoming 5.0.0 release (and v2-v4 will stay the same, slightly buggy former implementation).

Cadmus added 14 commits January 22, 2026 12:45

chore: add self to AUTHORS

759f6e6

test: add simple float precision test cases

6248e7b

Cases are manually verified to be expressible in float using an online tool

test: add tests for negative precision

75c3352

test: add precision range testing

770604d

perf: change to use rounded division

1b93294

fix: adjust decoding logic

2662fed

fix: remove reliance on bitset's to_ullong

7058152

fix: add logic to handle negative value cases before dropping sig figs

b38347c

fix: adjust main algorithm for encoding/decoding

87706e9

fix: apply quantize at the end of decoding

bd5c623

fix: handle negative sum case properly

a321424

chore: remove printing in code

abb829c

refactor: change to make compatible with C++14

055da85

tsaubergine self-requested a review January 22, 2026 03:54

tsaubergine self-assigned this Jan 22, 2026

Cadmus added 2 commits January 22, 2026 14:32

test: fix compilation issue on CircleCI

25393c9

test: try to fix array compilation issue

a8b20ec

cadmus-to marked this pull request as ready for review January 22, 2026 04:21

tsaubergine added 3 commits January 26, 2026 09:26

Bump macosx executor

26c4b4c

Bump Cmake minimum required to allow us to build on newer Cmake

91a10e3

Disable OSX build until we fix abseil issues with new protobuf

95aa06b

tsaubergine added this to DCCL 5 Jan 26, 2026

github-project-automation Bot moved this to Backlog in DCCL 5 Jan 26, 2026

tsaubergine moved this from Backlog to In review in DCCL 5 Jan 26, 2026

Cadmus added 2 commits January 27, 2026 09:07

refactor: migrate new functions to numeric.h

0311fe6

refactor: migrate some function implementations to numeric.cpp

3aa7de2

cadmus-to force-pushed the fix-precise-floats branch from ccb9514 to 3aa7de2 Compare January 26, 2026 22:39

tsaubergine and others added 2 commits March 18, 2026 11:55

Merge branch '4.0' into fix-precise-floats

d7608ea

Fix protobuf_generate function and remove commenting out of osx-build

264d4fd

tsaubergine changed the base branch from 4.0 to 5.0 March 18, 2026 23:04

tsaubergine added 2 commits March 19, 2026 10:45

Merge branch '5.0' into fix-precise-floats

e74fdf3

Change target for reworked DefaultNumericFieldCodec to v5 from v2 as …

ba8bdf9

…this likely changes the wire format in some cases. Update unit test for v5

Copilot AI mentioned this pull request Mar 18, 2026

Fix float boundary encoding (issue #149) and add v5 precise numeric codec #177

Merged

tsaubergine requested a review from Copilot March 18, 2026 23:59

Copilot AI added a commit that referenced this pull request Mar 19, 2026

Add unit test for issue #149 and implement PR #152 fix in v5 codec

f891a8f

Co-authored-by: tsaubergine <732276+tsaubergine@users.noreply.github.com>

Copilot AI reviewed Mar 19, 2026

View reviewed changes

Comment thread src/numeric.h

Comment thread src/numeric.h

Comment thread src/numeric.h

Comment thread src/numeric.cpp Outdated

Comment thread src/numeric.cpp

Comment thread src/numeric.cpp

Comment thread src/numeric.cpp Outdated

Comment thread src/test/dccl_float_precision/test.cpp

tsaubergine and others added 3 commits March 19, 2026 11:20

Remove header guards on .cpp file; add missing <cassert> header, use …

ddba608

…memcpy to read float into uint instead of reinterpret_cast; fix typo with float overload of numeric_limits in double function

Missing cassert header

c0626b7

Merge branch '5.0' into fix-precise-floats

5640c7e

tsaubergine merged commit 6267278 into GobySoft:5.0 Mar 19, 2026
10 checks passed

github-project-automation Bot moved this from In review to Done in DCCL 5 Mar 19, 2026

tsaubergine mentioned this pull request Mar 19, 2026

min max limits of floating point fields are not always inclusive #149

Closed

Uh oh!

Conversation

cadmus-to commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Implementation overview

Tests

Additional comments

Uh oh!

tsaubergine commented Jan 22, 2026

Uh oh!

cadmus-to commented Jan 22, 2026

Uh oh!

cadmus-to commented Jan 22, 2026

Uh oh!

tsaubergine commented Jan 26, 2026

Uh oh!

cadmus-to commented Jan 26, 2026

Uh oh!

tsaubergine commented Jan 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cadmus-to commented Jan 26, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tsaubergine commented Mar 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cadmus-to commented Jan 22, 2026 •

edited

Loading

tsaubergine commented Jan 26, 2026 •

edited

Loading