Fix performance trace inline asm in os_common.h #492
Open
disconnect3d wants to merge 1 commit into Tencent:master
This commit fixes an inline assembly block that uses the `rdtsc` instruction to trace SQLite performance.

The issue is that the `__asm__` block is not marked as `__volatile__`, so an optimizing compiler (e.g. GCC 10.1 with the `-O3` compilation flag) may optimize out the second call to the `hwtime()` function, assuming it returns the same value as the first.

This behavior is also described in the GCC docs at https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html#Volatile in the following passage:

> The following example demonstrates a case where you need to use the volatile qualifier. It uses the x86 rdtsc instruction, which reads the computer’s time-stamp counter. Without the volatile qualifier, the optimizers might assume that the asm block will always return the same value and therefore optimize away the second call.

The issue can also be seen on https://godbolt.org/z/v_a-Qy
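For illustration, here is a minimal sketch of the pattern described above. It is not copied from `os_common.h`: the `hwtime()` body, the `elapsed_ticks()` helper, the `spin()` workload, and the non-x86 fallback are all assumptions made for this example. The point it shows is that with `__volatile__` on the `rdtsc` block, the compiler has to emit both reads instead of reusing the first result.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <time.h>

/* Hypothetical stand-in for the hwtime() helper discussed in this PR. */
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
static inline uint64_t hwtime(void) {
    uint32_t lo, hi;
    /* __volatile__ marks the asm as having side effects, so the optimizer
       may not merge or drop repeated executions of this block. */
    __asm__ __volatile__("rdtsc" : "=a"(lo), "=d"(hi));
    return ((uint64_t)hi << 32) | lo;
}
#else
/* Assumed fallback for non-x86 targets so the sketch still builds. */
static inline uint64_t hwtime(void) {
    struct timespec ts;
    timespec_get(&ts, TIME_UTC);
    return (uint64_t)ts.tv_sec * 1000000000u + (uint64_t)ts.tv_nsec;
}
#endif

/* Hypothetical helper: counter ticks spent inside fn(). */
static uint64_t elapsed_ticks(void (*fn)(void)) {
    assert(fn != NULL);
    uint64_t start = hwtime();
    fn();
    /* Second read of the counter; without __volatile__ on the asm block,
       an optimizing compiler could reuse `start` here and return 0. */
    return hwtime() - start;
}

/* Hypothetical workload: a loop the compiler cannot remove. */
static void spin(void) {
    volatile unsigned counter = 0;
    for (unsigned i = 0; i < 100000u; i++) {
        counter += i;
    }
}
```

Calling `elapsed_ticks(spin)` should then report a non-zero tick count. If the `__volatile__` qualifier is removed, an optimizing build may fold the two `hwtime()` calls into one, which is the behavior this PR fixes.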
Author

btw, this code comes from SQLite, and it seems newer SQLite has it fixed.