Feature extraction corrections by bmascaro · Pull Request #1 · pasquale90/BeatNet

bmascaro · 2026-03-07T17:22:11Z

Major corrections to align c++ output data with BeatNet Python outputs

Framed Signal
Spectrum
Log Spectrogram
Spectrogram difference

…; load BeatNet cpp model

…hing results dir

…withing results dir

chg vals ch val change

stack features log spectrum and spectral diff newspectral diff change

… (22050 samples/sec) and for current audiowave file

…T_SOURCE_DIR in the download location variable

pasquale90

There are some changes that need to be made first to get consistent results. Once these are done, we can then test again to see if the C++ and Python results align. If not, we may need another round of adjustments.

pasquale90 · 2026-03-24T17:33:41Z

+{
+	int nMax = ((nFrames -1) * hopSize) + frameSize;
+	padded_signal.assign(nMax, 0.0f);
+
+	{
+		auto s0 = original_signal.begin();
+		auto sEnd = original_signal.end();
+		auto destination = padded_signal.begin() + frameSize / 2;
+
+		int i = frameSize / 2;
+
+		std::copy_if(s0, sEnd, destination, 
+			[&i, nMax](float x) 
+			{
+				return i++ < nMax; 
+			});
+	}
+
+	for (int iFrame = 0, index = 0; iFrame < nFrames; iFrame++, index += hopSize)
+	{
+		auto i0 = padded_signal.begin() + index;
+
+		std::vector<float> signal(i0,  i0 + frameSize);
+		frames.push_back(signal);
+	}
+}


Implementing the framing logic on the constructor results in the inputSignal variable being lost, because the object of the class is simply destructed after the scope of the caller (the audio callback) reach to the end of execution. The only way to maintain the information of it would be to declare inputSignal as a static object in the header, but that s not the best design IMO for that particular implementation.

This logic should be transferred to a process function that collects input buffers in each call, and utilizes them to create the frames that will feed the model. Take a look here

BeatNet/onnx/frameprocessor.h

Line 10 in 1985e3d

bool process(const std::vector<float>& input, std::vector<float>& frame_out);

There's an input buffer coming in, in each call, and an output frame coming out, while the function returns true if a valid frame is produced (simply because the first buffers wont be enough in length to produce a full frame)

pasquale90 · 2026-03-24T17:37:35Z

+
+    // slice original signal to Frames
+    const int nFrames = 4;
+    FramedSignal framedSignal{ resampledSignal , nFrames, FRAME_LENGTH, HOP_SIZE };


FramedSignal framedSignal should be declared on the header first. This will create an member object of the BeatNet class. Declaring it here does not make sense for many reasons.

At first, because the framedSignal object resides in the scope of the function, which means that its lifetime has automatic duration and at the end of each function call, the object gets destroyed and defined in the next call. So the concept of accumulating buffers for creating frames to feed the model, is in this way violated.

You should declare the object on the header, initialize it on the BeatNet constructor, and then call framedSignal.process() function from withing preprocess function.

pasquale90 · 2026-03-24T17:41:15Z

-    if (!valid_frame) {
-        // std::cout<<"invalid frame and will be invalid for the first ~"<<FRAME_LENGTH/resampled.size()-1<<" frames"<<std::endl;
-        return false;
-    }


You should probably keep this, to make sure that during the first calls of the function, where the first frame that is currently under formation while collecting the first buffers, will return false, aborting the inference of the model. Simply put, in such case, there is not yet a valid input signal to pass to the model.

pasquale90 · 2026-03-24T18:00:55Z

+    std::vector<float> getOriginalSignal();
+    int get_nFrames();


These functions are not used. Should they be removed?

pasquale90 · 2026-03-24T18:08:53Z

+    std::vector<float> resampledSignal = resampler.resample(raw_input); 
+
+    // slice original signal to Frames
+    const int nFrames = 4;


Prefer defining hyperparameters outside functions. Especially since this is a fixed value, you can either define it using #define ( i.e. #define FRAMED_SIGNAL_NFRAMES 4) in BeatNet.h along with the rest of them, or either in the top of the framedSignal.h. I believe that would make things clearer and more maintainable in case of you need to experiment with its value in the future.

pasquale90 and others added 24 commits January 18, 2026 01:08

add initial setup for testing the porting of the BeatNet model into C++

8849c87

test:implement cpp project structure for testing; integrate AudioFile…

951c8f8

…; load BeatNet cpp model

test: implement cpp testing and store results into separate files wit…

0cac0cd

…hing results dir

test: implement Python testing and store results into separate files …

0513dcb

…withing results dir

beatNet gitignore additions

2857f02

framed signal

f7b678f

filterbank processor

9d4a686

change parameter values

05787ba

chg vals ch val change

spectral diff

d57e872

stack features log spectrum and spectral diff newspectral diff change

changed file paths to std::filesystem paths

405c1f2

get sample rate from audiofile

b9b1c21

add vectors time and output0, output1

c7f9c70

changed the loop for audio block and frames processing

89ebc4c

beatpositions condition

a39f114

plots of signal output0 output1

836777f

separation of parameters buffersize, framelength, hopsize for BeatNet…

dc3ec8a

… (22050 samples/sec) and for current audiowave file

separate beat and downbeat

6bcc917

filepaths changes

16f1e7e

cleanup testCPP

4af2d0e

fix 128bpm filename

f2d026e

fix results

af33fc6

Cmakelists.txt : add copy dlls dependencies in the exe directory

289c7d4

submit updated results

644ed2a

fix minor bug in cmake/cpm.cmake : replaced LIB_DIR with CMAKE_CURREN…

b5dadd8

…T_SOURCE_DIR in the download location variable

pasquale90 requested changes Mar 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature extraction corrections #1

Feature extraction corrections #1
bmascaro wants to merge 24 commits intomainfrom
beats_prediction_beforePF

bmascaro commented Mar 7, 2026

Uh oh!

pasquale90 left a comment

Uh oh!

pasquale90 Mar 24, 2026

Uh oh!

pasquale90 Mar 24, 2026 •

edited

Loading

Uh oh!

pasquale90 Mar 24, 2026

Uh oh!

pasquale90 Mar 24, 2026

Uh oh!

pasquale90 Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bmascaro commented Mar 7, 2026

Uh oh!

pasquale90 left a comment

Choose a reason for hiding this comment

Uh oh!

pasquale90 Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

pasquale90 Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pasquale90 Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

pasquale90 Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

pasquale90 Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pasquale90 Mar 24, 2026 •

edited

Loading