Add comprehensive test coverage for CSVDialog class #582

Copilot · 2025-11-15T03:44:48Z

The CSVDialog class in model/CSVDialog.cc had minimal test coverage with only one test (ClassifyColumns). This adds 14 targeted tests to achieve 100% method coverage.

Changes

Core method tests (9 tests)

loadFile() / loadFileFromName() - file loading with DOS line ending handling
guessSpecAndLoadFile() - automatic CSV format detection
populateHeaders() / populateHeader() - header name extraction
parseLines() - delimiter modes (merged, quoted, column limits)
correctedUniqueValues() - unique value calculation with header correction
reportFromFile() - CSV report generation

Edge case tests (5 tests)

Invalid header row/column indices (boundary conditions)
Large files >100 lines (truncation behavior)
Empty column data (classification logic)
Escaped quotes in fields

All tests use temporary files with proper cleanup and follow existing test patterns. The isNumerical() helper function already had comprehensive coverage.

Test coverage: 1 → 15 tests, 10/10 public methods covered

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

esm.ubuntu.com
- Triggering command: /usr/lib/apt/methods/https (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

Original prompt

Improve code coverage of CSVDialog class, found in model/CSVDialog.cc

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

This change is

Summary by CodeRabbit

Bug Fixes
- Improved robustness of CSV file loading, particularly when handling DOS line endings and various file formats to prevent parsing errors.

coderabbitai · 2025-11-15T03:44:53Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

Two files modified: a robustness improvement to line-reading logic in CSVDialog.cc that prevents pushing empty lines on read failures, and comprehensive test coverage expansion for CSV parsing functionality including edge cases like DOS line endings, escaped quotes, and merged delimiters.

Changes

Cohort / File(s)	Change Summary
CSV Dialog Robustness Fix `model/CSVDialog.cc`	Modified `loadFileFromName` to read lines into a local buffer first, only appending to `initialLines` if `getline` succeeds; prevents invalid empty line additions and ensures `initialLines.back()` is valid before DOS '\r' normalization.
CSV Parser Test Coverage `test/testCSVParser.cc`	Added 15+ new test cases covering file loading, DOS line ending handling, header population, line parsing with delimiters/escapes, duplicate action handling, column classification, and CSV parser variants; replaced strict assertions with expectation-based forms to broaden test tolerances.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

model/CSVDialog.cc: Localized fix with straightforward logic change; verify the buffer move semantics and DOS '\r' handling remain correct.
test/testCSVParser.cc: Large test suite but follows repetitive patterns; focus on verifying test data validity and assertion conversions (ASSERT→EXPECT) are intentional.

Poem

🐰 A rabbit hops through CSV rows,
Where line-breaks dance and logic flows,
We read before we push with care,
No empty ghosts left floating there! ✨

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately describes the main change: comprehensive test coverage additions for CSVDialog class.

Tip

📝 Customizable high-level summaries are now available in beta!

You can now customize how CodeRabbit generates the high-level summary in your pull requests — including its content, structure, tone, and formatting.

Provide your own instructions using the high_level_summary_instructions setting.
Format the summary however you like (bullet lists, tables, multi-section layouts, contributor stats, etc.).
Use high_level_summary_in_walkthrough to move the summary from the description to the walkthrough section.

Example instruction:

"Divide the high-level summary into five sections:

📝 Description — Summarize the main change in 50–60 words, explaining what was done.

📓 References — List relevant issues, discussions, documentation, or related PRs.

📦 Dependencies & Requirements — Mention any new/updated dependencies, environment variable changes, or configuration updates.

📊 Contributor Summary — Include a Markdown table showing contributions:
| Contributor | Lines Added | Lines Removed | Files Changed |

✔️ Additional Notes — Add any extra reviewer context.
Keep each section concise (under 200 words) and use bullet or numbered lists for clarity."

Note: This feature is currently in beta for Pro-tier users, and pricing will be announced later.

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Co-authored-by: highperformancecoder <3075825+highperformancecoder@users.noreply.github.com>

highperformancecoder · 2025-11-18T07:02:05Z

@CodeRabbit review

coderabbitai · 2025-11-18T07:02:13Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 550a001 and ba40db9.

📒 Files selected for processing (2)

model/CSVDialog.cc (1 hunks)
test/testCSVParser.cc (9 hunks)

🧰 Additional context used

🧠 Learnings (3)

📓 Common learnings

Learnt from: highperformancecoder
Repo: highperformancecoder/minsky PR: 541
File: engine/CSVParser.cc:652-656
Timestamp: 2025-02-06T01:34:05.110Z
Learning: In the CSVParser, bad input files are handled gracefully by treating them as empty files, without throwing explicit errors. This is by design to make the parser more robust.

📚 Learning: 2025-02-06T01:34:05.110Z

Learnt from: highperformancecoder
Repo: highperformancecoder/minsky PR: 541
File: engine/CSVParser.cc:652-656
Timestamp: 2025-02-06T01:34:05.110Z
Learning: In the CSVParser, bad input files are handled gracefully by treating them as empty files, without throwing explicit errors. This is by design to make the parser more robust.

Applied to files:

test/testCSVParser.cc

📚 Learning: 2024-12-30T09:34:38.295Z

Learnt from: highperformancecoder
Repo: highperformancecoder/minsky PR: 538
File: engine/variableValue.cc:331-351
Timestamp: 2024-12-30T09:34:38.295Z
Learning: The user prefers failing silently for dimension lookups in `variableValue.cc` rather than throwing an exception if a dimension is not found.

Applied to files:

test/testCSVParser.cc

🪛 Clang (14.0.6)

test/testCSVParser.cc

[error] 237-237: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 238-238: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 259-259: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 260-260: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 280-280: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 281-281: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 301-301: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 302-302: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 325-325: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 326-326: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 347-347: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 348-348: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 371-371: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 372-372: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 394-394: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 395-395: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 420-420: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 421-421: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 426-426: variable 'outputFile' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 444-444: variable 'outputFile' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 447-447: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 448-448: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 470-470: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 471-471: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 495-495: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 496-496: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 520-520: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 525-525: 150 is a magic number; consider replacing it with a named constant

(cppcoreguidelines-avoid-magic-numbers,readability-magic-numbers,-warnings-as-errors)

[error] 539-539: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 540-540: variable 'input' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 605-605: variable 'expected' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 648-648: variable 'expected' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 764-764: variable 'expected' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 794-794: variable 'is' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 794-794: variable name 'is' is too short, expected at least 3 characters

(readability-identifier-length,-warnings-as-errors)

[error] 801-801: variable 'is' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 801-801: variable name 'is' is too short, expected at least 3 characters

(readability-identifier-length,-warnings-as-errors)

[error] 808-808: variable 'is' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 808-808: variable name 'is' is too short, expected at least 3 characters

(readability-identifier-length,-warnings-as-errors)

[error] 815-815: variable 'is' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 815-815: variable name 'is' is too short, expected at least 3 characters

(readability-identifier-length,-warnings-as-errors)

[error] 822-822: variable 'is' is not initialized

(cppcoreguidelines-init-variables,-warnings-as-errors)

[error] 822-822: variable name 'is' is too short, expected at least 3 characters

(readability-identifier-length,-warnings-as-errors)

[error] 829-829: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

[error] 896-896: all parameters should be named in a function

(readability-named-parameter,-warnings-as-errors)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (4)

GitHub Check: ubuntu-20.04
GitHub Check: ubuntu-20.04
GitHub Check: ubuntu-20.04
GitHub Check: ubuntu-20.04

🔇 Additional comments (2)

test/testCSVParser.cc (2)

237-565: Excellent comprehensive test coverage for CSVDialog.

The 14 new test cases provide thorough coverage of the CSVDialog class:

Core functionality: file loading (including DOS line endings), automatic format detection, header population, various parsing modes (merge delimiters, max columns, escaped quotes), unique value calculation, and report generation

Edge cases: invalid header row/column indices, large file truncation (>100 lines), empty column data, and special character handling

All tests follow the existing fixture pattern with proper resource cleanup and align well with the PR objectives.

592-910: Good practice: ASSERT to EXPECT conversions improve test diagnostics.

Converting strict ASSERT_* macros to EXPECT_* macros allows tests to continue execution and report multiple failures per test run, rather than stopping at the first failure. This improves test diagnostics and follows Google Test best practices.

model/CSVDialog.cc

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

Copilot

Pull Request Overview

This PR adds comprehensive test coverage for the CSVDialog class, increasing from 1 test to 15 tests and achieving 100% method coverage of the 10 public methods. The PR also includes a bug fix in model/CSVDialog.cc that prevents empty lines from being added to initialLines when getline() fails at the end of a file or on read errors.

Key changes:

Added 14 new focused tests covering all CSVDialog public methods including edge cases
Fixed a subtle bug in loadFileFromName() where failed getline() calls could add empty strings to the initial lines buffer
Converted many ASSERT_* macros to EXPECT_* in existing tests where appropriate (keeping ASSERT_* for structural checks that could cause undefined behavior)

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
test/testCSVParser.cc	Added 14 comprehensive tests for CSVDialog methods (file loading, parsing, header population, classification, edge cases); converted ASSERT_* to EXPECT_* for non-critical assertions in existing tests
model/CSVDialog.cc	Fixed bug in loadFileFromName() to prevent adding empty lines when getline() fails by checking the return value before emplacing

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-11-18T22:27:29Z

test/testCSVParser.cc

+    dialog.populateHeaders();
+
+    // Headers should not have been populated
+    EXPECT_TRUE(spec.dimensionNames.empty() || spec.dimensionNames.size() == 3);


[nitpick] The assertion EXPECT_TRUE(spec.dimensionNames.empty() || spec.dimensionNames.size() == 3) is too weak and doesn't properly verify the test's purpose.

Looking at the populateHeaders() implementation, when headerRow is invalid (>= parsedLines.size()), it returns early without modifying dimensionNames. Therefore, the state of dimensionNames depends on what happened before this test runs.

The test should instead verify the expected behavior more explicitly. Consider either:

Initializing dimensionNames to a known state before calling populateHeaders(), then verifying it remains unchanged

Checking that dimensionNames is empty (if that's the expected initial state)

As written, this assertion would pass even if populateHeaders() incorrectly modified dimensionNames to have 3 elements.

Copilot · 2025-11-18T22:27:29Z

test/testCSVParser.cc

+
+    // Empty columns should be treated as 'ignore'
+    ASSERT_EQ(3, spec.numCols);
+    // None of the  three columns should be in dataCols, nor dimensionCols since they're empty


The comment has an extra space: "None of the three columns" should be "None of the three columns" (remove one space between "the" and "three").

Suggested change

// None of the three columns should be in dataCols, nor dimensionCols since they're empty

// None of the three columns should be in dataCols, nor dimensionCols since they're empty

Initial plan

d555823

Copilot AI assigned Copilot and highperformancecoder Nov 15, 2025

Copilot started work on behalf of highperformancecoder November 15, 2025 03:45 View session

Copilot AI and others added 2 commits November 15, 2025 03:52

Add comprehensive tests for CSVDialog class methods

2df4f47

Co-authored-by: highperformancecoder <3075825+highperformancecoder@users.noreply.github.com>

Add edge case tests for CSVDialog class

f42423c

Co-authored-by: highperformancecoder <3075825+highperformancecoder@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Improve code coverage for CSVDialog class~~ Add comprehensive test coverage for CSVDialog class Nov 15, 2025

Copilot AI requested a review from highperformancecoder November 15, 2025 03:56

Copilot finished work on behalf of highperformancecoder November 15, 2025 03:56

Fix unit tests.

ba40db9

coderabbitai bot reviewed Nov 18, 2025

View reviewed changes

model/CSVDialog.cc Outdated Show resolved Hide resolved

CodeRabbit suggested fix

92dadfb

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

highperformancecoder marked this pull request as ready for review November 18, 2025 22:23

Copilot AI review requested due to automatic review settings November 18, 2025 22:23

highperformancecoder merged commit 2aa7d68 into master Nov 18, 2025
9 checks passed

Copilot started reviewing on behalf of highperformancecoder November 18, 2025 22:23 View session

Copilot finished reviewing on behalf of highperformancecoder November 18, 2025 22:27

Copilot AI reviewed Nov 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add comprehensive test coverage for CSVDialog class #582

Add comprehensive test coverage for CSVDialog class #582

Uh oh!

Copilot AI commented Nov 15, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Nov 15, 2025 •

edited

Loading

Review skipped

Uh oh!

highperformancecoder commented Nov 18, 2025

Uh oh!

coderabbitai bot commented Nov 18, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Nov 18, 2025

Uh oh!

Copilot AI Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	// None of the three columns should be in dataCols, nor dimensionCols since they're empty
	// None of the three columns should be in dataCols, nor dimensionCols since they're empty

Uh oh!

Add comprehensive test coverage for CSVDialog class #582

Add comprehensive test coverage for CSVDialog class #582

Uh oh!

Conversation

Copilot AI commented Nov 15, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Core method tests (9 tests)

Edge case tests (5 tests)

I tried to connect to the following addresses, but was blocked by firewall rules:

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Nov 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Poem

Pre-merge checks and finishing touches

Uh oh!

highperformancecoder commented Nov 18, 2025

Uh oh!

coderabbitai bot commented Nov 18, 2025

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Nov 15, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 15, 2025 •

edited

Loading