fix codelist operations type inference for empty dataframes by gerrycampion · Pull Request #1709 · cdisc-org/cdisc-rules-engine

gerrycampion · 2026-04-24T15:34:10Z

Fixes the issue for CORE-000857 with #1651. I cannot reproduce the issue for CORE-000873, CORE-000874 without the proper data.

Previously, we would get an Execution Error because the operations input dataframe is empty (after match datasets) and the column datatypes cannot be inferred.
Broken.xlsx
Now we get a skip.
Fixed.xlsx

Ran with following data and launch config:
pilot_LZZT_narrative_2026MAR10.json

{
  "name": "USDM Rule",
  "type": "debugpy",
  "request": "launch",
  "program": "${workspaceFolder}/core.py",
  "console": "integratedTerminal",
  "args": [
    "validate",
    "-s",
    "usdm",
    "-v",
    "4-0",
    "-dp",
    "pilot_LZZT_narrative_2026MAR10.json",
    "-r",
    "CORE-000857",
    "-of",
    "XLSX",
    "-of",
    "json",
    "-l",
    "debug"
  ]
}

RamilCDISC

Would it be better to add a unit or regression test for this case?

gerrycampion · 2026-04-27T15:46:16Z

Would it be better to add a unit or regression test for this case?

Yes, I added unit tests that fail on main but pass on this branch

RamilCDISC · 2026-04-27T20:04:00Z

            self.params.ct_package_type, unique_ct_versions
        )
        ct_df = self.evaluation_dataset.__class__.from_dict(ct_data)
+        ct_df = ct_df.astype(


The PR only converts the ct_df to string but evaluation dataset is still not made safe for merge. if it has empty columns the merge can fail. I have noticed that in the test the evaluation dataset was made safe manually by converting it to string. That let me to think that we need to make the evaluation dataset safe here too to merge without errors.

i believe that the fix for the types allowed us to have empty evaluation datasets as well. I've removed the conversions from the tests.

RamilCDISC · 2026-04-27T20:11:43Z

        ct_df = self.evaluation_dataset.__class__.from_dict(ct_data)
+        ct_df = ct_df.astype(
+            {
+                "version": str,


using str to covnert to string here can convert missing values like nan to "nan" which could create issue in downstream flow. One approach can be to convert like:
`{

"version": "string", "codelist_code": "string"

}
`

RamilCDISC · 2026-04-27T20:12:01Z

        ct_df = self.evaluation_dataset.__class__.from_dict(ct_data)
+        ct_df = ct_df.astype(
+            {
+                "version": str,


same here too

…s-in-usdm-rule-execution

RamilCDISC

The PR fixes the bug for empty dataframe in codelist operations. The PR was validated by:

Reviewing the PR for any unwanted code or comments.
Reviewing the PR in accordance with the AC.
Reviewing the updated tests for logic and coverage.
Reviewing the code for maintaining quality and downstream safe checks.
Running validation using positive dataset in dev editor.
Running validation using negative dataset in dev editor.
Running validation using edge case of empty dataframe.
Ensuring all unit and integration testing pass.

fix codelist operations type inference for empty dataframes

f999224

gerrycampion linked an issue Apr 24, 2026 that may be closed by this pull request

New version of Core engine causes errors in USDM rule execution #1651

Closed

gerrycampion temporarily deployed to DEV April 24, 2026 15:34 — with GitHub Actions Inactive

gerrycampion requested review from RamilCDISC, SFJohnson24 and pendingintent April 24, 2026 15:34

RamilCDISC requested changes Apr 24, 2026

View reviewed changes

added unit tests

25b1b54

gerrycampion temporarily deployed to DEV April 27, 2026 15:45 — with GitHub Actions Inactive

gerrycampion requested a review from RamilCDISC April 27, 2026 15:46

RamilCDISC temporarily deployed to DEV April 27, 2026 19:47 — with GitHub Actions Inactive

RamilCDISC requested changes Apr 27, 2026

View reviewed changes

Fixed str dtype

c933f4d

gerrycampion temporarily deployed to DEV April 27, 2026 21:08 — with GitHub Actions Inactive

removed casting from tests

5f62f89

gerrycampion temporarily deployed to DEV April 27, 2026 21:22 — with GitHub Actions Inactive

Merge branch 'main' into 1651-new-version-of-core-engine-causes-error…

1128e63

…s-in-usdm-rule-execution

gerrycampion had a problem deploying to DEV April 27, 2026 21:23 — with GitHub Actions Failure

gerrycampion temporarily deployed to DEV April 27, 2026 21:36 — with GitHub Actions Inactive

gerrycampion requested a review from RamilCDISC April 27, 2026 21:41

RamilCDISC approved these changes Apr 27, 2026

View reviewed changes

gerrycampion merged commit 4d89e6d into main Apr 28, 2026
13 of 14 checks passed

gerrycampion deleted the 1651-new-version-of-core-engine-causes-errors-in-usdm-rule-execution branch April 28, 2026 15:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix codelist operations type inference for empty dataframes#1709

fix codelist operations type inference for empty dataframes#1709
gerrycampion merged 5 commits into
mainfrom
1651-new-version-of-core-engine-causes-errors-in-usdm-rule-execution

gerrycampion commented Apr 24, 2026

Uh oh!

RamilCDISC left a comment

Uh oh!

gerrycampion commented Apr 27, 2026

Uh oh!

RamilCDISC Apr 27, 2026

Uh oh!

gerrycampion Apr 27, 2026

Uh oh!

RamilCDISC Apr 27, 2026

Uh oh!

RamilCDISC Apr 27, 2026

Uh oh!

RamilCDISC left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gerrycampion commented Apr 24, 2026

Uh oh!

RamilCDISC left a comment

Choose a reason for hiding this comment

Uh oh!

gerrycampion commented Apr 27, 2026

Uh oh!

RamilCDISC Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

gerrycampion Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

RamilCDISC Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

RamilCDISC Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

RamilCDISC left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants