Dpg by yolile · Pull Request #6 · CivicDataLab/risk-score-model-generic

yolile · 2026-05-31T23:08:37Z

closes #2
closes #4
closes #1

See the commits for the detailed changes.

The tool is now a library, so it is easier to reuse. The library includes generating sample config files and synthetic data for testing in a new geography. I kept the Indian examples and specific scripts under contrib/india, however, I'm not sure if they are needed.

I'm also not sure if the hazard plot is needed (or all the prints).

Note that this changes the final output columns and format (for time_period). Also, the input column names.
The output column names are still kebab-cased and not snake_case; I'm not sure if this is needed.

A pypi.yml github action is still needed to automate the package publication

- Move scripts in assets to contrib/india/maps, as they are India specific - Remove test, as not relevant for DPG - Move district_objectid.csv to data - Update docs accordingly - remove assets

…kebab-case for output/display columns - fix generic topsis script Note: the new outputs must be udpated at https://github.com/CivicDataLab/IDS-DRR-Data-Management/tree/main/layer/assets/indicators

- Remove Topsis class, convert to topsis fuction inside topsis_riskscore - Calculate the worst condition only (was the only method used) - User higher is better always (criteria was always True, True, True, True

- Output names don't need to be configurable, change them to const in common - topsis_riskscore.py now derives the factor columns and weight vector from one ordered FACTOR_WEIGHTS list - one spelling per config file

…del` package Move the flat scripts/ collection and config/loader.py into a src/-layout package and add packaging so the project is pip-installable, eliminating the sys.path.insert hack every script previously carried. - src/disaster_risk_score_model/: the four factor modules, topsis (was topsis_riskscore), sample_data (was generate_sample_data), dea, common, and a rewritten config loader. Intra-package absolute imports replace the sys.path manipulation; __main__ blocks are removed. - New cli.py exposes a single `drsm` console command (also runnable as `python -m disaster_risk_score_model`) with subcommands per stage plus init-config / generate-sample-data / run. - config.py gains command-generated-only config resolution (--config-dir / RISK_MODEL_CONFIG_DIR / ./config, else a clear error), an init_config scaffolder, and resolve_data_dir/resolve_input_file so I/O locations come from the CLI/env rather than the repo root. Factor/topsis main() functions take config_dir/data_dir/input_file; data paths resolve under --data-dir. - pyproject.toml (setuptools, deps, `drsm` entry point, bundled config templates as package data); requirements*.txt reduced to `-e .` / `-e .[dev]`. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Replace the six per-script config files (base + one per factor + topsis) with two files that `drsm init-config` scaffolds: - scores_config.toml: a shared [columns] table (single source of truth) plus a nested section per factor ([hazard.*], [exposure.*], [vulnerability.*], [govtresponse.*]). - topsis_config.toml: [weights] and [classification] (required) with the indicators/rounding/cumulative_vars/derivations/renames sections now optional. I/O locations leave the config entirely: [paths] is dropped: the data folder and input filename come from --data-dir/--input-file, and the TOPSIS district lookup and outputs are fixed names resolved under the data dir. The generic set ships as bundled package-data templates; the India example is migrated to the same two-file layout. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

- Rewrite README, getting_started, the per-factor/topsis methodology docs, the India example README, CONTRIBUTING and SECURITY to use the `drsm` commands and the two-file config, replacing the old `python scripts/*.py` + per-file config references. - Drive the smoke test through the CLI (`python -m disaster_risk_score_model`) in a temp working dir; point test_dea at the package import; delete the obsolete sys.path conftest. - CI installs the package (`pip install -e .[dev]`) and sets MPLBACKEND=Agg. - Move data_dictionary.csv to docs/ (it is reference documentation, not data) and gitignore the top-level data/ wholesale: it is now just the default --data-dir scratch space, with MASTER_VARIABLES.csv / district_objectid.csv regenerated by `drsm generate-sample-data`. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…non-configurable To more standard and generic names. Add checks to ensure they are present

yolile added 14 commits May 31, 2026 18:56

feat: refactor assets

3640a8d

- Move scripts in assets to contrib/india/maps, as they are India specific - Remove test, as not relevant for DPG - Move district_objectid.csv to data - Update docs accordingly - remove assets

feat: remove unused code, move india specific to contrib/india

bbdd7cb

feat: add syntetich data and configs, move India configs to India

068c7ee

fix: fix dea for small locations

82f6362

tests: add tests

07ba7ae

docs and CI: updates docs and add CI

c6c26f0

docs: fix CITATION.cff

784f053

feat: implement snake_case for all inputs/config keys/intermediates, …

4071182

…kebab-case for output/display columns - fix generic topsis script Note: the new outputs must be udpated at https://github.com/CivicDataLab/IDS-DRR-Data-Management/tree/main/layer/assets/indicators

feat: remove unused scripts

9951a12

fix: fix object_id use

625837e

feat: deduplicate code in scripts, remove dead code

9666d24

feat: simplify topsis calculation

5445e28

- Remove Topsis class, convert to topsis fuction inside topsis_riskscore - Calculate the worst condition only (was the only method used) - User higher is better always (criteria was always True, True, True, True

feat: single source of truth for factor-output names

044ecf2

- Output names don't need to be configurable, change them to const in common - topsis_riskscore.py now derives the factor columns and weight vector from one ordered FACTOR_WEIGHTS list - one spelling per config file

docs: fix mention of hazard config

6e9f956

yolile force-pushed the dpg branch from 5190b99 to 6e9f956 Compare June 1, 2026 15:08

yolile and others added 12 commits June 1, 2026 13:32

feat: simplify dea to keep only the fuctions used by the project

8fec9cc

chore: add ruff, lint, run and fix ruff issues, add pre commit

1feb9e4

fix: fix PTH ruff check

190b90e

refactor: move library to root path

270f92f

fix: remove (now) unused requirements files

5c415ce

docs: tidy up documentation

9a7bc5d

refactor: rename timeperiod, district_id and object_id and make them …

ac08558

…non-configurable To more standard and generic names. Add checks to ensure they are present

feat: change time_period format to standard YYYY-MM

b0180ee

fix: update map_exporter to use unit_id instead of object_id

f9822f8

yolile marked this pull request as ready for review June 2, 2026 00:31

yolile requested a review from saurabhlevin June 2, 2026 00:31

fix: tidy up code with const variables usage

4bc8cdd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dpg#6

Dpg#6
yolile wants to merge 27 commits into
mainfrom
dpg

yolile commented May 31, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yolile commented May 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

yolile commented May 31, 2026 •

edited

Loading