Sparks

Welcome to Sparks!

Sparks is your open-source Python toolkit for connecting energy system modeling results with Life Cycle Assessment (LCA) workflows. If you use Calliope for energy system modeling and want to analyze your results with ENBIOS, Sparks makes the process smooth and standardized.

Why Sparks?

Bridging energy system outputs and LCA data can be tricky. Sparks automates the conversions, data mapping, and regional aggregation—so you can focus on insights, not data wrangling. All the outputs can be directly used in ENBIOS, following its hierarchical structure.

Key Features

🔄 Effortless Data Integration: Connects ESM outputs with LCA inventory data in a snap.
📊 Automatic Unit Conversion: Sparks adapts energy units for you in a flexible way.
🌍 Flexible Regional Aggreagtion: Analyze at national or subnational levels.
🏗️ Hierarchical Data Structure: Outputs are ready for ENBIOS, with dendrogram-like hierarchies.
⚡ Optimized Performance: Fast processing with pandas and smart caching.
🔧 Customizable: Supports multiple databases and your own conversion factors.

Getting Started

Prerequisites

Python 3.11 or newer
A Brightway2 database (with ecoinvent or similar LCA data)
Required Python packages (see environment.yaml)

Installation

Clone the repository:

git clone https://github.com/LIVENlab/Sparks.git
cd Sparks

Setup

Clone the repository:

git clone https://github.com/LIVENlab/Sparks.git
cd Sparks

Install dependencies using conda (recommended):

conda env create -f environment.yaml
conda activate sparks

Quick Start

Basic Usage

from Sparks.util.base import SoftLink

# Initialize Sparks with your data directory and Brightway project
enbios_mod = SoftLink(
    path='path/to/your/data',
    bw_project='your_brightway_project_name'
)

# Preprocess and transform the data
enbios_mod.preprocess()

# Generate ENBIOS input file
enbios_mod.data_for_ENBIOS(
    smaller_vers=False,
    path_save='output.json'
)

Complete Workflow Example

from Sparks.util.base import SoftLink
import pandas as pd

# 1. Prepare your data directory with:
#    - basefile.xlsx (configuration file)
#    - energy system output files (CSV format)

# 2. Initialize Sparks
sparks_object = SoftLink(
    path='testing/data_test',
    bw_project='Hydrogen_SEEDS'
)

# 3. Preprocess data (maps LCI data with energy system data)
sparks_object.preprocess()

# 4. Check preprocessed units
print(sparks_object.preprocessed_units)

# 5. Generate ENBIOS input
sparks_object.data_for_ENBIOS(
    smaller_vers=False,
    path_save='test.json'
)

Input Requirements

Basefile Structure

The core configuration file (basefile.xlsx) must contain a Processors sheet with the following columns:

Column	Description	Example
`Processor`	Technology name in energy system model	`"CHP_hydrogen"`
`Region`	Geographic region identifier	`"Spain"` or `"Spain_1"`
`@SimulationCarrier`	Energy carrier type	`"HEAT"`, `"ELECTRICITY"`
`ParentProcessor`	Hierarchical parent technology	`"Power_Generation"`
`@SimulationToEcoinventFactor`	Unit conversion factor	`1.0` or `0.001`
`@Ecoinvent_key_code`	Brightway database activity code	`"CHP_hydrogen_2050"`
`File_source`	Source CSV file name	`"flow_out_sum.csv"`
`geo_loc`	Distinguish betwen onsite-offsite, operation or infrastructure	`"onsite"`

See the Pandera validations schemas here

Energy Data File Requirements

Format: CSV files with comma separation
Naming: Main data column (energy value) must match the filename
- File: flow_out_sum.csv → Column: flow_out_sum
Required Columns:
- techs: Technology identifiers (they must match the Processor in basefile)
- locs: Location identifiers
- energy_value: Main energy values
Optional Columns:
- carriers: Energy carrier types
- - spores or scenario: Scenario identifiers

Directory Structure

your_data_directory/
├── basefile.xlsx          # Configuration file
├── flow_out_sum.csv       # Energy flow data
├── energy_cap.csv         # Capacity data
├── nameplate_capacity.csv # Nameplate capacity data
└── other_data_files.csv   # Additional data files

Advanced Features

Regional Pattern Detection

Sparks automatically detects regional naming patterns. An aggregated analysis can be activated using :

sparks_object.preprocess(national=True)

National: "Spain", "Germany"
Subnational: "Spain_Madrid", "Germany_Berlin"

Unit Conversion Pipeline

Input Validation: Ensures data structure consistency
Technology Filtering: Maps energy technologies to LCA activities
Unit Adaptation: Applies conversion factors from basefile
Regional Aggregation: Optionally combines subnational data

Performance Optimizations

Activity Caching: Reduces database queries for repeated activities
Batch Processing: Efficient handling of large datasets
Memory Management: Optimized pandas operations

Troubleshooting

Debug Mode

# Enable detailed logging
import warnings
warnings.filterwarnings('always')

# Check data at each step
print(f"Data shape: {df.shape}")
print(f"Columns: {df.columns.tolist()}")
print(f"NaN summary: {df.isna().sum()}")

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation

If you use Sparks in your research, please cite:

@software{sparks2024,
  title={Sparks: Energy System to LCA Integration Tool},
  author={de Tomás, Alexander},
  year={2024},
  url={https://github.com/LIVENlab/Sparks}
}

Contact

Maintainer: Alexander de Tomás
Organization: LIVENlab
Repository: https://github.com/LIVENlab/Sparks

Acknowledgments

Calliope - Energy system modeling framework
ENBIOS - LCA analysis platform
Brightway25 - LCA database framework
pandas - Data manipulation library

Name		Name	Last commit message	Last commit date
Latest commit History 245 Commits
.idea		.idea
Sparks		Sparks
testing		testing
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
demo.ipynb		demo.ipynb
environment.yaml		environment.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Sparks

Welcome to Sparks!

Why Sparks?

Key Features

Getting Started

Prerequisites

Installation

Setup

Quick Start

Basic Usage

Complete Workflow Example

Input Requirements

Basefile Structure

Energy Data File Requirements

Directory Structure

Advanced Features

Regional Pattern Detection

Unit Conversion Pipeline

Performance Optimizations

Troubleshooting

Debug Mode

License

Citation

Contact

Acknowledgments

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

LIVENlab/Sparks

Folders and files

Latest commit

History

Repository files navigation

Sparks

Welcome to Sparks!

Why Sparks?

Key Features

Getting Started

Prerequisites

Installation

Setup

Quick Start

Basic Usage

Complete Workflow Example

Input Requirements

Basefile Structure

Energy Data File Requirements

Directory Structure

Advanced Features

Regional Pattern Detection

Unit Conversion Pipeline

Performance Optimizations

Troubleshooting

Debug Mode

License

Citation

Contact

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages