This repository provides an inference-ready ONNX implementation of a Deep Q-Network (DQN) agent trained to select the optimal data center in a 5G / MEC latency optimization scenario.
The model is exported from Stable-Baselines3 (DQN) and distributed as a reproducible, lightweight artifact suitable for deployment, benchmarking, and research use.
This model is a Reinforcement Learning–based decision agent that predicts a discrete action:
Input:
A normalized tensor representing the current state of three candidate data centers, shaped as (3 × 10), where each row corresponds to a data center and each column represents a feature such as client identifier, resource utilization, network metrics, latency statistics, packet loss, and carbon intensity.
Output:
A discrete action in {0, 1, 2} corresponding to the selection of the optimal data center, computed as the index with the highest predicted Q-value.
```
.
├── model/
│   ├── 5g_latency_opt_dqn_model.onnx   # ONNX model (opset 18)
│   └── model_config.json               # Model metadata, I/O specs, preprocessing params
│
├── src/
│   ├── inference_engine.py             # ONNXRuntime inference wrapper
│   ├── state_serializer.py             # Builds (1,3,10) input tensor from JSON
│   ├── minmax_scaler.py                # MinMax scaling (training-fitted params)
│   └── action_interpreter.py           # Human-readable action decoding
│
├── demo.py                             # End-to-end inference demo
├── requirements.txt                    # Minimal inference dependencies
└── README.md
```
The model was trained on a tabular dataset containing per–data center telemetry and network metrics. Each decision step groups three rows (one per candidate data center) into a single observation.
After preprocessing, each data center is represented by 10 features:
- `client_id` (label-encoded)
- `cpu_usage_percent`
- `memory_usage_percent`
- `disk_usage_percent`
- `net_in_percent`
- `net_out_percent`
- `latency_avg`
- `latency_mdev`
- `lost_percent`
- `carbon_intensity`
All numeric features are MinMax-scaled using parameters learned on the training dataset.
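The scaling step can be sketched as follows; the function name and the way min/max parameters are passed are illustrative (the repository ships its own `minmax_scaler.py` with training-fitted values):

```python
import numpy as np

def minmax_scale(x, data_min, data_max):
    """Scale a feature vector to [0, 1] using training-fitted min/max values.

    `data_min` / `data_max` stand in for the parameters learned on the
    training dataset; the argument names here are hypothetical.
    """
    x = np.asarray(x, dtype=np.float32)
    rng = np.asarray(data_max, dtype=np.float32) - np.asarray(data_min, dtype=np.float32)
    rng[rng == 0] = 1.0  # guard against division by zero for constant features
    return (x - np.asarray(data_min, dtype=np.float32)) / rng
```

Applying the same formula with different (e.g. per-batch) min/max values would shift the model's input distribution, so the training-time parameters must be reused verbatim.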
- Algorithm: Deep Q-Network (DQN)
- Framework: Stable-Baselines3
- Policy network: MLP-based Q-network
- Export format: ONNX (opset 18)
The ONNX model contains only the Q-network, optimized for inference.
Inputs
- Name: `observation`
- Type: `float32`
- Shape: `(batch_size, 3, 10)`

Where:
- `3` = number of candidate data centers
- `10` = feature vector length per data center
Feature Order (Last Dimension)
The feature order must be exactly:
```
[
  client_id,
  cpu_usage_percent,
  memory_usage_percent,
  disk_usage_percent,
  net_in_percent,
  net_out_percent,
  latency_avg,
  latency_mdev,
  lost_percent,
  carbon_intensity
]
```
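Building the `(1, 3, 10)` observation tensor in this exact feature order can be sketched like this (the repository's `state_serializer.py` does this from JSON; the function name and input structure here are illustrative):

```python
import numpy as np

# Column order must match the model's training-time feature order exactly.
FEATURES = [
    "client_id", "cpu_usage_percent", "memory_usage_percent",
    "disk_usage_percent", "net_in_percent", "net_out_percent",
    "latency_avg", "latency_mdev", "lost_percent", "carbon_intensity",
]

def build_observation(data_center_states):
    """Stack three per-data-center feature dicts into a (1, 3, 10) float32 tensor.

    `data_center_states` is assumed to be a list of three dicts keyed by the
    (already preprocessed) feature names above.
    """
    rows = [[float(dc[f]) for f in FEATURES] for dc in data_center_states]
    return np.asarray(rows, dtype=np.float32)[np.newaxis, :, :]
```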
Outputs
- Name: `q_values`
- Type: `float32`
- Shape: `(batch_size, 3)`

Each output value represents the Q-value of selecting a specific data center:
- `0` → Data Center 0 (Milan)
- `1` → Data Center 1 (Rome)
- `2` → Data Center 2 (Cosenza)
The final decision is:
```
action = argmax(q_values)
```

Setup Environment
Create and activate a virtual environment, then install dependencies:
```
python -m venv .venv
source .venv/bin/activate   # Linux / macOS
# .venv\Scripts\activate    # Windows
pip install -r requirements.txt
```

Minimum runtime dependencies:
- `onnxruntime`
- `numpy`
- `pandas`
Run Inference Script
Run the demo script:
```
python demo.py
```

The demo will:
- Load a JSON scenario containing dataCenterStates
- Apply preprocessing (MinMax scaling + client_id encoding)
- Run inference using ONNXRuntime
- Print the selected data center and corresponding Q-values
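The inference and argmax steps above can be sketched with ONNX Runtime; the model path comes from the repository tree, while the function names are illustrative rather than the repo's actual API:

```python
import numpy as np

def select_action(q_values):
    """Map a (batch, 3) Q-value array to the argmax action of the first sample."""
    return int(np.argmax(q_values, axis=-1)[0])

def run_inference(observation, model_path="model/5g_latency_opt_dqn_model.onnx"):
    """Run the exported Q-network on a (1, 3, 10) observation.

    Sketch only: assumes the ONNX file from the repo tree is present locally.
    """
    import onnxruntime as ort  # imported lazily so select_action works without it
    session = ort.InferenceSession(model_path)
    input_name = session.get_inputs()[0].name
    (q_values,) = session.run(None, {input_name: observation.astype(np.float32)})
    return select_action(q_values), q_values
```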
- The model is trained for exactly three data centers; the input shape is fixed.
- Inference requires the same preprocessing parameters used during training:
  - MinMaxScaler `data_min` / `data_max`
  - `client_id` encoding mapping
- Unknown or unseen `client_id` values must be handled explicitly.
- Performance outside the training distribution is not guaranteed.
- This is a decision-support model, not a guaranteed optimal controller.
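One way to handle unseen `client_id` values explicitly is to fail fast unless a fallback code is provided; the function and parameter names below are hypothetical, not part of the repository's API:

```python
def encode_client_id(client_id, mapping, fallback=None):
    """Label-encode a client_id with the training-time mapping.

    Unseen ids either map to an explicit fallback code or raise, so they are
    never silently fed to the model. `mapping` and `fallback` are assumptions
    about how the encoding is stored, not the repo's actual interface.
    """
    if client_id in mapping:
        return mapping[client_id]
    if fallback is not None:
        return fallback
    raise KeyError(f"Unseen client_id {client_id!r}; model output would be unreliable")
```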
The ONNX model and inference bundle are archived on Zenodo for reproducibility and citation:
- Zenodo record: 10.5281/zenodo.18303750
- DOI: 10.5281/zenodo.18303750