`its-hub`: A Python library for inference-time scaling

its_hub is a Python library for inference-time scaling of LLMs, focusing on mathematical reasoning tasks.

📚 Documentation

For comprehensive documentation, including installation guides, tutorials, and API reference, visit:

Installation

Choose the installation option based on which algorithms you need:

# Core installation - includes:
#   - Best-of-N with LLM Judge
#   - Self-Consistency
#   - OpenAI-compatible language models
pip install its_hub

# Process Reward Model installation - adds:
#   - Particle Filtering Algorithms
#   - Beam Search
#   - LocalVllmProcessRewardModel
#   - Required for step-by-step reasoning with process reward models
pip install its_hub[prm]

# Development installation
git clone https://github.com/Red-Hat-AI-Innovation-Team/its_hub.git
cd its_hub
pip install -e ".[dev]"

Quick Start

Example 1: Best-of-N with LLM Judge

Installation required: pip install its_hub (core)

Use Best-of-N algorithm with an LLM judge for response selection - works with any OpenAI-compatible API:

from its_hub.lms import OpenAICompatibleLanguageModel
from its_hub.algorithms import BestOfN
from its_hub.integration.reward_hub import LLMJudgeRewardModel

# Initialize language model
lm = OpenAICompatibleLanguageModel(
    endpoint="https://api.openai.com/v1",
    api_key="your-api-key",
    model_name="gpt-4o-mini",
)

# Set up LLM judge for scoring
judge = LLMJudgeRewardModel(
    model="gpt-4o-mini",
    criterion="overall_quality",
    judge_type="groupwise",
    api_key="your-api-key",
)
scaling_alg = BestOfN(judge)

# Generate multiple responses and select the best
result = scaling_alg.infer(lm, "Explain quantum entanglement in simple terms", budget=4)
print(result)

Example 2: Particle Filtering with Process Reward Model

Installation required: pip install its_hub[prm]

Use Particle Filtering for step-by-step reasoning with process reward models:

from its_hub.utils import SAL_STEP_BY_STEP_SYSTEM_PROMPT
from its_hub.lms import OpenAICompatibleLanguageModel, StepGeneration
from its_hub.algorithms import ParticleFiltering
from its_hub.integration.reward_hub import LocalVllmProcessRewardModel

# Initialize language model (requires vLLM server running)
lm = OpenAICompatibleLanguageModel(
    endpoint="http://localhost:8100/v1",
    api_key="NO_API_KEY",
    model_name="Qwen/Qwen2.5-Math-1.5B-Instruct",
    system_prompt=SAL_STEP_BY_STEP_SYSTEM_PROMPT,
)

# Set up step generation and process reward model
sg = StepGeneration(step_token="\n\n", max_steps=32, stop_token=r"\boxed")
prm = LocalVllmProcessRewardModel(
    model_name="Qwen/Qwen2.5-Math-PRM-7B",
    device="cuda:0",
    aggregation_method="prod"
)
scaling_alg = ParticleFiltering(sg, prm)

# Solve with step-by-step reasoning
result = scaling_alg.infer(lm, "Solve x^2 + 5x + 6 = 0", budget=8)
print(result)

Key Features

🔬 Multiple Algorithms: Particle Filtering, Best-of-N, Beam Search, Self-Consistency
🚀 OpenAI-Compatible API: Easy integration with existing applications
🧮 Math-Optimized: Built for mathematical reasoning with specialized prompts
📊 Benchmarking Tools: Compare algorithms on MATH500 and AIME-2024 datasets
⚡ Async Support: Concurrent generation with limits and error handling

For detailed documentation, visit: https://ai-innovation.team/its_hub

Name		Name	Last commit message	Last commit date
Latest commit History 241 Commits
.claude		.claude
.devcontainer		.devcontainer
.github/workflows		.github/workflows
docs		docs
its_hub		its_hub
notebooks		notebooks
scripts		scripts
tests		tests
.gitignore		.gitignore
.jupytext.yml		.jupytext.yml
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
justfile		justfile
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

`its-hub`: A Python library for inference-time scaling

📚 Documentation

Installation

Quick Start

Example 1: Best-of-N with LLM Judge

Example 2: Particle Filtering with Process Reward Model

Key Features

About

Uh oh!

Releases 20

Packages

Uh oh!

Contributors 13

Languages

License

Red-Hat-AI-Innovation-Team/its_hub

Folders and files

Latest commit

History

Repository files navigation

its-hub: A Python library for inference-time scaling

📚 Documentation

Installation

Quick Start

Example 1: Best-of-N with LLM Judge

Example 2: Particle Filtering with Process Reward Model

Key Features

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 20

Packages 0

Uh oh!

Contributors 13

Languages

`its-hub`: A Python library for inference-time scaling

Packages