SmellHunter API

Event-driven API for detecting code smells using metrics analysis and a Domain Specific Language (DSL).

Research Motivation

Problem

Code smells are internal structures in source code that violate coding conventions and design principles, harming the internal quality of evolving systems and indicating issues of architectural and design degradation.

They typically arise when developers make hurried or poorly planned modifications to implement features or fix problems.

Research Gap

Traditional detection approaches focus mainly on static analysis and predefined technical metrics. However, such approaches often ignore important aspects of the development context, such as team characteristics, project constraints, and the stage of software evolution.

Proposed Approach

Unlike traditional detection approaches, SmellHunter integrates technical metrics alongside development context.

The tool supports asynchronous analyses, reducing interference with the developer’s workflow while enabling scalable and incremental processing.

This approach aims to reduce false positives and helps in refactoring decisions aligned with real-world development contexts.

Architecture

The system uses an event bus pattern with the following event types:

ANALYSIS_REQUESTED
VALIDATION_COMPLETED / VALIDATION_FAILED
ANALYSIS_COMPLETED
PERSISTENCE_COMPLETED

Detection Workflow

flowchart LR

A[Eclipse Plugin / Client] --> B[POST /analyze]

B --> C[API Gateway]

C --> D[Event: ANALYSIS_REQUESTED]

D --> E[Validation Service]

E --> F{Validation Result}

F -->|Success| G[Event: VALIDATION_COMPLETED]
F -->|Failure| X[Event: VALIDATION_FAILED]

G --> H[Interpreter Engine]

H --> I[Event: ANALYSIS_COMPLETED]

I --> J[Persistence Worker]

J --> K[(Smell Storage)]

K --> L[GET /status]
K --> M[GET /smells]

API Endpoints

`POST /analyze`

Initiates asynchronous smell analysis.

Request Format: multipart/form-data or application/json

Required Parameters (multipart/form-data):

Field	Type	Required	Description
user_id	string	Yes	User identifier
smell_dsl	file	Yes	`.smelldsl` file with smell definitions
metrics	file	Yes	CSV/JSON file with metric values
thresholds	file	Yes	CSV/JSON file with threshold values

Optional Parametes

Field	Type	Required	Description
loc_id	string	Yes	Location identifier
project_id	string	Yes	Project identifier
org_id	string	Yes	Company identifier

File Formats:

`metrics.csv`

Metrica,Valor
GodClass.ATFD,12
GodClass.TCC,4
LongMethod.LOC,300

`thresholds.csv`

Metrica,Valor
GodClass.ATFD-LIMIT,10
GodClass.TCC-LIMIT,5
LongMethod.LOC-LIMIT,100

`smelldsl`:

smelltype DesignSmell;
smell GodClass extends DesignSmell {
    feature ATFD with threshold 4, 10;
    feature TCC with threshold 3, 5;
    treatment "Refactor into smaller classes";
}
rule GodClassRule when (GodClass.ATFD > GodClass.ATFD-LIMIT) then "Flag";

JSON Request (alternative):

{
  "user_id": 3,
  "smell_dsl": "smelltype DesignSmell; smell GodClass extends...",
  "metrics": {
    "GodClass.ATFD": 12,
    "GodClass.TCC": 4
  },
  "thresholds": {
    "GodClass.ATFD-LIMIT": 10,
    "GodClass.TCC-LIMIT": 5
  },
  "request_data": {
    "org_id": 2,
    "loc_id": 3,
    "project_id": 1,
    "file_path": "/src/Main.java",
    "language": "java",
    "branch": "main",
    "commit_sha": "abc123"
  }
}

Response (202 Accepted):

{
  "status": "accepted",
  "ctx_id": "550e8400-e29b-41d4-a716-446655440000",
  "smell_id": "6ba7b810-9dad-11d1-80b4-00c04fd430c8"
}

`GET /status/<ctx_id>`

Check analysis status.

Response (processing):

{
  "status": "processing"
}

Response (completed):

{
  "status": "ok",
  "history": [
    {
      "cod_ctx": "550e8400-e29b-41d4-a716-446655440000",
      "status": "INTERPRETED",
      "details": "{\"result\": {\"is_smell\": true, \"smells_detected\": [\"GodClass\"]}}"
    }
  ]
}

`GET /smells/<smell_id>`

Retrieve persisted smell data.

Response (200 OK):

{
  "id": "6ba7b810-9dad-11d1-80b4-00c04fd430c8",
  "ctx_id": "550e8400-e29b-41d4-a716-446655440000",
  "timestamp_utc": "2024-01-01T12:00:00.000Z",
  "user_id": "123",
  "org_id": "456",
  "loc_id": "789",
  "project_id": "101",
  "type": "GodClass",
  "smell_type": "DesignSmell",
  "is_smell": true,
  "rule": {"GodClassRule": true},
  "file_path": "/src/Main.java",
  "language": "java",
  "branch": "main",
  "commit_sha": "abc123",
  "treatment": "Refactor into smaller classes",
  "metrics": {
    "GodClass.ATFD": 12,
    "GodClass.TCC": 4
  }
}

Event Flow

Eclipse Plugin Client → POST /analyze
API generates ctx_id and smell_id
Event ANALYSIS__REQUESTED published
ValidationObserver validates metrics and thresholds
Event VALIDATION_COMPLETED published
InterpreterWorker executes run_interpretation()
Event ANALYSIS_COMPLETED published
PersistenceWorker saves to local CSV
Event PERSISTENCE_COMPLETED published
SheetsPersistenceObserver saves to Google Sheets
StatusWorker stores result for status queries
Client polls GET /status/<ctx_id> and GET /smells/<smell_id>

Response Codes

Code	Description
202	Analysis accepted (async processing)
400	Bad request (invalid data)
404	Resource not found
500	Internal server error

Observers Overview

Observer	Event	Responsibility
ValidationObserver	ANALYSIS_REQUESTED	Starts the pipeline
InterpreterWorker	VALIDATION_COMPLETED	Executes interpretation
PersistenceWorker	ANALYSIS_COMPLETED	Saves to CSV
SheetsPersistenceObserver	PERSISTENCE_COMPLETED	Saves to Google Sheets
StatusWorker	ANALYSIS_COMPLETED	Stores for status queries
LogObserver	ANALYSIS_COMPLETED	Saves log file
CsvSheetsObserver	ANALYSIS_COMPLETED	Exports to CSV
EventBusLoggerObserver	All	Logs context events

Setup Guide

Reproducibility Video

🔗 Link

Complete Step-by-Step Installation

Prerequisites

Python Environment

Python 3.9+ required

python --version  # Verify version

Create virtual environment (recommended)

python -m venv venv

Activate virtual environment

# Windows:
venv\Scripts\activate
# Linux/Mac:
source venv/bin/activate

Install Dependencies

Google Sheets Setup

3.1 Create Google Cloud Project

Go to Google Cloud Console
Create new project or select existing
Enable Google Sheets API

3.2 Create Service Account

Navigate to IAM & Admin → Service Accounts
Click Create Service Account
Name: (...)
Assign role: Editor
Create key: JSON format
Download and save as service-account.json in project root

3.3 Google Sheets Setup

Download the pre-configured spreadsheet:
- Access the shared Google Drive template:
  
  🔗 SmellHunter Database Template
- Click "Make a copy" to save it to your own Google Drive
- Rename it as needed (e.g., "SmellHunter - [Your Project Name]")
Worksheet Structure (already configured):
- Bad_Smell - Contains all detected smells with complete metadata
- Context - Logs all context events and execution history
Share with Service Account:
- Open your copied spreadsheet
- Click the "Share" button in the top-right corner
- Add your service account email (found in service-account.json)
- Assign role: Editor
- Uncheck "Notify people" and click Share
Get Spreadsheet ID:
- The spreadsheet URL contains the ID:
  https://docs.google.com/spreadsheets/d/``SPREADSHEET_ID_HERE``/edit
- Copy this ID and add it to your .env file:

        SPREADSHEET_ID=YOUR_SPREADSHEET_ID
        GOOGLE_APPLICATION_CREDENTIALS=app/configs/service_account.json

Verify Headers (already set up):

Bad_Smell worksheet headers:

   id, timestamp_utc, time_zone, user_id, org_id, loc_id, project_id, type, smell_type, is_smell, rule, file_path, language, branch, commit_sha, ctx_id, treatment

Context worksheet headers:

    ctx_id, user_id, org_id, loc_id, timestamp_utc, event_type

The spreadsheets are now ready to receive data from your SmellDSL Detection Service!

4. Configuration File

Create .env file in project root:

Flask settings

FLASK_ENV=development
FLASK_APP=interpreter_api.py
PORT=5000

Google Sheets

SPREADSHEET_ID=your-spreadsheet-id-here
SERVICE_ACCOUNT_FILE=service-account.json

Logging

LOG_DIR=logs

5. Project Structure

smell-detect/
├── app/
│   ├── configs/
│   │   └── settings.py
│   ├── events/
│   │   ├── event_bus.py
│   │   ├── event_types.py
│   │   ├── observers.py
│   │   └── validation_service.py
│   ├── parser/
│   │   ├── grammar.py
│   │   └── metric_extractor.py
│   ├── repositories/
│   │   └── sheets_repository.py
│   ├── interpreter_api.py
│   ├── interpreter_core.py
│   └── __init__.py
├── logs/
├── service-account.json
├── .env
└── requirements.txt

6. pip install requirements.txt

#Core dependencies
flask==2.3.3
lark==1.2.2

#Google Sheets integration
google-api-python-client==2.108.0
google-auth==2.28.1
google-auth-httplib2==0.2.0
google-auth-oauthlib==1.2.0
google-oauth2==1.0.0


#Utilities
python-dotenv==1.0.0
requests==2.31.0
dataclasses==0.6  # For Python < 3.7 (optional)
typing-extensions==4.9.0

#Development tools (optional)
pytest==7.4.4
black==23.12.1
flake8==7.0.0

Running the Application

1. Start the API Server

cd smelldetect
python -m app.interpreter_api

Eclipse Plugin Setup

🔗SmellHunter Eclipse Plugin

Requirements

Eclipse IDE 2023-12 or later
JDK 21 or later
SWT libraries (included with Eclipse)

Import Plugin Project

File → Import → Existing Projects into Workspace
Select the plugin project directory
Check "Search for nested projects"
Click Finish

Build and Run

Right-click on the project → Run As → Eclipse Application
A new Eclipse instance will launch
Navigate to Window → Show View → Other...
In the dialog, expand the plugin category and select "MyView"
Click Open to display the view

Data Visualization

Overview

SmellHunter persists detected smells and contextual execution data in Google Sheets.
These datasets can be connected to AppSheet to provide an interactive visualization layer for exploring detection results.

The dashboard allows users to inspect detected smells, navigate contextual information, and analyze detection outcomes through a structured interface.

🔗SmellHunter AppSheet Mobile View

🔗SmellHunter AppSheet Browser View

Physical Context View

This view presents contextual information related to the execution environment where the analysis occurred.
It includes metadata such as organization identifiers, project information, location identifiers, and execution timestamps.

The goal of this view is to support contextual analysis of smell occurrences across different projects and development environments.

Smell Details View

The Smell Details view displays the complete information related to a detected smell instance.
This includes the smell type, evaluated rule results, associated metrics, and metadata describing the analyzed artifact.

This view helps developers understand why a smell was detected and provides insights to guide refactoring decisions.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
figures		figures
smelldetect		smelldetect
test_files		test_files
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

SmellHunter API

Table of Contents

Research Motivation

Problem

Research Gap

Proposed Approach

Architecture

Detection Workflow

API Endpoints

POST /analyze

Required Parameters (multipart/form-data):

Optional Parametes

File Formats:

metrics.csv

thresholds.csv

smelldsl:

JSON Request (alternative):

Response (202 Accepted):

GET /status/<ctx_id>

Response (processing):

Response (completed):

GET /smells/<smell_id>

Response (200 OK):

Event Flow

Response Codes

Observers Overview

Setup Guide

Reproducibility Video

Complete Step-by-Step Installation

Prerequisites

Python Environment

Python 3.9+ required

Create virtual environment (recommended)

Activate virtual environment

Install Dependencies

Google Sheets Setup

3.1 Create Google Cloud Project

3.2 Create Service Account

3.3 Google Sheets Setup

4. Configuration File

Flask settings

Google Sheets

Logging

5. Project Structure

6. pip install requirements.txt

Running the Application

1. Start the API Server

Eclipse Plugin Setup

Requirements

Import Plugin Project

Build and Run

Data Visualization

Overview

The dashboard allows users to inspect detected smells, navigate contextual information, and analyze detection outcomes through a structured interface.

Physical Context View

Smell Details View

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`POST /analyze`

`metrics.csv`

`thresholds.csv`

`smelldsl`:

`GET /status/<ctx_id>`

`GET /smells/<smell_id>`

Packages