
Software Architecture Document

Aignostics Python SDK

Version: 0.2.105
Date: August 12, 2025
Status: Draft


1. Overview

1.1 Context

The Aignostics Python SDK is a comprehensive client library that provides programmatic access to the Aignostics AI Platform services. It serves as a bridge between local development environments and cloud-based AI services, enabling developers to interact with applications, manage data buckets, process datasets, and utilize various AI-powered tools through both command-line and graphical interfaces.

The SDK is designed to support data scientists, researchers, and developers working with digital pathology, whole slide imaging (WSI), and computational pathology applications in the life sciences domain.

1.2 General Architecture and Patterns Applied

Simplified Overview for Onboarding

flowchart LR
    subgraph "User Interfaces"
        CLI[🖥️ Command Line]
        GUI[🖱️ Launchpad App]
        NB[📓 Notebooks]
    end

    subgraph "Core Functionality"
        APP[🧬 Run ML Applications]
        FILES[🗂️ Manage Files]
        DATA[📊 Handle Datasets]
    end

    subgraph "Cloud Platform"
        API[🌐 Aignostics Platform]
        ML[🤖 ML Processing]
        STORE[💾 Cloud Storage]
    end

    subgraph "External Tools"
        QP[🔍 QuPath Analysis]
        IDC[📚 Public Datasets]
    end

    CLI --> APP
    GUI --> APP
    NB --> APP

    CLI --> FILES
    GUI --> FILES

    CLI --> DATA
    GUI --> DATA

    APP --> API
    FILES --> STORE
    DATA --> IDC

    API --> ML
    APP --> QP

    %% Annotations
    CLI -.->|"Commands like:<br/>aignostics application run"| APP
    GUI -.->|"Drag & drop<br/>Point & click"| APP
    API -.->|"Processes WSI images<br/>Returns results"| ML
    APP -.->|"Opens results in<br/>QuPath projects"| QP

    classDef interface fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
    classDef core fill:#e8f5e8,stroke:#388e3c,stroke-width:2px
    classDef cloud fill:#fff3e0,stroke:#f57c00,stroke-width:2px
    classDef external fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px

    class CLI,GUI,NB interface
    class APP,FILES,DATA core
    class API,ML,STORE cloud
    class QP,IDC external

The SDK follows a Modulith Architecture pattern, organizing functionality into cohesive modules while maintaining a monolithic deployment structure. This approach provides the benefits of modular design (clear boundaries, focused responsibilities) while avoiding the complexity of distributed systems.

Key Architectural Patterns:

  • Modulith Pattern: Self-contained modules with clear boundaries and minimal inter-module dependencies
  • Dependency Injection: Dynamic discovery and registration of services, CLI commands, and GUI pages
  • Service Layer Pattern: Core business logic encapsulated in service classes with consistent interfaces
  • Dual Presentation Layers:
    • (a) CLI Layer: Command-line interface using Typer framework
    • (b) GUI Layer: Web-based graphical interface using NiceGUI framework
  • Settings-based Configuration: Environment-aware configuration management using Pydantic Settings
  • Plugin Architecture: Optional modules that can be enabled/disabled based on available dependencies

graph TB
    subgraph "Presentation Layer"
        CLI[CLI Interface<br/>Typer Commands]
        GUI[GUI Interface<br/>NiceGUI/Launchpad]
        NOTEBOOK[Notebook Interface<br/>Marimo Server]
    end

    subgraph "Domain Services"
        AS[Application Service]
        BS[Bucket Service]
        DS[Dataset Service]
        NS[Notebook Service]
        WS[WSI Service]
        QS[QuPath Service]
        SS[System Service]
    end

    subgraph "Platform Layer"
        PS[Platform Service<br/>API Client]
        AUTH[Authentication]
        CLIENT[HTTP Client]
    end

    subgraph "Infrastructure Layer"
        DI[Dependency Injection<br/>Auto-discovery]
        SETTINGS[Settings Management<br/>Pydantic]
        LOGGING[Logging & Monitoring<br/>Sentry/Logfire]
        BOOT[Boot Sequence]
    end

    subgraph "External Services"
        PLATFORM_API[Aignostics Platform API]
        QUPATH_APP[QuPath Application]
        IDC_API[NCI Image Data Commons]
    end

    %% Presentation to Domain Service connections
    CLI --> AS
    CLI --> BS
    CLI --> DS
    CLI --> NS
    CLI --> WS
    CLI --> QS
    CLI --> SS

    GUI --> AS
    GUI --> BS
    GUI --> DS
    GUI --> WS
    GUI --> SS
    GUI --> NS

    NOTEBOOK --> AS
    NOTEBOOK --> BS
    NOTEBOOK --> DS

    %% Domain Service interdependencies
    AS --> BS
    AS --> WS
    AS --> QS
    AS --> PS

    BS --> PS
    DS --> PS
    WS --> PS
    NS --> PS

    %% Platform to External Services
    PS --> AUTH
    PS --> CLIENT
    CLIENT --> PLATFORM_API

    %% External integrations
    QS --> QUPATH_APP
    DS --> IDC_API

    %% Infrastructure connections
    AS --> DI
    BS --> DI
    DS --> DI
    NS --> DI
    WS --> DI
    QS --> DI
    SS --> DI
    PS --> DI

    AS --> SETTINGS
    BS --> SETTINGS
    DS --> SETTINGS
    NS --> SETTINGS
    WS --> SETTINGS
    QS --> SETTINGS
    SS --> SETTINGS
    PS --> SETTINGS

    AS --> LOGGING
    BS --> LOGGING
    DS --> LOGGING
    NS --> LOGGING
    WS --> LOGGING
    QS --> LOGGING
    SS --> LOGGING
    PS --> LOGGING

    DI --> BOOT
    SETTINGS --> BOOT
    LOGGING --> BOOT

    %% Styling
    classDef presentation fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
    classDef domain fill:#e8f5e8,stroke:#388e3c,stroke-width:2px
    classDef platform fill:#fff3e0,stroke:#f57c00,stroke-width:2px
    classDef infrastructure fill:#fce4ec,stroke:#c2185b,stroke-width:2px
    classDef external fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px

    class CLI,GUI,NOTEBOOK presentation
    class AS,BS,DS,NS,WS,QS,SS domain
    class PS,AUTH,CLIENT platform
    class DI,SETTINGS,LOGGING,BOOT infrastructure
    class PLATFORM_API,QUPATH_APP,IDC_API external

Architecture Overview:

This high-level diagram shows the four main architectural layers, plus the external services they integrate with:

  • 🔵 Presentation Layer: User interfaces (CLI, GUI, Notebooks)
  • 🟢 Domain Services: Business logic modules for specific functionality
  • 🟠 Platform Layer: API client and authentication services
  • 🔴 Infrastructure Layer: Cross-cutting concerns and utilities
  • 🟣 External Services: Third-party integrations

1.3 Language and Frameworks

Core Technologies:

  • Python 3.11+: Primary programming language with full type hints and modern features
  • Typer: CLI framework for building intuitive command-line interfaces with automatic help generation
  • NiceGUI: Modern web-based GUI framework for creating responsive user interfaces
  • FastAPI: High-performance web framework for API endpoints (inherited from template)
  • Pydantic: Data validation and settings management with type safety
  • Requests: HTTP client library for API communication

Key Dependencies:

  • aignx-codegen: Auto-generated API client for Aignostics Platform
  • Marimo: Interactive notebook environment for data exploration
  • Google CRC32C: Data integrity verification for file transfers
  • Humanize: Human-readable formatting for file sizes, dates, and progress

Optional Extensions:

  • QuPath Integration: Advanced pathology image analysis capabilities
  • WSI Processing: Whole slide image format support and processing
  • Jupyter Notebook: Alternative notebook environment support
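
The optional-extension mechanism can be illustrated with a minimal sketch: check whether the backing package of each extra is importable, and expose only the features whose dependencies are present. The extras-to-package mapping below is illustrative, not the SDK's actual module names.

```python
import importlib.util

# Hypothetical mapping of SDK extras to the packages that back them;
# the real SDK's extras and package names may differ.
OPTIONAL_EXTRAS = {
    "qupath": "paquo",      # assumed QuPath bridge package
    "wsi": "openslide",     # assumed WSI reader
    "jupyter": "notebook",
}

def available_extras() -> set[str]:
    """Return the extras whose backing packages are importable right now."""
    return {
        extra
        for extra, module in OPTIONAL_EXTRAS.items()
        if importlib.util.find_spec(module) is not None
    }
```

A module loader built on this check can skip registering CLI commands and GUI pages for extras that are not installed, which is what "graceful degradation" means in practice.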

1.4 Build Chain and CI/CD

The project implements a comprehensive DevOps pipeline with multiple quality gates and automated processes:

flowchart TD
    subgraph "Development Phase"
        DEV[👩‍💻 Developer]
        CODE[📝 Code Changes]
        PRECOMMIT[🔍 Pre-commit Hooks]
    end

    subgraph "Quality Gates"
        LINT[🧹 Linting<br/><small>Ruff formatter</small>]
        TYPE[🔍 Type Checking<br/><small>MyPy strict mode</small>]
        TEST[🧪 Testing<br/><small>pytest + coverage</small>]
        SEC[🛡️ Security<br/><small>pip-audit + secrets</small>]
    end

    subgraph "Build & Package"
        BUILD[📦 Build Package<br/><small>Python wheel</small>]
        DOCKER[🐳 Docker Images<br/><small>Slim + Full variants</small>]
        DOCS[📚 Documentation<br/><small>Sphinx + API docs</small>]
    end

    subgraph "Release & Deploy"
        PYPI[🐍 PyPI Release]
        REGISTRY[🗂️ Container Registry]
        RTD[📖 Read the Docs]
        MONITOR[📊 Monitoring<br/><small>Sentry + Logfire</small>]
    end

    DEV --> CODE
    CODE --> PRECOMMIT
    PRECOMMIT --> LINT
    PRECOMMIT --> TYPE
    PRECOMMIT --> TEST
    PRECOMMIT --> SEC

    LINT --> BUILD
    TYPE --> BUILD
    TEST --> BUILD
    SEC --> BUILD

    BUILD --> DOCKER
    BUILD --> DOCS
    BUILD --> PYPI

    DOCKER --> REGISTRY
    DOCS --> RTD
    PYPI --> MONITOR

    %% Annotations for clarity
    PRECOMMIT -.->|"Runs automatically<br/>on git commit"| LINT
    TEST -.->|"85% coverage<br/>requirement"| BUILD
    SEC -.->|"Scans dependencies<br/>& secrets"| BUILD
    DOCKER -.->|"Multi-arch builds<br/>ARM64 + AMD64"| REGISTRY

    classDef dev fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
    classDef quality fill:#fff3e0,stroke:#f57c00,stroke-width:2px
    classDef build fill:#e8f5e8,stroke:#388e3c,stroke-width:2px
    classDef deploy fill:#fce4ec,stroke:#c2185b,stroke-width:2px

    class DEV,CODE,PRECOMMIT dev
    class LINT,TYPE,TEST,SEC quality
    class BUILD,DOCKER,DOCS build
    class PYPI,REGISTRY,RTD,MONITOR deploy

Key Pipeline Features:

  • Code Generation: Automated API client generation from OpenAPI specifications
  • Pre-commit Hooks: Automated quality checks including detect-secrets and pygrep
  • Multi-environment Testing: Matrix testing across Python versions and operating systems
  • Security Scanning: pip-audit dependency vulnerability scanning and secret detection
  • Documentation Generation: Automated API docs and user guides using Sphinx
  • Multi-platform Builds: Docker images for both ARM64 and AMD64 architectures
  • Compliance Integration: Automated reporting to compliance platforms

Code Quality & Analysis:

  • Linting with Ruff: Fast Python linter and formatter following Black code style
  • Static Type Checking with MyPy: Strict type checking in all code paths
  • Pre-commit Hooks: Automated quality checks including detect-secrets and pygrep
  • Code Quality Analysis: SonarQube and GitHub CodeQL integration

Testing & Coverage:

  • Unit and E2E Testing with pytest: Comprehensive test suite with parallel execution
  • Matrix Testing with Nox: Multi-environment testing across Python versions
  • Test Coverage Reporting: Codecov integration with coverage artifacts
  • Regression Testing: Automated detection of breaking changes

Security & Compliance:

  • Dependency Monitoring: Renovate and GitHub Dependabot for automated updates
  • Vulnerability Scanning: pip-audit and Trivy security analysis
  • License Compliance: pip-licenses with allowlist validation and attribution generation
  • SBOM Generation: Software Bill of Materials in CycloneDX and SPDX formats

Documentation & Release:

  • Documentation with Sphinx: Automated generation of HTML/PDF documentation
  • API Documentation: Interactive OpenAPI specification with Swagger UI
  • Version Management: bump-my-version for semantic versioning
  • Changelog Generation: git-cliff for automated release notes
  • Multi-format Publishing: PyPI packages, Docker images, and Read the Docs

Monitoring & Observability:

  • Error Monitoring: Sentry integration for production error tracking
  • Logging & Metrics: Logfire integration for structured logging
  • Uptime Monitoring: Prepared integration with monitoring services

Deployment & Distribution:

  • Multi-stage Docker Builds: Fat (all extras) and slim (core only) variants
  • Multi-architecture Support: ARM64 and AMD64 container images
  • Container Security: Non-root execution within immutable containers
  • Registry Publishing: Docker.io and GitHub Container Registry with attestations

Development Environment:

  • Dev Containers: One-click development environments with GitHub Codespaces
  • VSCode Integration: Optimized settings and extensions for development, found under the .vscode directory
  • GitHub Copilot: Custom instructions and prompts for AI-assisted development
  • Local CI/CD: Act integration for running GitHub Actions locally

1.5 Layers and Modules

High-Level Module Organization

graph TB
    subgraph "Presentation Interfaces"
        CLI["🖥️ CLI Interface<br/><small>Typer-based Commands</small>"]
        GUI["🖱️ GUI Launchpad<br/><small>NiceGUI Web Interface</small>"]
        NOTEBOOK["📓 Interactive Notebooks<br/><small>Marimo Server</small>"]
    end

    subgraph "Domain Modules"
        APPLICATION["🧬 Application Module<br/><small>ML Application Management</small>"]
        BUCKET["🗂️ Bucket Module<br/><small>Cloud File Storage</small>"]
        DATASET["📊 Dataset Module<br/><small>Data Management</small>"]
        WSI["🔬 WSI Module<br/><small>Whole Slide Image Processing</small>"]
        QUPATH["🔍 QuPath Module<br/><small>Pathology Analysis Integration</small>"]
        SYSTEM["⚙️ System Module<br/><small>Health & Diagnostics</small>"]
        NOTEBOOK_MOD["📔 Notebook Module<br/><small>Interactive Computing</small>"]
    end

    subgraph "Platform Layer"
        PLATFORM["🌐 Platform Module<br/><small>API Client & Authentication</small>"]
    end

    subgraph "Infrastructure Layer"
        UTILS["🔧 Utils Module<br/><small>DI, Settings, Logging</small>"]
    end

    subgraph "Third-party Integrations"
        THIRDPARTY["🔗 Third-party Module<br/><small>External Service Connectors</small>"]
    end

    %% Direct presentation dependencies
    CLI --> APPLICATION
    CLI --> BUCKET
    CLI --> DATASET
    CLI --> WSI
    CLI --> QUPATH
    CLI --> SYSTEM
    CLI --> NOTEBOOK_MOD

    GUI --> APPLICATION
    GUI --> BUCKET
    GUI --> DATASET
    GUI --> WSI
    GUI --> SYSTEM
    GUI --> NOTEBOOK_MOD

    NOTEBOOK --> APPLICATION
    NOTEBOOK --> BUCKET
    NOTEBOOK --> DATASET

    %% Domain module dependencies
    APPLICATION --> PLATFORM
    APPLICATION --> BUCKET
    APPLICATION --> WSI
    APPLICATION --> QUPATH

    BUCKET --> PLATFORM
    DATASET --> PLATFORM
    WSI --> PLATFORM
    QUPATH --> THIRDPARTY
    NOTEBOOK_MOD --> PLATFORM

    %% Infrastructure dependencies (all modules depend on utils)
    APPLICATION --> UTILS
    BUCKET --> UTILS
    DATASET --> UTILS
    WSI --> UTILS
    QUPATH --> UTILS
    SYSTEM --> UTILS
    NOTEBOOK_MOD --> UTILS
    PLATFORM --> UTILS
    THIRDPARTY --> UTILS

    %% Styling with better contrast
    classDef presentation fill:#e3f2fd,stroke:#1976d2,stroke-width:2px,color:#000
    classDef domain fill:#e8f5e8,stroke:#388e3c,stroke-width:2px,color:#000
    classDef platform fill:#fff3e0,stroke:#f57c00,stroke-width:2px,color:#000
    classDef infrastructure fill:#fce4ec,stroke:#c2185b,stroke-width:2px,color:#000
    classDef thirdparty fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px,color:#000

    class CLI,GUI,NOTEBOOK presentation
    class APPLICATION,BUCKET,DATASET,WSI,QUPATH,SYSTEM,NOTEBOOK_MOD domain
    class PLATFORM platform
    class UTILS infrastructure
    class THIRDPARTY thirdparty

Detailed Data Flow: Application Processing Workflow

sequenceDiagram
    participant User as 👤 User
    participant CLI as 🖥️ CLI/GUI
    participant App as 🧬 Application Service
    participant Bucket as 🗂️ Bucket Service
    participant Platform as 🌐 Platform API
    participant ML as 🤖 ML Pipeline
    participant QuPath as 🔍 QuPath

    Note over User,QuPath: Computational Pathology Analysis Workflow

    User->>CLI: Upload WSI files
    CLI->>Bucket: Upload to cloud storage
    Bucket->>Platform: Generate signed URLs
    Platform-->>Bucket: Return upload URLs
    Bucket-->>CLI: Upload progress updates
    CLI-->>User: Show upload status

    User->>CLI: Submit application run
    CLI->>App: Create application run
    App->>Platform: Submit run with metadata
    Platform->>ML: Start processing pipeline

    Note over ML: Image Analysis:<br/>• Tissue segmentation<br/>• Cell detection<br/>• Feature extraction

    ML-->>Platform: Processing updates
    Platform-->>App: Status notifications
    App-->>CLI: Real-time progress
    CLI-->>User: Show processing status

    ML->>Platform: Results ready
    Platform-->>App: Download URLs available
    App->>Bucket: Download results
    Bucket-->>App: Result files (GeoJSON, TIFF, CSV)

    App->>QuPath: Generate QuPath project
    QuPath-->>App: Project created
    App-->>CLI: Results available
    CLI-->>User: Open in QuPath/Notebooks

The SDK is organized into distinct layers, each with specific responsibilities:

Infrastructure Layer (utils/)

Core Utilities and Cross-cutting Concerns:

  • Boot Sequence: Application initialization and dependency injection setup
  • Dependency Injection: Dynamic discovery and registration of services and UI components
  • Settings Management: Environment-aware configuration using Pydantic Settings
  • Logging & Monitoring: Structured logging with Logfire and Sentry integration
  • Authentication: Token-based authentication with caching mechanisms
  • Health Monitoring: Service health checks and status reporting

Platform Layer (platform/)

Foundation Services:

  • API Client: Auto-generated client for Aignostics Platform REST API
  • Authentication Service: OAuth/JWT token management and renewal
  • Core Resources: Applications, versions, runs, and user management
  • Exception Handling: Standardized error handling and API response processing
  • Configuration: Platform-specific settings and endpoint management

Domain Modules

Each domain module follows a consistent internal structure:

Application Module (application/)

  • Service (_service.py): Core business logic for application management and execution
  • CLI (_cli.py): Command-line interface for application operations
  • GUI (_gui/): Web-based interface for application management
  • Settings (_settings.py): Module-specific configuration
  • Utilities (_utils.py): Helper functions and data transformations

Bucket Module (bucket/)

  • Service: Cloud storage operations, file upload/download with progress tracking
  • CLI: Command-line file management operations
  • GUI: Drag-and-drop file manager interface
  • Settings: Storage configuration and authentication

Dataset Module (dataset/)

  • Service: Dataset creation, validation, and metadata management
  • CLI: Batch dataset operations and validation
  • GUI: Interactive dataset builder and explorer
  • Settings: Dataset processing configuration

WSI Module (wsi/)

  • Service: Whole slide image processing and format conversion
  • Utilities: Image format detection and metadata extraction
  • Integration: Support for various medical imaging formats (DICOM, TIFF, SVS)

QuPath Module (qupath/)

  • Service: Integration with QuPath for advanced pathology analysis
  • Annotation Processing: Import/export of pathology annotations
  • Project Management: QuPath project creation and synchronization

Notebook Module (notebook/)

  • Service: Marimo notebook server management
  • Templates: Pre-configured notebook templates for common workflows
  • Integration: Seamless data flow between notebooks and platform services

System Module (system/)

  • Service: System diagnostics and environment information
  • Health Checks: Comprehensive system health monitoring
  • Configuration: System-level settings and capability detection

Third-Party Integration (third_party/)

  • Embedded Dependencies: Vendored third-party libraries for reliability
  • IDC Index: Integration with Image Data Commons for medical imaging datasets
  • Bottle.py: Lightweight WSGI micro web-framework for specific use cases

Presentation Layer

CLI Interface (cli.py)

  • Auto-discovery and registration of module CLI commands
  • Consistent help text and error handling across all commands
  • Progress indicators and interactive prompts
  • Support for both interactive and scripted usage
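
Auto-discovery of module CLI commands can be sketched as follows: walk a package's submodules and attach any `cli` sub-app found to the root app via Typer's `add_typer`. The function and the `cli` attribute convention are assumptions for illustration, not the SDK's actual registration code.

```python
import importlib
import pkgutil

def register_module_clis(root_app, package) -> list[str]:
    """Attach each submodule's `cli` sub-app (if present) to the root app.

    `root_app` is expected to behave like a typer.Typer instance;
    the `cli` attribute name is an assumed convention.
    """
    registered = []
    for info in pkgutil.iter_modules(package.__path__):
        module = importlib.import_module(f"{package.__name__}.{info.name}")
        sub_app = getattr(module, "cli", None)
        if sub_app is not None:
            root_app.add_typer(sub_app, name=info.name)
            registered.append(info.name)
    return registered
```

With this pattern, adding a new domain module with a `cli` attribute makes its commands appear under the root CLI without touching `cli.py`.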

GUI Interface (gui/)

  • Responsive web-based interface using NiceGUI
  • Consistent theming and layout across all modules
  • Real-time progress tracking and status updates
  • File drag-and-drop capabilities and interactive forms

2. Design Principles

2.1 Modular Architecture

Each module is designed as a self-contained unit with:

  • Clear Boundaries: Well-defined interfaces and minimal coupling
  • Consistent Structure: Standardized organization across all modules
  • Independent Testing: Module-specific test suites with isolated dependencies
  • Optional Dependencies: Graceful degradation when optional features are unavailable

2.2 Dependency Injection

The SDK uses a sophisticated dependency injection system:

  • Automatic Discovery: Services and UI components are automatically registered
  • Dynamic Loading: Modules are loaded on-demand based on available dependencies
  • Lifecycle Management: Proper initialization and cleanup of resources
  • Configuration Injection: Settings are automatically injected into services
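
The core of such a container can be reduced to a registry of factories with lazy, singleton-style resolution. This is a minimal sketch of the pattern, not the SDK's actual implementation:

```python
from typing import Callable

class ServiceRegistry:
    """Minimal DI container sketch: lazy singletons keyed by service type."""

    def __init__(self) -> None:
        self._factories: dict[type, Callable[[], object]] = {}
        self._instances: dict[type, object] = {}

    def register(self, service_type: type, factory: Callable[[], object]) -> None:
        """Record how to build a service without building it yet."""
        self._factories[service_type] = factory

    def resolve(self, service_type: type):
        """Build the service on first request; reuse the instance afterwards."""
        if service_type not in self._instances:
            self._instances[service_type] = self._factories[service_type]()
        return self._instances[service_type]
```

Lazy resolution is what allows modules to be "loaded on-demand": a service's factory runs only when something actually asks for it.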

2.3 Configuration Management

Hierarchical configuration system:

  • Environment Variables: Platform and module-specific environment variables
  • Settings Files: .env files for local development configuration
  • Default Values: Sensible defaults for all configuration options
  • Validation: Type-safe configuration with Pydantic validation

2.4 Error Handling and Resilience

Comprehensive error handling strategy:

  • Typed Exceptions: Domain-specific exception hierarchies
  • Graceful Degradation: Fallback behavior when services are unavailable
  • Retry Logic: Automatic retry with exponential backoff for transient failures
  • User-Friendly Messages: Clear error messages with actionable guidance
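
Retry with exponential backoff can be sketched in a few lines; the transient exception types and delay parameters below are illustrative, not the SDK's actual policy:

```python
import random
import time

def retry(
    operation,
    attempts: int = 4,
    base_delay: float = 0.5,
    transient=(ConnectionError, TimeoutError),  # assumed transient error types
):
    """Run `operation`, retrying transient failures with exponential backoff.

    Non-transient exceptions propagate immediately; the last transient
    failure is re-raised once the attempt budget is exhausted.
    """
    for attempt in range(attempts):
        try:
            return operation()
        except transient:
            if attempt == attempts - 1:
                raise
            # Double the delay each attempt, with jitter to avoid thundering herds.
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))
```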

2.5 Security and Privacy

Security-first design principles:

  • Token-based Authentication: Secure API authentication with automatic refresh
  • Sensitive Data Protection: Automatic masking of secrets in logs and outputs
  • Input Validation: Comprehensive validation of all user inputs and API responses
  • Secure Defaults: All security settings default to the most secure option
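
Secret masking in logs can be illustrated with a simple pattern-based redactor; the key names and pattern below are an assumption, and the SDK's actual masking mechanism may work differently (e.g. by redacting known setting values):

```python
import re

# Illustrative pattern: common secret-like keys followed by a value.
SECRET_PATTERN = re.compile(r"(?i)(token|secret|password|api[_-]?key)(\"?\s*[:=]\s*)(\S+)")

def mask_secrets(message: str) -> str:
    """Replace secret values in a log message with a fixed placeholder."""
    return SECRET_PATTERN.sub(lambda m: m.group(1) + m.group(2) + "***", message)
```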

3. Integration Patterns

3.1 Aignostics Platform API Integration

The SDK serves as a comprehensive client for the Aignostics Platform API, a RESTful web service that provides access to advanced computational pathology applications and machine learning workflows:

Core API Services:

  • Application Management: Access to computational pathology applications like Atlas H&E-TME, tissue segmentation, cell detection and classification
  • Run Orchestration: Submit, monitor, and manage application runs with up to 500 whole slide images per batch
  • Result Management: Incremental download of results as processing completes, with automatic 30-day retention
  • Resource Management: User quotas, organization management, and usage monitoring

Machine Learning Applications:

The platform provides access to a growing ecosystem of computational pathology applications:

  • Atlas H&E-TME: Tumor microenvironment analysis for H&E stained slides
  • Test Application: Free validation application for integration testing
  • Tissue Quality Control: Automated assessment of slide quality and artifacts
  • Cell Detection & Classification: Advanced machine learning models for cellular analysis
  • Biomarker Scoring: Quantitative analysis of immunohistochemical markers

Technical Integration:

  • Auto-generated Client: Type-safe API client generated from OpenAPI specifications using aignx-codegen
  • Authentication Handling: OAuth/JWT token management with automatic refresh and secure credential storage
  • Request/Response Transformation: Conversion between Platform API models and SDK domain objects
  • Error Mapping: Platform API errors mapped to SDK-specific exceptions with actionable error messages
  • Batch Processing: Support for high-throughput processing with incremental result delivery
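
The error-mapping step can be sketched as a table from HTTP status codes to a typed exception hierarchy. The exception names and status mapping below are illustrative, not the SDK's actual classes:

```python
class PlatformError(Exception):
    """Base class for platform exceptions (illustrative hierarchy)."""

class AuthenticationError(PlatformError): ...
class RunNotFoundError(PlatformError): ...
class QuotaExceededError(PlatformError): ...

# Assumed status-to-exception mapping, for illustration only.
_STATUS_MAP = {
    401: AuthenticationError,
    403: AuthenticationError,
    404: RunNotFoundError,
    429: QuotaExceededError,
}

def raise_for_status(status_code: int, detail: str) -> None:
    """Translate a raw Platform API status code into a typed SDK exception."""
    if status_code >= 400:
        raise _STATUS_MAP.get(status_code, PlatformError)(detail)
```

Callers can then catch `QuotaExceededError` specifically instead of inspecting raw HTTP responses, which is what makes the error messages actionable.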

Data Flow Architecture:

sequenceDiagram
    participant SDK as Aignostics SDK
    participant Platform as Aignostics Platform API
    participant ML as ML Processing Pipeline
    participant Storage as Cloud Storage

    SDK->>Platform: Submit Application Run
    Platform->>Platform: Validate Input & Schedule
    Platform->>Storage: Download WSI via Signed URLs
    Platform->>ML: Execute Computational Pathology Application
    ML->>ML: Process: Tissue Segmentation, Cell Detection, etc.
    ML->>Storage: Store Results (GeoJSON, TIFF, CSV)
    Platform->>SDK: Notify Results Available
    SDK->>Storage: Download Results Incrementally

Supported Image Formats & Standards:

  • Input Formats: Pyramidal DICOM, TIFF, SVS, and other digital pathology formats
  • Output Formats: QuPath GeoJSON (annotations), TIFF (heatmaps), CSV (measurements and statistics)
  • Metadata Standards: DICOM metadata extraction and validation
  • Quality Assurance: CRC32C checksums and automated format validation

3.2 File System Integration

Comprehensive file system operations optimized for large medical imaging files:

  • Progress Tracking: Real-time progress indicators for large file operations with human-readable size formatting
  • Integrity Verification: CRC32C checksums for data integrity validation during transfers
  • Resume Capability: Ability to resume interrupted file transfers for large WSI files
  • Batch Operations: Efficient handling of multiple whole slide images with parallel processing
  • Cross-platform Compatibility: Consistent behavior across operating systems
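
Checksummed, progress-tracked streaming can be sketched as below. The SDK uses CRC32C via google-crc32c; this sketch substitutes stdlib `zlib.crc32` so it stays self-contained, and the function itself is illustrative:

```python
import zlib
from typing import BinaryIO, Callable

def copy_with_checksum(
    src: BinaryIO,
    dst: BinaryIO,
    total: int,
    on_progress: Callable[[float], None],
    chunk_size: int = 1 << 20,  # 1 MiB chunks
) -> int:
    """Stream src to dst in chunks, reporting progress and returning a checksum.

    zlib.crc32 stands in here for the CRC32C the SDK actually uses.
    """
    checksum = 0
    copied = 0
    while chunk := src.read(chunk_size):
        dst.write(chunk)
        checksum = zlib.crc32(chunk, checksum)  # rolling checksum over all chunks
        copied += len(chunk)
        on_progress(copied / total if total else 1.0)
    return checksum
```

Comparing the returned checksum against one supplied by the server detects corruption introduced during transfer; resume support would additionally persist `copied` and the rolling checksum between attempts.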

3.3 External Tool Integration

Seamless integration with external tools:

  • QuPath: Direct integration for pathology image analysis
  • Jupyter/Marimo: Notebook environments for interactive data exploration
  • File Managers: Native file manager integration for easy file access
  • Web Browsers: Embedded browser components for rich user interfaces

4. Application Ecosystem

4.1 Computational Pathology Applications

The SDK provides access to Aignostics' portfolio of advanced computational pathology applications, each designed for specific analysis purposes in digital pathology workflows:

Atlas H&E-TME (Hematoxylin & Eosin - Tumor Microenvironment)

  • Advanced machine learning-based tissue and cell analysis for H&E stained slides
  • Quantitative tumor microenvironment characterization
  • Automated tissue segmentation and cell classification
  • Spatial analysis of cellular interactions and distributions

Application Versioning & Management

  • Each application supports multiple versions with semantic versioning
  • Version-specific input requirements and output schemas
  • Backward compatibility for stable integrations
  • Application-specific documentation and constraints (staining methods, tissue types, diseases)

Processing Pipeline

  • Automated quality control and format validation
  • Parallel processing of multiple whole slide images (up to 500 per batch)
  • Real-time status monitoring with detailed error reporting
  • Incremental result delivery as individual slides complete processing
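
The monitor-and-download loop implied by incremental delivery can be sketched as follows. The `client` object and its `run_status`/`completed_items` methods are hypothetical stand-ins, not the SDK's actual API:

```python
import time

def wait_and_download(client, run_id: str, poll_seconds: float = 30.0) -> list[str]:
    """Poll a run and collect result items incrementally as slides finish.

    `client` is a hypothetical object exposing `completed_items(run_id)` and
    `run_status(run_id)`; names are illustrative only.
    """
    downloaded: list[str] = []
    while True:
        # Pick up any slides that finished since the last poll.
        for item in client.completed_items(run_id):
            if item not in downloaded:
                downloaded.append(item)  # actual result download would go here
        if client.run_status(run_id) in {"succeeded", "failed"}:
            return downloaded
        time.sleep(poll_seconds)
```

The point of the pattern is that results for early slides become usable long before a 500-slide batch finishes.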

4.2 Data Processing Workflows

Input Processing

  • Multi-format support: Pyramidal DICOM, TIFF, SVS, and other digital pathology formats
  • Automated metadata extraction from DICOM headers
  • Base magnification (MPP) detection and validation
  • Image dimension analysis and pyramid level inspection

Machine Learning Execution

  • Cloud-based processing with enterprise-grade security
  • Scalable compute resources for high-throughput analysis
  • GPU-accelerated inference for complex pathology models
  • Quality control checkpoints throughout the processing pipeline

Output Generation

  • Standardized result formats for downstream analysis:
    • QuPath GeoJSON: Polygon annotations for tissue regions and cellular structures
    • TIFF Images: Heatmaps and segmentation masks with spatial information
    • CSV Data: Quantitative measurements, statistics, and biomarker scores
  • Metadata preservation and provenance tracking
  • Result validation and quality assurance checks

4.3 Integration Capabilities

Enterprise Integration

  • RESTful API for language-agnostic integration
  • Support for Laboratory Information Management Systems (LIMS)
  • Integration with Image Management Systems (IMS)
  • SAML/OIDC authentication for enterprise identity providers

Research Workflows

  • Jupyter and Marimo notebook integration for interactive analysis
  • QuPath project generation for advanced visualization
  • Export capabilities for external analysis tools
  • Batch processing for large-scale studies

Quality Assurance & Compliance

  • Automated validation of input requirements
  • Processing audit trails and provenance tracking
  • Secure data handling with configurable retention policies
  • Two-factor authentication and role-based access control

5. Quality Assurance

5.1 Testing Strategy

Multi-layered testing approach:

  • Unit Tests: Individual component testing with >85% coverage requirement using pytest with pytest-cov for coverage reporting
  • End-to-End Tests: Complete workflow testing from CLI and GUI using pytest with NiceGUI testing plugin
  • Regression Tests: Automated detection of breaking changes using pytest-regressions
  • Parallel Execution: Multi-process test execution using pytest-xdist
  • Async Testing: Asynchronous code testing using pytest-asyncio
  • Long-running Tests: Scheduled integration tests marked with @pytest.mark.long_running and @pytest.mark.scheduled

5.2 Code Quality

Automated code quality enforcement:

  • Style Consistency: Automated formatting with Ruff/Black
  • Type Safety: 100% type annotation coverage with MyPy strict mode
  • Complexity Monitoring: Cyclomatic complexity limits and code metrics
  • Security Scanning: Automated detection of security vulnerabilities

5.3 Documentation Standards

Comprehensive documentation requirements:

  • API Documentation: Auto-generated from type hints and docstrings
  • User Guides: Step-by-step tutorials for common workflows
  • Architecture Documentation: This document and module-specific designs
  • Release Notes: Automated changelog generation from commit messages

6. Deployment and Operations

6.1 Distribution Channels

Multiple distribution methods:

  • PyPI Package: Standard Python package installation via pip/uv
  • Docker Images: Containerized deployment with multiple variants
  • Source Installation: Direct installation from GitHub repository
  • Development Setup: One-click development environment setup

6.2 Configuration Management

Environment-aware configuration:

  • Development: Local .env files with development defaults
  • Testing: Isolated test configuration with mock services
  • Production: Environment variables with validation and defaults
  • Container: Container-specific configuration and health checks

6.3 Monitoring and Observability

Production monitoring capabilities:

  • Health Endpoints: Service health checks for monitoring systems
  • Structured Logging: JSON-formatted logs with correlation IDs
  • Error Tracking: Automatic error reporting to monitoring services

This architecture document reflects the current state of the Aignostics Python SDK as of August 2025. The design emphasizes modularity, maintainability, and extensibility while providing a consistent developer experience across different interaction modes (CLI, GUI, API).