Architectural Evolution: From EZStitcher to OpenHCS

Overview

OpenHCS evolved from its predecessor EZStitcher, transforming from a CPU-based image stitching tool into a GPU-native scientific computing platform. This document traces the architectural changes and design decisions that enabled this transformation.

EZStitcher: The Architectural Foundation

Core Innovations That Carried Forward

EZStitcher established several concepts that remain central to OpenHCS:

1. Pipeline Architecture Hierarchy

PipelineOrchestrator
├── Pipeline (sequence of steps)
│   ├── Step (single operation)
│   ├── Step (single operation)
│   └── Step (single operation)
└── Pipeline (another sequence)

This hierarchical design enabled complex workflows to be built from simple, reusable components.

2. Variable Components Pattern

Intelligent file grouping based on microscopy metadata patterns.

# EZStitcher pattern that solved the fundamental problem:
Step(
    func=create_projection,
    variable_components=['z_index']
)
# Groups: Files with same (well, channel, site) but different z_index
# Result: [z1.tif, z2.tif, z3.tif] → stack → max_projection → single.tif

Step(
    func=create_composite,
    variable_components=['channel']
)
# Groups: Files with same (well, site, z_index) but different channel
# Result: [ch1.tif, ch2.tif] → stack → composite → single.tif

This pattern automatically solves the image grouping problem - how to intelligently group related images for batch processing without manual specification of grouping logic.

3. Function Pattern System

Function Pattern System: EZStitcher’s most sophisticated innovation was its function pattern system:

# Single function
func = normalize

# Function with arguments
func = (sharpen, {'amount': 1.5})

# Sequential processing (pipeline within a step)
func = [
    stack(sharpen),
    normalize,
    equalize_histogram
]

# Component-specific processing
func = {
    "1": process_dapi,
    "2": process_calcein
}

# Semantically valid nested patterns
func = {
    "1": [                           # Channel 1: sequential processing
        (sharpen, {'amount': 1.5}),
        normalize,
        denoise_dapi
    ],
    "2": [enhance_calcein]           # Channel 2: different processing
}

# Note: Nested dictionaries aren't semantically valid
# (only one group_by per step makes sense)

The Stack Processing Evolution: Bridged the gap between single-image functions and stack-based processing:

# EZStitcher approach
func = stack(gaussian)  # Transforms single-image function to stack-aware

# OpenHCS evolution: stack_slices/unstack_slices system
# Automatic per-slice processing with memory type management

4. Specialized Step Types

ZFlatStep: Z-stack flattening with projection methods
CompositeStep: Multi-channel compositing with weights
PositionGenerationStep: Tile position calculation
ImageStitchingStep: Final image assembly

EZStitcher’s Limitations

Despite its architectural sophistication, EZStitcher hit fundamental performance and reliability walls:

Performance Bottlenecks

CPU-only processing: Hundreds of gigabytes processed slowly
Disk I/O between steps: Every operation read/wrote from disk
Memory inefficiency: No zero-copy operations
Single memory type: Only NumPy arrays supported

Reliability Issues

Silent failures: Academic code patterns that failed quietly
Basic error handling: No validation of processing chains
Format brittleness: Microscope-specific code paths

OpenHCS: The Architectural Revolution

Revolutionary Design Principles

OpenHCS didn’t just port EZStitcher to GPU - it fundamentally reimagined scientific computing architecture:

1. Memory Type System

Innovation: Explicit memory type contracts with automatic conversion.

@torch_func  # Function declares it works with PyTorch tensors
def n2v2_denoise_torch(image: torch.Tensor) -> torch.Tensor:
    # Function receives pre-converted tensor on correct device
    device = image.device  # No device management needed
    return denoised_tensor

@cupy_func   # Function declares it works with CuPy arrays
def gpu_ashlar_align_cupy(images: cp.ndarray) -> cp.ndarray:
    # Function receives pre-converted CuPy array
    return aligned_images

Benefits: - Functions focus on algorithms, not memory management - Automatic conversion between memory types (CuPy ↔ PyTorch ↔ NumPy) - Zero-copy GPU operations where possible - Compile-time validation of memory compatibility

2. Zero-Copy GPU Operations

Innovation: DLPack-based memory conversions for true zero-copy performance.

# Before (EZStitcher): CPU roundtrip
cupy_array → numpy_array → torch_tensor  # 2 copies, GPU→CPU→GPU

# After (OpenHCS): Direct GPU transfer
cupy_array → torch_tensor  # 0 copies, GPU→GPU via DLPack

Impact: Orders of magnitude performance improvement for large datasets.

3. Fail-Loudly Philosophy

Innovation: No silent degradation, explicit error handling.

# OpenHCS principle: Explicit failure over silent degradation
def _cupy_to_torch(data, allow_cpu_roundtrip=False):
    if not allow_cpu_roundtrip:
        raise MemoryConversionError("GPU conversion failed")
    # Never silently fall back to CPU

Contrast with Academic Code: - Academic: Silent CPU fallback when GPU fails - OpenHCS: Loud failure with clear error messages

4. Smell-Loop Validation

Innovation: Architectural review process preventing technical debt.

Plan File → Smell Review → Implementation → Validation

Purpose: Prevent the architectural rot that plagued EZStitcher extensions.

5. Pipeline Compiler

Innovation: Pre-execution validation of entire processing chains.

# Validates memory type compatibility before execution
compiled_contexts = orchestrator.compile_pipelines(
    pipeline_definition=pipeline.steps,
    well_filter=wells
)
# Fails fast if CuPy→PyTorch conversion not supported

Architectural Continuity

What OpenHCS Preserved from EZStitcher: - Pipeline → Step hierarchy (proven architecture) - Variable components pattern (intelligent grouping logic) - Group-by functionality (channel-specific processing) - Modular step design (composable workflows)

What OpenHCS Changed: - Memory management (explicit types vs implicit NumPy) - Error handling (fail loudly vs silent failures) - Performance (GPU-native vs CPU-only) - Validation (compile-time checks vs runtime surprises) - Function ecosystem (unified GPU library access vs manual integration)

Key Innovations and Differentiators

OpenHCS introduces several new systems that make it fundamentally different from traditional scientific computing tools. Each system is documented in detail in dedicated architecture documents:

🔥 Function Registry System 

Unified GPU function ecosystem with type-safe contracts

The most comprehensive GPU imaging function ecosystem in scientific computing, automatically discovering and unifying functions from pyclesperanto, scikit-image, CuCIM, and other libraries with consistent interfaces and memory type safety.

🖥️ TUI System 

Advanced terminal interface

A sophisticated Textual-based interface that works anywhere a terminal works - unprecedented for scientific computing tools. Includes real-time pipeline editing, live configuration management, integrated help, and professional log monitoring.

💾 Storage and Memory System 

Intelligent data management for 100GB+ datasets

Advanced Virtual File System with memory overlay capabilities, OME-ZARR compression, and smart backend switching that automatically scales from small experiments to massive high-content screening datasets.

⚡ Memory Type System 

Zero tolerance for silent failures

Comprehensive architecture that prevents the silent failures plaguing academic software through explicit validation, mandatory contracts, and clear error handling with actionable solutions.

🧬 External Integrations Overview 

Production neuroscience research deployment

Real-world deployment handling 100GB+ datasets in production neuroscience research with seamless integration with Napari, Fiji, and OMERO.

These innovations work together to create a scientific computing platform that is fundamentally different from traditional academic tools - providing enterprise-level reliability, unprecedented scale handling, and comprehensive GPU acceleration in a unified, user-friendly interface.

The Collaborative AI Innovation

Leveraging LLM Architectural Knowledge

The evolution from EZStitcher to OpenHCS represents a unique development methodology:

Traditional Approach: Domain expert → learns software engineering → builds tool OpenHCS Approach: Domain expert + AI architectural knowledge → builds production system

Key Collaborative Patterns

Architectural Guidance: AI provides software engineering best practices
Pattern Recognition: AI identifies anti-patterns and suggests improvements
Implementation Support: AI helps translate architectural vision into code
Debugging Partnership: Systematic problem-solving combining domain and technical expertise

Example: Memory Type System Design

Human: "I need GPU processing but different libraries use different array types"
AI: "Consider explicit memory type contracts with automatic conversion"
Human: "How do I prevent silent CPU fallbacks?"
AI: "Use decorators to declare memory requirements and fail loudly on violations"
Result: @torch_func/@cupy_func decorator system

Methodological Innovation

This represents a new model for scientific software development: - Domain expert drives architectural vision - AI provides software engineering expertise

- Iterative refinement through collaborative debugging - Real-time knowledge transfer from AI to human

Impact and Significance

Technical Impact

Performance: Orders of magnitude improvement through GPU-native processing
Reliability: Fail-loudly philosophy prevents silent data corruption
Extensibility: Memory type system enables easy addition of new processing functions
Interoperability: Format abstraction handles any microscope vendor

Scientific Impact

Reproducibility: Explicit validation prevents pipeline failures
Accessibility: Open-source alternative to expensive commercial solutions
Innovation: Enables new research through reliable, fast processing

Methodological Impact

Collaborative AI Development: Proves domain expert + AI can build production systems
Architectural Discipline: Shows how to prevent technical debt in scientific software
Knowledge Transfer: Demonstrates AI-assisted learning of software engineering

Lessons for Scientific Computing

Architectural Principles

Explicit over implicit: Declare requirements clearly (memory types, device placement)
Fail loudly over silent degradation: Better to crash than produce wrong results
Validation over hope: Check compatibility before execution, not during
Modularity over monoliths: Composable components enable flexible workflows

Development Methodology

Collaborative AI partnership: Leverage AI architectural knowledge
Iterative refinement: Build, test, improve through systematic debugging
Domain-driven design: Let research needs drive architectural decisions
Production mindset: Build for reliability, not just functionality

Future Evolution

OpenHCS establishes patterns that could transform scientific computing:

Technical Directions

Multi-GPU orchestration: Scale to larger datasets
Cloud-native deployment: Enable distributed processing
Real-time processing: Support live microscopy workflows
Advanced validation: Deeper architectural integrity checks

Methodological Directions

AI-assisted architecture: Deeper integration of AI in design decisions
Collaborative debugging: Systematic approaches to complex problem-solving
Knowledge preservation: Document architectural decisions and reasoning
Community development: Enable other researchers to contribute effectively

Conclusion

The evolution from EZStitcher to OpenHCS demonstrates that effective scientific software can emerge from the combination of:

Deep domain expertise (understanding real research problems)
Architectural vision (seeing beyond immediate needs)
Collaborative AI development (leveraging AI software engineering knowledge)
Systematic methodology (disciplined approach to complex problems)

OpenHCS proves that researchers don’t need to become software engineers - they need to become effective collaborators with AI systems that have architectural expertise.

The result: Enterprise-level scientific computing infrastructure that enables better research through better tools.

“The best software comes not from software engineers, but from researchers who refuse to accept that their tools have to suck.” - OpenHCS Origin Story