Middleware Pipeline (BETA)

BETA Feature

Middleware pipeline is currently in beta. It is disabled by default and requires explicit configuration to enable.

Extend GoZen with pluggable middleware for request/response transformation, logging, rate limiting, and custom processing.

Features

Pluggable architecture — Add custom processing logic without modifying core code
Priority-based execution — Control middleware execution order
Request/response hooks — Process requests before sending, responses after receiving
Built-in middleware — Context injection, logging, rate limiting, compression
Plugin loader — Load middleware from local files or remote URLs
Error handling — Graceful error handling with fallback behavior

Architecture

Client Request
    ↓
[Middleware 1: Priority 100]
    ↓
[Middleware 2: Priority 200]
    ↓
[Middleware 3: Priority 300]
    ↓
Provider API
    ↓
[Middleware 3: Response]
    ↓
[Middleware 2: Response]
    ↓
[Middleware 1: Response]
    ↓
Client Response

Configuration

Enable Middleware Pipeline

{
  "middleware": {
    "enabled": true,
    "pipeline": [
      {
        "name": "context-injection",
        "enabled": true,
        "priority": 100,
        "config": {}
      },
      {
        "name": "request-logger",
        "enabled": true,
        "priority": 200,
        "config": {
          "log_level": "info"
        }
      }
    ]
  }
}

Options:

Option	Description
`enabled`	Enable middleware pipeline
`pipeline`	Array of middleware configurations
`name`	Middleware identifier
`priority`	Execution order (lower = earlier)
`config`	Middleware-specific configuration

Built-in Middleware

1. Context Injection

Inject custom context into requests.

{
  "name": "context-injection",
  "enabled": true,
  "priority": 100,
  "config": {
    "system_prompt": "You are a helpful coding assistant.",
    "metadata": {
      "session_id": "sess_123",
      "user_id": "user_456"
    }
  }
}

Use cases:

Add system prompts
Inject session metadata
Add user context

2. Request Logger

Log all requests and responses.

{
  "name": "request-logger",
  "enabled": true,
  "priority": 200,
  "config": {
    "log_level": "info",
    "log_body": false,
    "log_headers": true
  }
}

Use cases:

Debugging
Audit trails
Performance monitoring

3. Rate Limiter

Limit request rate per provider or globally.

{
  "name": "rate-limiter",
  "enabled": true,
  "priority": 300,
  "config": {
    "requests_per_minute": 60,
    "burst": 10,
    "per_provider": true
  }
}

Use cases:

Prevent rate limit errors
Control API usage
Protect against abuse

4. Compression (BETA)

Compress context when token count exceeds threshold.

{
  "name": "compression",
  "enabled": true,
  "priority": 400,
  "config": {
    "threshold_tokens": 50000,
    "target_tokens": 20000
  }
}

See Context Compression for details.

5. Session Memory (BETA)

Maintain conversation memory across sessions.

{
  "name": "session-memory",
  "enabled": true,
  "priority": 150,
  "config": {
    "max_memories": 100,
    "ttl_hours": 24,
    "storage": "sqlite"
  }
}

Use cases:

Remember user preferences
Track conversation history
Maintain context across sessions

6. Orchestration (BETA)

Route requests to multiple providers and aggregate responses.

{
  "name": "orchestration",
  "enabled": true,
  "priority": 500,
  "config": {
    "strategy": "parallel",
    "providers": ["anthropic", "openai"],
    "consensus": "longest"
  }
}

Use cases:

Compare model outputs
Redundancy for critical requests
Quality improvement through consensus

Custom Middleware

Middleware Interface

type Middleware interface {
    Name() string
    Priority() int
    ProcessRequest(ctx *RequestContext) error
    ProcessResponse(ctx *ResponseContext) error
}

type RequestContext struct {
    Provider  string
    Model     string
    Messages  []Message
    Metadata  map[string]interface{}
}

type ResponseContext struct {
    Provider  string
    Model     string
    Response  *APIResponse
    Latency   time.Duration
    Metadata  map[string]interface{}
}

Example: Custom Header Injection

package main

import (
    "github.com/dopejs/gozen/internal/middleware"
)

type CustomHeaderMiddleware struct {
    headers map[string]string
}

func (m *CustomHeaderMiddleware) Name() string {
    return "custom-headers"
}

func (m *CustomHeaderMiddleware) Priority() int {
    return 250
}

func (m *CustomHeaderMiddleware) ProcessRequest(ctx *middleware.RequestContext) error {
    for k, v := range m.headers {
        ctx.Metadata[k] = v
    }
    return nil
}

func (m *CustomHeaderMiddleware) ProcessResponse(ctx *middleware.ResponseContext) error {
    // No response processing needed
    return nil
}

func init() {
    middleware.Register("custom-headers", func(config map[string]interface{}) middleware.Middleware {
        return &CustomHeaderMiddleware{
            headers: config["headers"].(map[string]string),
        }
    })
}

Loading Custom Middleware

Local Plugin

{
  "middleware": {
    "enabled": true,
    "plugins": [
      {
        "type": "local",
        "path": "/path/to/custom-middleware.so",
        "config": {
          "headers": {
            "X-Custom-Header": "value"
          }
        }
      }
    ]
  }
}

Remote Plugin

{
  "middleware": {
    "enabled": true,
    "plugins": [
      {
        "type": "remote",
        "url": "https://example.com/middleware/custom-headers.so",
        "checksum": "sha256:abc123...",
        "config": {}
      }
    ]
  }
}

Web UI

Access middleware settings at http://localhost:19840/settings:

Navigate to "Middleware" tab (marked with BETA badge)
Toggle "Enable Middleware Pipeline"
Add/remove middleware from pipeline
Adjust priority and configuration
Enable/disable individual middleware
Click "Save"

API Endpoints

List Middleware

GET /api/v1/middleware

Response:

{
  "enabled": true,
  "pipeline": [
    {
      "name": "context-injection",
      "enabled": true,
      "priority": 100,
      "type": "builtin"
    },
    {
      "name": "request-logger",
      "enabled": true,
      "priority": 200,
      "type": "builtin"
    }
  ]
}

Add Middleware

POST /api/v1/middleware
Content-Type: application/json

{
  "name": "rate-limiter",
  "enabled": true,
  "priority": 300,
  "config": {
    "requests_per_minute": 60
  }
}

Update Middleware

PUT /api/v1/middleware/{name}
Content-Type: application/json

{
  "enabled": false
}

Remove Middleware

DELETE /api/v1/middleware/{name}

Reload Pipeline

POST /api/v1/middleware/reload

Use Cases

Development Environment

Add debug logging and request inspection:

{
  "middleware": {
    "enabled": true,
    "pipeline": [
      {
        "name": "request-logger",
        "enabled": true,
        "priority": 100,
        "config": {
          "log_level": "debug",
          "log_body": true
        }
      }
    ]
  }
}

Production Environment

Add rate limiting and monitoring:

{
  "middleware": {
    "enabled": true,
    "pipeline": [
      {
        "name": "rate-limiter",
        "enabled": true,
        "priority": 100,
        "config": {
          "requests_per_minute": 100,
          "burst": 20
        }
      },
      {
        "name": "request-logger",
        "enabled": true,
        "priority": 200,
        "config": {
          "log_level": "info",
          "log_body": false
        }
      }
    ]
  }
}

Multi-Provider Comparison

Use orchestration to compare outputs:

{
  "middleware": {
    "enabled": true,
    "pipeline": [
      {
        "name": "orchestration",
        "enabled": true,
        "priority": 500,
        "config": {
          "strategy": "parallel",
          "providers": ["anthropic", "openai", "google"],
          "consensus": "longest"
        }
      }
    ]
  }
}

Best Practices

Use appropriate priorities — Lower numbers execute first
Keep middleware focused — Each middleware should do one thing well
Handle errors gracefully — Don't break the pipeline on errors
Test thoroughly — Verify middleware behavior before production
Monitor performance — Track middleware overhead
Document configuration — Clearly document config options

Limitations

Performance overhead — Each middleware adds latency
Complexity — Too many middleware can make debugging difficult
Plugin security — Remote plugins require trust and verification
Error propagation — Middleware errors can affect all requests
Configuration complexity — Complex pipelines are harder to maintain

Troubleshooting

Middleware not executing

Verify middleware.enabled is true
Check middleware is enabled in pipeline
Verify priority is set correctly
Review daemon logs for middleware errors

Unexpected behavior

Check middleware execution order (priority)
Verify configuration is correct
Test middleware in isolation
Review middleware logs

Performance issues

Identify slow middleware (check logs)
Reduce middleware count
Optimize middleware implementation
Consider disabling non-essential middleware

Plugin loading failures

Verify plugin path is correct
Check plugin is compiled for correct architecture
Verify checksum matches (for remote plugins)
Review plugin logs for errors

Security Considerations

Validate plugins — Only load trusted plugins
Verify checksums — Always verify remote plugin checksums
Sandbox plugins — Consider running plugins in isolated environment
Audit middleware — Review middleware code before deployment
Monitor behavior — Watch for unexpected middleware behavior

Future Enhancements

WebAssembly plugin support for cross-platform compatibility
Middleware marketplace for sharing community plugins
Visual pipeline editor in Web UI
Middleware performance profiling
Hot-reload for plugin updates
Middleware testing framework

Features​

Architecture​

Configuration​

Enable Middleware Pipeline​

Built-in Middleware​

1. Context Injection​

2. Request Logger​

3. Rate Limiter​

4. Compression (BETA)​

5. Session Memory (BETA)​

6. Orchestration (BETA)​

Custom Middleware​

Middleware Interface​

Example: Custom Header Injection​

Loading Custom Middleware​

Local Plugin​

Remote Plugin​

Web UI​

API Endpoints​

List Middleware​

Add Middleware​

Update Middleware​

Remove Middleware​

Reload Pipeline​

Use Cases​

Development Environment​

Production Environment​

Multi-Provider Comparison​

Best Practices​

Limitations​

Troubleshooting​

Middleware not executing​

Unexpected behavior​

Performance issues​

Plugin loading failures​

Security Considerations​

Future Enhancements​

Features

Architecture

Configuration

Enable Middleware Pipeline

Built-in Middleware

1. Context Injection

2. Request Logger

3. Rate Limiter

4. Compression (BETA)

5. Session Memory (BETA)

6. Orchestration (BETA)

Custom Middleware

Middleware Interface

Example: Custom Header Injection

Loading Custom Middleware

Local Plugin

Remote Plugin

Web UI

API Endpoints

List Middleware

Add Middleware

Update Middleware

Remove Middleware

Reload Pipeline

Use Cases

Development Environment

Production Environment

Multi-Provider Comparison

Best Practices

Limitations

Troubleshooting

Middleware not executing

Unexpected behavior

Performance issues

Plugin loading failures

Security Considerations

Future Enhancements