feat: auto-launching local proxy server for eversale LLM routing by codegen-sh[bot] · Pull Request #219 · Zeeeepa/codegen

codegen-sh · 2026-03-08T10:13:05Z

Summary

Adds a local proxy server that auto-launches when eversale starts and routes all LLM calls to your configured backend. This eliminates the need to modify eversale's core logic for different LLM providers.

Architecture

┌──────────────┐     ┌──────────────────────┐     ┌─────────────────┐
│   eversale   │────▶│  Local Proxy Server  │────▶│  Anthropic API  │
│ gpu_llm_client│     │  localhost:8765       │     │  OpenAI API     │
│              │     │                      │     │  Ollama (local) │
│ Always sends │     │ Auto-translates      │     │  Z.AI / Custom  │
│ OpenAI format│     │ to target backend    │     └─────────────────┘
└──────────────┘     └──────────────────────┘

New Files

File	Lines	Description
`proxy_server.py`	~620	aiohttp gateway — 4 backend adapters (Anthropic, OpenAI, Ollama, Custom), full SSE streaming, format translation
`proxy_launcher.py`	~250	Subprocess lifecycle — idempotent start, PID management, health-check gating, CLI
`proxy_config.py`	~165	Config hub — backend selection, model mapping, env var routing

Proxy Endpoints

Endpoint	Method	Description
`/v1/chat/completions`	POST	OpenAI-compatible chat (streaming + non-streaming)
`/v1/models`	GET	Model listing
`/api/chat`	POST	Ollama-compatible chat
`/health`	GET	Readiness probe

Bug Fixes (PR #213)

llm_fallback_chain.py: Added missing import os (was causing NameError at import time)
llm_fallback_chain.py: Added missing _looks_like_ollama_endpoint() method (was causing AttributeError at runtime)

Modified Files

gpu_llm_client.py: Default URL → localhost:8765, added _is_local_proxy(), proxy handles auth
config.yaml: Added proxy section, URLs point to proxy
.env.example: Full documentation of proxy env vars
README.md: Architecture diagram, quick-start for all 4 backends

Quick Start

export LLM_BACKEND=anthropic  # or openai, ollama, custom
export ANTHROPIC_API_KEY=sk-ant-...
python proxy_launcher.py start

Based on PR #213 (codegen-bot/eversale-local-api-integration-a7f3c2).

💻 View my work • 👤 Initiated by @Zeeeepa • About Codegen
⛔ Remove Codegen from PR • 🚫 Ban action checks

Summary by cubic

Adds an auto-launching local proxy that routes all LLM calls to your chosen backend (Anthropic/OpenAI/Ollama/custom), removing provider-specific logic from eversale and simplifying setup. Default LLM requests now go through http://127.0.0.1:8765 with streaming and model mapping handled by the proxy.

New Features
- Added proxy_server.py: OpenAI-compatible /v1/chat/completions, /v1/models, and Ollama-compatible /api/chat with SSE streaming; adapters for Anthropic, OpenAI, Ollama, and custom.
- Added proxy_launcher.py: idempotent background start, health-check gating, PID management; binds to 127.0.0.1:8765.
- Updated gpu_llm_client.py: defaults to the local proxy; auth handled by proxy; supports proxy detection.
- Added proxy_config.py, config.yaml, .env.example, and README.md: backend selection via LLM_BACKEND, model mapping, and quick start.
Bug Fixes
- llm_fallback_chain.py: added missing import os and _looks_like_ollama_endpoint() to fix NameError/AttributeError.

^{Written for commit bcd8193. Summary will update on new commits.}

Modified 7 files from eversale-cli-2.1.216 to run locally with custom API key: 1. engine/config/config.yaml - mode=local, all models=glm-5, endpoints=Z.AI 2. engine/agent/gpu_llm_client.py - URL=ANTHROPIC_BASE_URL, auth=ANTHROPIC_API_KEY 3. engine/agent/llm_fallback_chain.py - defaults to env vars 4. engine/agent/kimi_k2_client.py - added anthropic provider (auto-detect first) 5. bin/eversale.js - license check bypassed for local dev 6. engine/agent/license_validator.py - validate functions return True 7. engine/agent/config_loader.py - ANTHROPIC_BASE_URL in env chain Verification: 30/30 tests pass including live API call to Z.AI with GLM-5.

- proxy_server.py: aiohttp gateway with 4 backend adapters (Anthropic, OpenAI, Ollama, Custom/Z.AI) Endpoints: /v1/chat/completions, /v1/models, /api/chat, /health Full SSE streaming support for all backends - proxy_launcher.py: subprocess lifecycle management Idempotent startup with health-check gating PID file management, graceful shutdown, CLI interface Cross-platform: Unix (SIGTERM) + Windows (TASKKILL) - proxy_config.py: centralized configuration Backend selection via LLM_BACKEND env var Model name mapping (glm-5 → actual model per backend) API key routing from ANTHROPIC_API_KEY, OPENAI_API_KEY, etc. - gpu_llm_client.py: default URL to localhost:8765 proxy Added _is_local_proxy() detection Proxy handles auth internally (no token from client) - llm_fallback_chain.py: BUG FIX Added missing 'import os' (NameError at import time) Added missing _looks_like_ollama_endpoint() method - config.yaml: proxy section, URLs point to localhost:8765 - .env.example: all proxy-related env vars documented - README.md: architecture diagram, quick-start guide Co-authored-by: Zeeeepa <zeeeepa@gmail.com>

codegen-sh bot and others added 2 commits March 5, 2026 05:03

codegen-sh bot assigned Zeeeepa Mar 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: auto-launching local proxy server for eversale LLM routing#219

feat: auto-launching local proxy server for eversale LLM routing#219
codegen-sh[bot] wants to merge 2 commits intodevelopfrom
codegen-bot/eversale-proxy-server-f8a2d1

codegen-sh bot commented Mar 8, 2026 •

edited by cubic-dev-ai bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

codegen-sh bot commented Mar 8, 2026 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Architecture

New Files

Proxy Endpoints

Bug Fixes (PR #213)

Modified Files

Quick Start

Summary by cubic

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

codegen-sh bot commented Mar 8, 2026 •

edited by cubic-dev-ai bot

Loading