DS2API: Convert DeepSeek into an OpenAI/Claude/Gemini-compatible API gateway

DS2API exposes DeepSeek via OpenAI/Claude/Gemini-compatible APIs, offering a pure Go backend, React admin UI and concurrency/multi-account governance; ideal for self-hosted or containerized deployments to integrate with existing AI SDKs.

GitHub CJackHwang/ds2api Updated 2026-04-26 Branch main Stars 4.6K Forks 1.5K

Go backend API adapter/proxy Self-hosted / hybrid deployment Tool-calling & concurrency control

💡 Deep Analysis

What core problem does DS2API solve and how does it expose DeepSeek capabilities as OpenAI/Claude/Gemini-compatible APIs?

Core Analysis ¶

Project Positioning: DS2API is a self-hosted protocol compatibility and runtime governance gateway that exposes DeepSeek’s chat/completion/embedding capabilities as OpenAI/Anthropic/Google Gemini-compatible HTTP APIs, enabling existing clients, SDKs, and tools to integrate with minimal changes.

Technical Features ¶

API Surface Compatibility: Implements standard endpoints such as POST /v1/chat/completions, /anthropic/v1/messages, and /v1beta/models/:generateContent to align with mainstream SDK contracts.
PromptCompat: Converts provider-specific request semantics and formats into DeepSeek web-text conversation contexts, addressing differences in prompt structure, role annotations, and metadata.
Runtime Governance: Includes an account pool (auto token refresh, dual login modes), in-flight slot limits + wait queues, and PoW to protect resources and mitigate quota shocks.

Practical Recommendations ¶

Validate compatibility paths: Use the Admin UI or config.example.json for model_aliases mapping and perform small-scale tests to ensure /v1/models behaves as expected.
Enable concurrency protections: Configure in-flight limits and queue sizes before production to avoid request pile-ups during upstream quota drops.
Use the Tool Sieve: If your app invokes tools or external actions, enable incremental delta.tool_calls output and leakage-prevention rules to reduce false positives/negatives.

Important Notice: The README states this repo is for learning/experimentation and the license is unspecified; validate legal and operational constraints before commercial deployment.

Summary: DS2API reduces integration cost by standardizing DeepSeek as a drop-in backend and consolidates common runtime complexities (auth, concurrency, tool safety) into the gateway layer.

85.0%

How does PromptCompat perform semantic adaptation? What are its advantages and boundary conditions?

Core Analysis ¶

Key Question: PromptCompat aims to semantically convert inputs from different APIs (OpenAI, Anthropic, Gemini) into a DeepSeek web-text conversation context so requests behave comparably across backends.

Technical Analysis ¶

Conversion Logic: PromptCompat parses messages / inputs / parameters, extracts system instructions, role order, and tool-call markers, and reconstructs a linear web-text context consumable by DeepSeek. It incorporates model_aliases, history_split, and may insert metadata for backend parsing.
Advantages:
Unified semantics: Reduces client/SDK changes;
Centralized governance: Enables uniform history trimming, Tool Sieve, and leakage prevention;
Operationally tunable: Rules can be hot-updated via Admin UI.
Boundaries & Limitations:
Cannot perfectly reproduce all edge behaviors or vendor-specific metadata (e.g., some response-level fields or token accounting);
Long histories and complex multi-tool interactions depend on history_split and downstream post-processing;
If DeepSeek upstream changes, PromptCompat rules must be maintained.

Practical Recommendations ¶

Validate with representative requests: Simulate typical messages/generateContent calls in the Admin UI and compare outputs.
Enable history_split: Use history splitting for long sessions and tune thresholds in the WebUI.
Maintain regression tests: Keep a suite of vendor-vs-DeepSeek test cases to catch behavioral regressions.

Important Notice: PromptCompat is a semantic mapper—effective for common cases but not a perfect behavioral emulator. If you need vendor-exact behavior, plan for additional adaptation or accept residual differences.

Summary: PromptCompat lowers cross-vendor integration effort but requires extra validation for edge behaviors, complex tool interactions, and long-context scenarios.

85.0%

How do DS2API's account pool, PoW, and concurrency queue work together to ensure stability? What are practical configuration recommendations?

Core Analysis ¶

Key Question: How to prevent upstream quota exhaustion, auth failures, or instability caused by traffic bursts in a self-hosted proxy? DS2API uses an account pool, PoW, and concurrency queue as a three-layer approach to mitigate these risks.

Technical Analysis ¶

Account Pool: Distributes requests across multiple DeepSeek accounts with auto token refresh and rotation. This reduces the chance of exhausting a single account’s quota.
PoW (Proof-of-Work): High-performance Go PoW serves as flow control or anti-abuse mechanism, reducing backend pressure during bursts or malicious traffic.
Concurrency Queue (in-flight limit + waiting queue): Caps concurrent requests per account; excess requests queue or get rejected, ensuring predictable latency and resource usage.

Combined: the pool provides parallelism, the queue controls instantaneous concurrency, and PoW deters abuse. Auto token refresh reduces auth-failure rates.

Practical Configuration Recommendations ¶

Start conservatively: Set a low per-account in-flight limit (e.g., 2–4) in the Admin UI and observe latency and success metrics before scaling up.
Enable auto-refresh and monitor failures: Use the account test features and /healthz to detect auth regressions.
Use history_split: Reduce payload sizes per request to lower per-request processing time and free queue slots faster.
Enable PoW only when needed: PoW introduces client-side compute cost—use it as a defensive measure against abuse or oscillatory load.

Important Notice: Built-in concurrency control suits small-to-medium self-hosted setups. For very large or multi-tenant deployments, add global rate limiting, autoscaling, and advanced telemetry.

Summary: Properly tuned, the account pool + queue + PoW stack improves stability and predictability under constrained upstream quotas, but requires incremental tuning and monitoring to find optimal settings.

85.0%

How to integrate DS2API into existing SDKs (e.g., LangChain, Vercel AI SDK, LlamaIndex)? What are key steps and caveats?

Core Analysis ¶

Key Question: What steps and verifications are needed to plug DS2API into existing SDK/client chains (LangChain, Vercel AI SDK, LlamaIndex)?

Technical Steps & Key Actions ¶

Replace endpoint / base URL: Point the client’s API base to the DS2API instance (e.g., https://ds2api.example.com/v1 or .../anthropic/v1).
Auth & headers: Ensure the SDK’s key format matches DS2API’s Auth Resolver (Authorization, x-goog-api-key, etc.) and verify via Admin UI.
Configure model_aliases: Map common aliases (e.g., gpt-4.1 -> deepseek-v4-pro) in config.example.json or Admin UI to avoid model name mismatches.
Validate streaming / Node SSE: If using Vercel AI SDK streaming, deploy DS2API’s Vercel Node Stream and test SSE event concatenation and completeness.
Test Tool Calling compatibility: For tool invocations, check DeepSeek’s expected XML/structure and adapt PromptCompat or client parsing.

Practical Recommendations ¶

Run an end-to-end PoC: Deploy to local/staging, execute representative LangChain/LlamaIndex flows, capture differences, and tune PromptCompat via Admin UI.
Use health checks & account tests: Employ /healthz, /readyz, and Admin API account tests to verify auth and upstream connectivity.
Watch CORS & preflight: DS2API centralizes CORS policy, but validate browser preflight behavior for your deployment.

Important Notice: Although compatibility is marked P0/P1, validate edge cases (complex responses, token accounting, tool-calling nuances) and run regression tests before production.

Summary: Integration is straightforward: replace base URL, align auth, configure aliases, verify streaming/tool behaviors, and iteratively tune via the Admin UI to use DS2API as a drop-in backend.

85.0%

✨ Highlights

Unified gateway compatible with OpenAI, Claude and Gemini APIs
Pure Go backend with a React-based administrative WebUI
Repository license is unspecified, posing potential commercial compliance risk
Metadata reports zero contributors/commits; verify actual maintenance and activity

🔧 Engineering

Provides rich HTTP endpoints and routing compatible with OpenAI/Claude/Gemini
Built-in PromptCompat, tool-calling, PoW, account pooling and concurrency queue management

⚠️ Risks

License unknown and disclaimer states research-only; evaluate cautiously for commercial/production use
Depends on upstream DeepSeek; credential management introduces security and availability risks

👥 For who?

Self-hosting teams and engineering/platform integrators needing OpenAI-compatible access
Users aiming to integrate DeepSeek into existing SDKs with multi-account and concurrency control