Craft Agents: Agent-native desktop multi-source AI workspace

Agent-native desktop for multi-LLM integration and session management.

GitHub craft-ai-agents/craft-agents-oss Updated 2026-07-03 Branch main Stars 6.6K Forks 916

Desktop App Multi-LLM Integration MCP/REST Data Sources Session Management No-code Skill Config Apache-2.0

💡 Deep Analysis

How to securely deploy Craft Agents in production (self-hosted/headless)?

Core Analysis ¶

Security summary: For production, host the headless server in a controlled environment, enforce TLS and strong token auth, and apply least-privilege, credential management, audit logging, and resource governance.

Mandatory Controls ¶

Network & deployment: Deploy server in a controlled network/private cloud; restrict inbound access to trusted desktop clients/internal networks.
Authentication & encryption: Enforce TLS (HTTPS/WSS) and use CRAFT_SERVER_TOKEN or equivalent; store API keys in a secrets manager (Vault/cloud secrets).
Permissions & audit: Default to conservative permission mode (Ask to Edit), add approvals for critical automations, and enable audit logs exported to SIEM/monitoring.
Subprocess isolation: Run MCP subprocesses in containers or restricted userspaces to limit filesystem/process access.
Cost & quota governance: Implement model call quotas, budget alerts, and rate limits to prevent abuse or surprise bills.

Practical Deployment Steps ¶

Deploy headless server in a controlled environment with TLS and CRAFT_SERVER_TOKEN.
Centralize credential management and adopt least-privilege keys/roles.
Export audit logs to your monitoring/compliance pipeline and enable approval workflows.
Sandbox subprocesses and gate Skill changes via change control.

Important Notice: Do not expose an unprotected server to the public internet; auto-discovery of private APIs may require manual configuration in restricted networks.

Summary: Secure production use is achievable via network isolation, encrypted auth, least-privilege, auditing, and cost controls, but requires coordination with ops and compliance teams.

88.0%

What is the onboarding curve for typical knowledge workers using Craft Agents? What common issues and best practices should they know?

Core Analysis ¶

Positioning & Onboarding: Craft Agents is user-friendly for typical knowledge workers—natural language commands can attach services, create skills, and run conversations. However, mastering self-hosting, MCP subprocesses, and complex automations requires technical support.

Common Issues ¶

Auto-connection failures: Private networks or non-standard API docs can break auto-discovery—manual OpenAPI or doc fixes are needed.
Cost & quota management: Parallel multi-model use and frequent automations can incur significant usage costs or hit rate limits.
Subprocess/permission issues: Local MCP subprocesses may fail due to permission, logging, or process management issues, impacting skill availability.

Best Practices ¶

Pilot at small scale: Connect a limited set of services in one workspace and encapsulate common operations as Skills.
Conservative permission mode: Default to Ask to Edit and set approvals/audit trails for critical automations.
Centralize credentials & network: Deploy headless server in a controlled environment with TLS and a strong token (CRAFT_SERVER_TOKEN).
Monitor cost & rate: Implement quotas/alerts for model calls to avoid unexpected bills.

Important Notice: Automation isn’t foolproof—prepare engineering support for private APIs, complex OAuth flows, and high-concurrency scenarios.

Summary: Day-to-day use is approachable for non-engineers, but production-grade, secure, and cost-controlled deployments require governance and technical operations.

86.0%

How to manage cost, rate limits, and consistency in a multi-model/multi-provider environment?

Core Analysis ¶

Core issue: Running multiple models/providers introduces cost, rate-limit, and consistency challenges. Craft Agents supports multi-provider connections, but platform policies and engineering controls are required to prevent runaway spending and unstable behavior.

Tech & Operational Strategies ¶

Platform configuration:
Use per-workspace defaults to set preferred models and cost-sensitive policies.
Implement per-call cost estimation (tokens/requests) and show estimates in the UI.
Rate & concurrency control:
Enforce rate-limits, concurrency queues, and backoff strategies at the server layer; gate high-cost ops with approvals.
Adapter pattern:
Wrap each provider as an adapter to unify error handling, retries, rate-limits, and fallback logic.
Consistency guarantees:
Use standardized prompt templates, output schemas, and regression tests to compare different model outputs.

Practical Checklist ¶

Set default models and budget thresholds per workspace.
Enable call logs and cost monitoring with alerts for anomalies.
Require approvals or rate limits on high-cost automations.
Run periodic A/B tests of model outputs and update Skill templates for consistency.

Warning: Uncontrolled multi-model setups can quickly blow budgets or produce unpredictable outputs—enforce governance at both platform and organizational levels.

Summary: With workspace defaults/quotas, provider adapters (rate/backoff/fallback), and organizational controls (approvals/monitoring), multi-model deployments in Craft Agents can be managed safely.

85.0%

How feasible is it to migrate existing Claude Code skills or scripts to Craft Agents and what should be considered?

Core Analysis ¶

Feasibility: Craft Agents explicitly supports importing Claude Code skills—prompt/strategy-type skills migrate with minimal friction. Complex skills that depend on local runtimes, specific SDKs, or restricted resources require additional adaptation.

Migration Considerations ¶

Skill types:
Prompt/flow-only: Usually importable as Skills with little change.
Local binary/runtime dependent: Encapsulate runtime as a Source (subprocess/container) and handle filesystem/permission access.
Credentials & permissions: Audit imported Skills immediately—set high-risk operations to Ask to Edit or require approvals.
Resource & cost checks: Model billing and rate limits may differ post-migration—measure in a test environment and set quotas.
Hot updates & change control: Take advantage of hot reloads, but validate changes in a sandbox workspace before production rollout.

Practical Steps ¶

Import Claude Code Skills into an isolated test workspace and run regression tests.
For skills with local deps, create corresponding Sources (containers/subprocesses) with restricted permissions.
Apply approvals and audit logs for critical write operations.
Monitor cost and configure quotas/alerts.

Important Notice: Auto-migration improves efficiency but does not replace manual review for complex runtimes and security boundaries.

Summary: Migration is straightforward for text/API-driven Skills; complex Skills require environment encapsulation, governance, and cost validation.

84.0%

Why use an Electron + bun/TypeScript + Claude/Pi SDK stack, and what are the architectural advantages?

Core Analysis ¶

Rationale: The combination of Electron + bun/TypeScript + multiple model SDKs (Claude/Pi, etc.) aims to balance cross-platform desktop UX, developer velocity, and real-time streaming interactions while supporting diverse data-source integration (HTTP/OpenAPI, local subprocess, filesystem).

Technical Features & Advantages ¶

Cross-platform desktop (Electron): Enables drag-and-drop, local file access, subprocess management, and a unified UI across macOS/Linux/Windows.
Efficient server runtime (bun + TypeScript): Fast startup and modern JS features for headless remote services and quick hot updates.
Real-time streaming (WebSocket + streaming SDKs): Model responses and tool invocations can be streamed, improving perceived responsiveness.
High compatibility (subprocess MCP): stdin/stdout-based subprocesses allow integration with non-HTTP or local MCPs, reducing external dependencies.

Practical Recommendations ¶

Resource trade-offs: Electron has higher memory/CPU needs—prefer headless server + thin client for resource-constrained deployments.
Test bun compatibility: While bun offers performance benefits, validate compatibility with necessary Node ecosystem packages.
Adapterize SDKs: Encapsulate model providers behind adapters to simplify switching or fallback strategies.

Important Notice: Electron increases binary size and memory use; bun may lack the maturity of Node in some enterprise contexts and should be validated before production.

Summary: The stack is well-suited for interactive, multi-model desktop workflows with fast iteration, but requires consideration around client resource use and enterprise compatibility.

83.0%

✨ Highlights

Multi-LLM and MCP one-click integrations with minimal config
Desktop multi-session inbox with persistent conversation storage
Repository metadata is inconsistent; contributor and release data are missing
License and primary tech stack are not fully clear in the provided metadata

🔧 Engineering

Agent-native desktop client supporting streaming responses and tool visualization
Built-in multi-source connectors: MCP, REST APIs, local files and databases

⚠️ Risks

README states Apache-2.0 but the repository license field is missing in the provided summary; license should be verified
Provided development-activity data shows no commits or contributors; this hinders security and long-term maintenance assessment

👥 For who?

Suitable for product and engineering teams that need to orchestrate LLMs into product workflows
Friendly to non-developers; supports natural-language driven configuration of skills and data sources