On-Premises AI Infrastructure

Enterprise AI assistants.
Your infrastructure. Your data.

Deploy production-grade AI assistants in minutes, not months. Multi-provider LLM support, intelligent knowledge bases, MCP tool integration, enterprise security — all running on your own infrastructure.

100% On-Premises. Zero compromise.

Gustave runs entirely within your infrastructure. Your data never leaves your environment. No cloud dependencies. No external API calls for core functionality. Complete control.

Full Data Sovereignty
Your Infrastructure
Compliance Ready
Air-Gap Compatible

Built for enterprise scale

4 · Purpose-Built Studios
5 · LLM Providers Supported
100% · On-Premises
<5 min · To First Assistant

Enterprise AI is a complex domain.
These patterns show up everywhere.

We've seen dozens of AI initiatives hit the same walls. Not because teams aren't smart — but because this stuff is genuinely hard, and the pitfalls aren't obvious until you're in them. Gustave is built around what we've learned.

The Document Dump

It seems logical: upload your docs, let the AI figure it out. But without proper chunking strategies and curation, you get inconsistent results. RAG isn't a feature — it's a discipline that takes time to get right.

The Provider Trap

You pick a provider, build around their API, and six months later a better model comes out elsewhere. Suddenly switching means rewriting everything. It's an easy trap to fall into.

The POC Gap

The demo works beautifully. Then someone asks about authentication. Multi-tenancy. Version control. Audit trails. The gap between "working prototype" and "production system" is bigger than it looks.

The Infrastructure Surprise

Every team building AI in-house discovers the same thing: the plumbing takes 3x longer than expected. Weeks turn into months. The use cases wait while infrastructure gets built.

The Security Afterthought

Access control, tenant isolation, SSO — these feel like "later" problems until compliance asks about them. Bolting security onto an existing system is always harder than building it in from the start.

The "What Changed?" Problem

Someone updates a prompt and results get worse. Which version was working? What exactly changed? Without proper versioning, debugging AI systems becomes guesswork.

Four studios.
One platform.

Start with the Core — everything you need to launch your first assistant. Add Knowledge Studio and MCP Studio when you're ready to go deeper. Each piece is independent. All of them work better together.
Gustave Core — Your Launchpad

This is where every Gustave deployment begins. Assistants Studio is your design workbench — wire up providers, author prompts, compose assistants, control who sees what. Chat is where those assistants meet your users. Together, they're everything you need to go from zero to a production assistant in minutes.

Gustave Assistants Studio

The workbench where your AI assistants take shape. Connect providers, craft prompts, compose capabilities, manage access — then publish with a click.

  • Multi-provider LLM management
  • Prompt & skill authoring
  • Assistant composition & versioning
  • RBAC, roles & project management
  • Full audit trail

Gustave Chat

Where your assistants meet the world. Real-time streaming, tool calling, RAG-augmented conversations — plus a full API so you can embed assistants anywhere.

  • Real-time streaming responses
  • Multi-turn tool calling (MCP)
  • RAG-augmented conversations
  • Rich markdown rendering
  • Assistants API for integration
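As a sketch of what "streaming plus an embeddable API" means in practice, the snippet below reassembles a reply from SSE-style `data:` lines, the common pattern for streamed chat responses. The `{"delta": ...}` payload shape and the `[DONE]` sentinel are illustrative conventions, not Gustave's documented wire format.

```python
import json

def assemble_stream(sse_lines):
    """Reassemble a streamed chat reply from SSE-style 'data:' lines.

    Assumes each data payload carries a {"delta": "..."} text fragment
    and a literal [DONE] sentinel ends the stream (both hypothetical).
    """
    parts = []
    for line in sse_lines:
        if not line.startswith("data:"):
            continue  # skip comments and keep-alive lines
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break
        parts.append(json.loads(payload)["delta"])
    return "".join(parts)

# Simulated stream, as a server might emit it chunk by chunk:
stream = [
    'data: {"delta": "Order #4821 "}',
    'data: {"delta": "shipped on Feb 18."}',
    "data: [DONE]",
]
print(assemble_stream(stream))  # → Order #4821 shipped on Feb 18.
```

An embedding client would do exactly this over a long-lived HTTP response instead of a list, rendering each delta as it arrives.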
Gustave Knowledge Studio

Your assistants are only as good as what they know. Knowledge Studio turns your documents into intelligent, searchable knowledge bases with proper chunking, embeddings, and vector stores — so your RAG pipeline delivers real answers, not hallucinations.

Knowledge Studio

From document ingestion to semantic search in one place. Full control over how your data is chunked, embedded, and retrieved — because RAG is a discipline, not a checkbox.

  • Document collections & file management
  • Configurable chunking strategies
  • Multiple embedding providers
  • Vector store creation & indexing
  • Semantic search API
  • Version-controlled knowledge bases

Semantic Search

Query your knowledge bases using natural language. Results are ranked by relevance, scoped to projects, and ready to feed into RAG pipelines or standalone search interfaces.

  • Natural language queries
  • Project-scoped search
  • Relevance-ranked results
  • Multi-store search
  • REST API for integration
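To make "project-scoped, relevance-ranked" concrete, here is a minimal sketch of what a client does with search hits. The field names (`project`, `score`, `text`) stand in for whatever the real REST response uses; they are assumptions for illustration.

```python
def search(results, project, top_k=3):
    """Scope semantic-search hits to one project and rank by score.

    'results' mimics a search endpoint's response; field names are
    illustrative, not Gustave's actual schema.
    """
    scoped = [r for r in results if r["project"] == project]
    return sorted(scoped, key=lambda r: r["score"], reverse=True)[:top_k]

hits = [
    {"project": "docs", "score": 0.91, "text": "Returns are accepted within 30 days."},
    {"project": "hr",   "score": 0.97, "text": "PTO accrues monthly."},
    {"project": "docs", "score": 0.78, "text": "Refunds post in 5-7 business days."},
]
for hit in search(hits, project="docs"):
    print(f'{hit["score"]:.2f}  {hit["text"]}')
```

Note the higher-scoring HR hit is excluded: project scoping filters before ranking, which is what keeps tenant and team data out of each other's results.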
Gustave MCP Studio

Knowledge makes your assistants smart. Tools make them capable. MCP Studio is where you give assistants the power to act — paste an OpenAPI spec, hot-load plugins, scope tools per assistant, and expose everything through the Model Context Protocol. One control plane for every integration.

MCP Gateway & Plugins

Hot-load plugins to extend your AI with custom tools. Query databases, call APIs, create tickets — or wrap multi-step business workflows into single deterministic tools.

  • Plugin architecture with hot-loading
  • Native MCP protocol support
  • Built-in auth (Bearer, API Key, Basic)
  • Dynamic configuration & tool bindings
  • Health monitoring & audit logging

API Catalog

Import any OpenAPI specification and automatically expose it as MCP-compatible tools. Turn your existing REST APIs into AI capabilities without writing a single plugin.

  • OpenAPI spec import
  • Automatic MCP endpoint generation
  • Plugin registry for sharing
  • Per-configuration access controls
  • WebSocket real-time notifications
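The core of an OpenAPI import is a mechanical walk over the spec's `paths`: each operation becomes one tool descriptor. The sketch below shows that mapping under simplified assumptions (real generators also translate parameter schemas, request bodies, and auth).

```python
def openapi_to_tools(spec):
    """Turn each OpenAPI operation into a tool descriptor.

    A minimal sketch of the import step: operationId becomes the tool
    name, the summary becomes its description, and the HTTP binding is
    kept so a gateway can execute calls on the model's behalf.
    """
    tools = []
    for path, methods in spec.get("paths", {}).items():
        for method, op in methods.items():
            tools.append({
                "name": op.get("operationId", f"{method}_{path}"),
                "description": op.get("summary", ""),
                "http": {"method": method.upper(), "path": path},
            })
    return tools

spec = {
    "openapi": "3.0.0",
    "paths": {
        "/orders/{id}": {
            "get": {"operationId": "getOrder", "summary": "Look up an order"},
        },
        "/tickets": {
            "post": {"operationId": "createTicket", "summary": "Open a support ticket"},
        },
    },
}
for tool in openapi_to_tools(spec):
    print(tool["name"], tool["http"])
```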
“Like the great concierges of Europe's grand hotels — orchestrating everything behind the scenes so the experience is flawless. That's what Gustave does for enterprise AI.”
Infrastructure that works, so you can focus on what matters.

From zero to production assistant
in five steps.

Compose, don't code. Gustave gives you modular building blocks that snap together into production-grade assistants.
Step 1 · Assistants Studio

Connect Your AI Providers

Register one or more LLM providers. Switch between them at any time — your assistant configuration stays the same. Credentials are stored securely and never leave your environment.

  • OpenAI, Azure, AWS Bedrock, Anthropic, Ollama
  • Secure credential management
  • Switch providers without rewriting anything
  • Run local models for air-gapped environments
OpenAI · Azure · Bedrock · Anthropic · Ollama · More
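The pattern behind "switch providers without rewriting anything" is a registry that isolates the provider choice to a single field. The client classes and config keys below are stand-ins for illustration, not Gustave's real adapters.

```python
# Hypothetical provider registry: the assistant config names a provider,
# and only that one field changes when you switch.
class OllamaClient:
    def complete(self, prompt):
        return f"[ollama] {prompt}"

class BedrockClient:
    def complete(self, prompt):
        return f"[bedrock] {prompt}"

PROVIDERS = {"ollama": OllamaClient, "bedrock": BedrockClient}

assistant_config = {"provider": "ollama", "system_prompt": "You are a support agent."}

def run(config, prompt):
    # The only provider-specific step; prompts, skills, and tools
    # in the config are untouched by a provider swap.
    client = PROVIDERS[config["provider"]]()
    return client.complete(prompt)

print(run(assistant_config, "Where is order #4821?"))
assistant_config["provider"] = "bedrock"   # a one-field change
print(run(assistant_config, "Where is order #4821?"))
```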
Step 2 · Knowledge Studio

Build Your Knowledge Base

Upload documents, organize them into collections, and transform them into searchable knowledge bases with proper chunking strategies — not just document dumps.

  • Support for multiple file formats
  • Intelligent chunking per content type
  • Extensible for additional formats
  • Versioned with activation controls
Knowledge Studio
Collection: "Product Documentation"
Documents: 47 files
Chunking: "By section headings"
Version: v3 ● Active
 
Last indexed: 2 hours ago
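A simple version of the "By section headings" strategy shown above can be sketched in a few lines: split on markdown headings and keep each heading with its body, so every retrieved chunk stays self-describing. Real chunkers add size limits and overlap; this is the core idea only.

```python
import re

def chunk_by_headings(markdown):
    """Split a markdown document into one chunk per section heading.

    Each chunk retains its heading line, so a retrieved chunk carries
    its own context into the RAG prompt.
    """
    chunks, current = [], []
    for line in markdown.splitlines():
        if re.match(r"^#{1,6} ", line) and current:
            chunks.append("\n".join(current).strip())
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current).strip())
    return chunks

doc = """# Returns
Items may be returned within 30 days.

# Shipping
Orders ship within 2 business days."""
for chunk in chunk_by_headings(doc):
    print(chunk.splitlines()[0])  # → # Returns, then # Shipping
```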
Step 3 · Assistants Studio

Author Your Prompts

Write system prompts that define behavior and personality. Create skills that enrich context automatically. Organize with tags for team-wide discovery and reuse.

  • System prompts for assistant behavior
  • Skills for automated context enrichment
  • Tag-based organization
Assistants Studio
Name: "Customer Support Agent"
Tags: support, customer-facing
 
You are a helpful customer support
agent. You have access to product
documentation and can look up
order status via tools...
Step 4 · MCP Studio

Connect Tools via MCP

Give your assistants real capabilities through the Model Context Protocol (MCP) — the emerging open standard for AI tool integration. Query databases, call internal APIs, create tickets, search the web — anything exposed through an MCP server becomes a tool your assistant can use.

  • Native MCP support — STDIO and HTTP servers
  • Create reusable tool templates, deploy across environments
  • Built-in auth (Basic, Bearer, API Key) per server
  • Standards-based — no proprietary lock-in
MCP Studio
MCP Servers (3 active)
 
Order Lookup · HTTP · Bearer Auth
Inventory Search · STDIO · Local
Ticket Creator · HTTP · API Key
 
Protocol: Model Context Protocol (MCP)
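Under the hood, MCP is JSON-RPC 2.0: a client invokes a tool with a `tools/call` request naming the tool and its arguments. The request shape below follows the MCP specification; the tool name and arguments are illustrative.

```python
import json

def mcp_tool_call(request_id, tool, arguments):
    """Build an MCP tools/call request (JSON-RPC 2.0).

    The server runs the named tool with the given arguments and
    replies with result content blocks.
    """
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }

req = mcp_tool_call(1, "order_lookup", {"order_id": "4821"})
print(json.dumps(req, indent=2))
```

Because the envelope is standard, the same request works against a local STDIO server or a remote HTTP one, which is what makes the protocol lock-in-free.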
Step 5 · Gustave Chat

Publish & Chat

Hit publish. Your assistant is immediately live with real-time streaming, rich formatting, and tool calling. Every publish creates an immutable version you can track and roll back.

  • Version snapshots on every publish
  • Real-time streaming responses
  • Rich markdown rendering
  • Full conversation history
Gustave Chat
What's the status of order #4821?
 
⚙ Looking up order...
 
Order #4821 shipped on Feb 18. Tracking: UPS 1Z999... Expected delivery: Feb 22.
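One common way to implement immutable version snapshots is content addressing: hash the canonical form of the configuration, so identical configs get identical version ids and any edit yields a new one. This is a sketch of the idea, not Gustave's actual versioning scheme; storage and rollback are out of scope.

```python
import hashlib
import json

def snapshot(assistant):
    """Derive a content-addressed version id for an assistant config.

    Canonical JSON (sorted keys) makes the hash deterministic, so the
    id depends only on content, never on when or where it was built.
    """
    canonical = json.dumps(assistant, sort_keys=True).encode()
    return hashlib.sha256(canonical).hexdigest()[:12]

v1 = {"name": "Support Agent", "prompt": "You are a helpful support agent."}
v2 = {"name": "Support Agent", "prompt": "You are a concise support agent."}

print(snapshot(v1))
print(snapshot(v2))
assert snapshot(v1) != snapshot(v2)        # any edit produces a new version
assert snapshot(v1) == snapshot(dict(v1))  # same content, same version id
```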

Everything you need.
Nothing you don't.

Built-in infrastructure that enterprises actually need. No bolt-on afterthoughts.

On-Premises Deployment

Runs entirely within your infrastructure. Your data never leaves your environment. Compatible with air-gapped networks and strict compliance requirements.

Multi-Tenancy

Complete data isolation per tenant. One deployment can serve multiple teams or organizations with full separation and independent configuration.

SSO & Authentication

Enterprise single sign-on integration. Users are provisioned automatically on first login — no manual setup required.

Fine-Grained Access Control

Role-based permissions across all platform capabilities. Control who can create assistants, upload files, manage tools, or access specific features.

Version Control

Immutable version snapshots for assistants and knowledge bases. Track changes, compare versions, roll back with confidence. Full audit trail.

Independent Scaling

Each studio runs as an independent service. Scale Chat, Knowledge Studio, and MCP Studio separately based on workload. Deploy only what you need.

Ready to deploy AI
on your terms?

See what production-ready, on-premises AI infrastructure looks like. Deploy your first assistant in minutes.