Knowledge Optimizer
The Knowledge Optimizer is the hub for keeping your chatbot accurate. Instead of one tool, it bundles five tabs that each tackle a different part of answer quality — from generating test questions to mining gaps out of real visitor conversations, auditing your knowledge base for contradictions, rewriting the bot's role, and comparing AI models head to head.
Pick a chatbot at the top of the page; that selection is shared across every tab. Each tab runs its own analysis and, where relevant, drops improvement suggestions into the shared Optimization Inbox for you to review and apply.
The Tabs at a Glance
| Tab | What it does | Touches your quota |
|---|---|---|
| Knowledge Test | Auto-generates test questions, runs them against your bot and AI-evaluates the answers. | Yes (chat messages) |
| Optimize | Mines knowledge gaps from real past conversations and unanswered questions. | No |
| Role | Proposes an improved system prompt and proves it with an A/B test. | A/B test only |
| Model Arena | Runs the same questions against 2–3 AI models and compares quality vs. speed. | Yes (chat messages) |
| Audit | Scans the knowledge base for duplicates, contradictions and thin content. | No |
Optimize Tab — Mine Gaps From Real Conversations
The Optimize tab finds what your bot actually failed to answer, using real visitor history rather than guesses. It pulls in unanswered and collected questions from the selected time window, groups them by topic, and generates editable content suggestions to fill each gap.
How to run it:
- Choose a window: Last 30 days or Last 90 days
- Click Analyze
- The tool collects unanswered questions, clusters them by topic, then analyzes each cluster and drafts content
Each run uses up to 10 AI analyses and consumes no chat messages. Results land in the Optimization Inbox below as Knowledge gap suggestions. A suggestion that came from several real questions shows how many occurrences it represents, so you can prioritize the most common gaps first.
If the window has no unanswered questions, the tab tells you there's nothing to optimize right now.
Audit Tab — Find Problems in Your Knowledge Base
The Audit tab checks the content you already have, not what's missing. It scans your indexed sources for three problems and proposes fixes:
| Finding | Why it matters |
|---|---|
| Contradiction | Two sources say different things — the bot may answer the same question differently each time. This is the most dangerous case. |
| Duplicate | Near-identical chunks that add noise without adding information. |
| Thin content | A source with too few characters to produce useful answers. |
How to run it:
- Click Run audit
- The tool scans your knowledge base, then asks an AI judge to check the suspicious pairs it found
- Findings appear in a review list with the affected excerpts side by side
Review the findings, keep the ones worth acting on (each has a checkbox), and click Save to inbox. Where an automatic rewrite is possible, the finding is marked Auto-fix available and you can apply the corrected text directly from the inbox. For crawled or synced sources, the suggestion only flags the issue — you fix it at the origin. If nothing is wrong, the audit reports a healthy knowledge base.
Role Tab — Tune the Bot's System Prompt
The Role tab improves your chatbot's role (its system prompt). It reviews failed test questions and known knowledge gaps, then proposes a rewritten role aimed at better answers.
How to run it:
- Click Propose improved role
- Compare the current role and the proposed role side by side — the proposed text is editable, and a rationale explains what changed and why
- Validate it with an A/B test: choose a question source (generate 5 fresh test questions or reuse the questions from your last test run) and click Run A/B test
- The same questions are asked with both roles, an AI judge picks the better answer per question, and you get a win rate plus a per-question breakdown
- If the new role wins, click Apply role to make it live
The A/B test asks each question twice (current vs. proposed), so it uses roughly 20 test chat messages from your monthly quota. Applying the role requires the chatbots permission.
Model Arena Tab — Compare AI Models
The Model Arena runs the same set of questions against 2–3 AI models in parallel and compares them on answer quality (scored by an AI judge) and median response time. Use it to decide whether a faster or cheaper model holds up on your real content.
How to run it:
- Select 2–3 models to compare (your current model is marked)
- Pick a question source (generate fresh questions or reuse the last run)
- Click Start comparison
- Review the results table — quality percentage, median latency, and a recommendation for the best balance of the two
- Expand any question to see each model's actual answer
- Click Use for this bot on a model to switch to it
This tab uses test chat messages (counted toward your monthly quota) and applying a model requires the chatbots permission.
Knowledge Test Tab — Auto-Test Your Knowledge Base
The Knowledge Test tab is the classic Knowledge Tester. It automatically generates realistic test questions, runs them against your chatbot, evaluates the answers with AI, and helps you fix any gaps.
Test Modes
| Mode | Description | Questions |
|---|---|---|
| Auto Test | AI generates test questions directly from your knowledge base content. Questions cover different topics and difficulty levels, with a mix of short (3–5 word) and longer phrasings. | 3–15 questions |
| Custom Question | Enter a specific question you want to test. The AI generates alternative phrasings (variants) to test the same topic from different angles. | Your question + 2–14 variants |
Auto Test is best for a broad health check of your whole knowledge base. Custom Question is ideal for checking whether a specific topic or recently added content is answered correctly.
Running a Test
- Choose the test mode: Auto Test or Custom Question
- Set the number of questions (Auto Test) or variants (Custom Question)
- For Custom Question mode: enter your question
- Click Start Test
The system then runs three steps automatically:
| Step | What Happens |
|---|---|
| Generate | AI creates test questions (Auto Test) or question variants (Custom Question). |
| Run | Each question is sent to your chatbot; the answer and any source references are collected. |
| Evaluate | AI judges each answer against your knowledge base content and classifies it. |
Evaluation Criteria
| Classification | Description |
|---|---|
| Answered | Correct, helpful, and covers the core question. Minor omissions of edge-case details are fine. |
| Partial | Thematically relevant but missing essential information a user would expect. |
| Not Answered | No answer, completely irrelevant, or fundamentally incorrect. |
For Partial and Not Answered results, the AI explains what's missing or wrong. Use those recommendations to guide your fixes.
Results Dashboard
After a test completes, four cards summarize it: Answered, Partial, Not Answered, and an overall Score (answered ÷ total) with a color-coded bar:
- Green (80%+) — your knowledge base covers the tested topics well
- Amber (50–79%) — some gaps to address
- Red (below 50%) — significant gaps need attention
Each tested question appears as a result card with the question, an evaluation badge, the bot's full answer, a recommendation (for Partial/Not Answered), and action buttons: Fix with AI, Try again, and Find Source.
Fix with AI
Fix with AI is the main way to resolve gaps found during testing. It analyzes all failed questions, searches your existing knowledge base for context, and generates content to fill the gaps.
- Click Fix with AI on a Partial or Not Answered result (or on the results summary)
- The system gathers all failed questions and pulls related content from your knowledge base
- AI drafts text covering the missing information, using concrete data from your sources where available
- Review and edit the draft; optionally set a document name (left empty, one is auto-generated)
- Click Add as Data Source to save it as a new text data source
After saving, you're prompted to verify the fix: Retest all failed, Retest first (a quick single-question check), or Later. The generated content shows up in your chatbot's Data Sources tab, where it stays editable.
Find Source
Find Source locates which existing data sources are relevant to a question — useful when you'd rather edit an existing source than add new content.
- Click Find Source on a result card
- The system searches your knowledge base using vector similarity (with text search as a fallback)
- Up to 10 matches are shown, each with source name, type, a relevance score and a 500-character content preview
From there you can Edit a text source directly or Open in Dashboard to jump to that source in the Data Sources tab.
Retrying & History
- Try again re-runs a single question and updates its answer, evaluation and the overall score in place.
- Retest all failed (after Fix with AI) re-runs every Partial and Not Answered question in one batch.
- The Test History section lists your last 20 test runs per chatbot — score, mode, chatbot and date. Click any entry to reopen its full results.
Limits
| Constraint | Value |
|---|---|
| Trial users | Max 3 tests per chatbot per day |
| Fix generation | Max 20 requests per hour per user |
| Question timeout | 30 seconds per question |
| Global test timeout | 120 seconds per test run |
| Auto Test questions | 3–15 |
| Custom Question variants | 2–14 (plus your original question) |
| Test history | Last 20 runs stored |
The Optimization Inbox
The Optimization Inbox sits below the Optimize and Audit tabs and collects the AI's improvement suggestions in one place. Each suggestion has a type — Knowledge gap, Contradiction, Duplicate, Thin content or Role proposal — and a status:
- Open — waiting for your decision
- Accepted — applied to your chatbot
- Dismissed — set aside (you can reopen it later)
Click a suggestion to open its detail view. Depending on the type you'll see the real questions behind a gap, the affected source excerpts, or a current-vs-proposed comparison. Where the suggestion includes editable content, you can tweak it before applying. Click Add to knowledge base (or Apply rewrite / Apply role, depending on type) to apply it, or Dismiss to skip it.
Best Practices
- Start with Optimize — real conversations show you the gaps that actually cost you answers
- Run Audit after big imports — catch contradictions before visitors hit them
- Use Knowledge Test to verify — after adding content, confirm the bot now answers correctly
- A/B test role changes — don't ship a new prompt on a hunch; the Role tab proves the lift
- Try Model Arena before switching models — make sure a faster model still answers well on your data
- Cross-check with the Questions and Analytics dashboards — watch the Unanswered metric drop as you close gaps
