Knowledge Base

Knowledge Base

The knowledge base is how your AI assistant learns about your product. Feed it your website content, documents, or raw text β€” and it becomes an expert on your business.


How It Works

When a user asks a question, the SDK sends the message to the Produck backend which performs vector search against your knowledge base to find relevant content. This content is injected into the AI's context, enabling accurate, grounded answers.

User Question β†’ Vector Search β†’ Relevant Chunks β†’ AI Response

Adding Sources

Via Dashboard

  1. Go to SDK Projects β†’ select your project
  2. Navigate to the Knowledge Base tab
  3. Click "Add Source"

Source Types

TypeDescriptionBest For
Website URLCrawls and indexes a web pageProduct pages, docs, FAQs
SitemapCrawls all pages in a sitemapEntire documentation sites
DocumentUpload PDF, DOCX, or TXT filesInternal docs, guides
Raw TextPaste text directlyQuick knowledge snippets

Website Crawling

When you provide a URL, Produck uses Firecrawl (opens in a new tab) to:

  1. Fetch and render the page (including JS-rendered content)
  2. Extract clean text content
  3. Split into chunks (~500 tokens each)
  4. Generate vector embeddings
  5. Store in Qdrant vector database
URL β†’ Firecrawl β†’ Clean Text β†’ Chunking β†’ Embeddings β†’ Qdrant

πŸ’‘ Tip: Add your most important pages first β€” pricing, features, FAQ, and docs.


Retrieval

When the AI receives a user message, it:

  1. Generates an embedding of the user's question
  2. Searches Qdrant for the top-k most similar chunks
  3. Includes those chunks as context in the AI prompt
  4. Generates a response grounded in your actual content

This means the AI will never hallucinate about your product β€” it only uses information from your knowledge base.

Relevance Scoring

Each retrieved chunk has a similarity score (0–1). Only chunks above the relevance threshold are used. This prevents irrelevant content from polluting the AI's context.


Best Practices

What to Include

  • βœ… Pricing pages and plan details
  • βœ… Feature descriptions
  • βœ… FAQ and help center articles
  • βœ… API documentation
  • βœ… Getting started guides
  • βœ… Terms of service, refund policies

What to Avoid

  • ❌ Extremely long pages (split into sections)
  • ❌ Pages behind authentication (the crawler can't access them)
  • ❌ Frequently changing data (use API schema for real-time data)

Content Quality

The AI is only as good as the content you feed it. Ensure your knowledge base sources are:

  • Accurate β€” up to date with current product info
  • Clear β€” well-written, unambiguous text
  • Complete β€” cover the topics users ask about most

Limits by Plan

PlanKnowledge Base Sources
Starter2
Pro10
Business50
EnterpriseUnlimited

Next Steps

  • API Schema β€” Let the AI execute actions on your behalf
  • DOM Context β€” Let the AI read the current page
  • Chat Widget β€” Customize the chat interface