The Knowledge Base feature allows you to upload and connect your data sources and ask questions about your documents.
Getting startedUpload your filesAI Agent Knowledge Base AccessUse in chatConnect Your Own Knowledge BaseCompare with Training FilesLimitationsFrequently Asked Questions
Getting started
Retrieval-Augmented Generation (RAG) isΒ a technique that enhances Large Language Models (LLMs) by allowing them to access and incorporate external knowledge sources, like databases or documents, to generate more accurate and contextually relevant responses.Β
TypingMind supports a built-in RAG system with the Knowledge Base feature. You can upload files to TypingMind and then allow AI agents to access these files and get more context to answer your questions more accurately during the conversation.
Upload your files
- Open TypingMind, then click the βKBβ (Knowledge Base) tab.
- Click Add Data Source and start uploading your files.
- Currently supported files are: Text files, PDF, CSV, Excel, Word. We will continue to support more file types and connect other data sources in the future.
You can tag your documents to be used by AI agents later. This helps you split your KB into different topics/categories that can later be used by different AI agents.
AI Agent Knowledge Base Access
Knowledge Base can only be accessed via AI agent, so when you chat with a model directly, it cannot access the Knowledge Base.
Once the data is uploaded, assign the KB access to your AI agents by going to Agents β Edit or create an AI agent β Knowledge Base Access
By default, AI agents don't have access to Knowledge Base. You can set the AI agent to be able to:
- Access all data in the knowledge base
- Access only documents with specific tags
Note:
- If you set multiple tags, the agent will have access to documents that have any of the tags assigned (OR condition used for tags matching).
- Tags are identified by their name. If you later change the tag of the documents from KB, you need to update it again in the AI agent setting.
Use in chat
- Start a new chat with the AI agent that has access to the knowledge base.
- Make sure to turn on the Search Knowledge Base plugin. You can install it from the official Plugin Store if you haven't already.
During your conversation, the AI agent can lookup information from Knowledge Base like this:
Connect Your Own Knowledge Base
RAG has been supported by TypingMind for a long time. Knowledge Base is just a built-in integration to make it easier for users.
You can connect TypingMind with your own knowledge base by doing the following:
- Use a plugin (or create a new one): Provide a way for the AI agents to look up additional information when needed
- Use Dynamic Context: This provides AI agents with context about what kind of data they have access to, so if they need to look up additional information, they can trigger a plugin. This will improve retrieval performance and accuracy. Dynamic Context can also be embedded directly into the plugin with Plugin Context (you will see this section when creating new plugins in TypingMind).
To decide if you want to use TypingMind's built-in Knowledge Base system or connect your own, here is a quick pros and cons comparison:
Use TypingMindβs Knowledge Base | Connect your own Knowledge Base | |
Setup complexity | No setup required, just upload files, set tags, and use. | Complicated. Requires technical knowledge to set up Dynamic Context, probably need to build your own plugin. |
Vector Embedding | All done by TypingMind Cloud and hidden away from the users. You cannot change the technical details like embedding models, dimensions, chunk size, query strategy, etc. | Customize however you want on your own system, then connect your system to TypingMind via Plugin or Dynamic Context. |
Manage data | Easily done via TypingMindβs powerful interface. Upload or connect data easily. Data automatically kept up-to-date with daily refresh (for external sources like Notion, Google Drive). | Need to host your data on an external system separately, keep it maintained and up-to-date. |
Price | Free with some limits, then upgrade for more. | Totally free and you pay for your own cost of server, database, embedding generations, etc. |
Document category / separation by AI agent | Allow AI agent to access a subset of documents in the knowledge base. Use tags to categorize documents and assign different tags access to different AI agent. Select tags directly from TypingMindβs UI. | Not supported, youβll need to implement this by separating at the plugin level. |
Advanced customization | Leverage TypingMindβs optimizations for better parse quality and retrieval results. | Implement your own optimizations. |
Compare with Training Files
Knowledge Base and Training Files are two different ways to add context to an AI agent. We recommend experimenting with both options to achieve the best result.
For your reference, here are some key differences:
Aspect | Training Files | Knowledge Base |
Integration method | Directly embedded into the system prompt | Retrieves data dynamically using RAG |
Context relevance | Highly relevant answers based on full context | Answers depend on the effectiveness of retrieval |
Token consumption | High, as full context is loaded | Low, as only relevant data is retrieved |
Data volume | Limited by the model's context window | Supports large datasets |
Setup complexity | Simple | More complex |
Cost | Higher, due to token usage | Lower, as fewer tokens are consumed |
For a full comparison, see this page:
Training Files and Knowledge Base: Whatβs the differences? Limitations
KB is available for free for all licensed users with the following limitations:
- The only supported data source currently is File Upload. We are working to support Google Drive, Notion, GitHub, and more data sources soon.
- Supported file types: Text files, PDF, CSV, Excel, Word
- KB limits the number of characters in your files based on your TypingMind Cloud plan. Free accounts can only upload up to 100,000 characters; the 1GB plan includes an additional 1,000,000 characters; the 5GB plan includes an additional 5,000,000 characters.
Note that these limitations can change in the future.
Frequently Asked Questions
What are the underlying embedding models and techniques?
We use the best embedding models and chunking techniques, combined with our own private methods to ensure the best retrieval quality possible. We don't publish the details on the underlying models, chunk size, embedding dimensions, etc. because it can be outdated very quickly as we upgrade the system continuously.
What types of files are supported?
Text files, PDF, CSV, Excel, Word, and we are adding more soon.
How much can I upload or connect?
If you are on the free plan, you can upload up to 100,000 characters, counted by the exact characters extracted from your documents, not the file size. You can upgrade your TypingMind Cloud account to unlock higher limits.
Is there a plan to add connections to external sources like Google Drive, Box, Notion, etc.?
Yes! We will add them very soon.
Is this the same system used in the TypingMind Team (Custom) version?
It is mostly the same, but for the TypingMind Team version we have some other processing logic and optimizations for better use in team contexts.