How to Train a Custom ChatGPT on Your Website Data Using Chatbase
You have a website full of product details, pricing information, FAQ answers, and support documentation. Your visitors have questions. And you are tired of answering the same "What are your hours?" and "How do I return this?" questions every single day. The solution is an AI chatbot trained directly on your website content. Not a generic bot that guesses. A custom assistant that answers from your actual data.
Chatbase is the platform that makes this possible without developers. Upload your website URL, train the AI on your content, and embed a chat widget on your site in under 30 minutes. No subscription required for people using your chatbot. No monthly fees for visitors. This tutorial walks through every step. Start your Chatbase free trial here.
Why Train Your Own AI Chatbot Instead of Using Custom GPTs?
OpenAI's Custom GPTs are powerful but come with a critical limitation for businesses. When you build a Custom GPT and want to share it with clients, customers, or the public, every person who uses it needs a ChatGPT Plus subscription. That is $20/month per user. For a business with hundreds of customers asking questions, this is impossible.
Chatbase solves this problem completely. You build and host the assistant. Anyone can use it through a direct link, an embed on your website, or integrations with WhatsApp, Slack, and Messenger. No subscription required on their end. No paywall. Your customers just ask questions and get answers.
The other advantage is control. Custom GPTs live inside OpenAI's ecosystem. Chatbase lives on your website, under your brand, with your colors, your logo, your domain. The visitor never leaves your site.
The Golden Rule: Garbage In, Garbage Out
Before touching any settings or writing prompts, understand this: good data carries more weight than a clever prompt. Most people obsess over instruction wording first. When the source material is messy, outdated, or vague, the AI gives messy answers regardless of how polished the prompt is.
Cleaning up your data fixes what hours of prompt tweaking never could. Review your website content before training. Remove outdated pricing. Update old product descriptions. Fix broken FAQ answers. The AI is only as good as what you feed it.
Step-by-Step: Train Your AI Agent on Website Data
Step 1: Create Your Chatbase Account and Start a New Agent
Go to Chatbase.co and sign up with Google or email. The free tier gives you one chatbot with 50 message credits per month, enough to test the platform and build a proof of concept. Once logged in, click "New AI Agent" from the dashboard.
Step 2: Choose Your Data Source — Website as the Primary Source
Chatbase offers multiple training sources. For a website chatbot, the quickest method is the Website option. Paste your full website URL into the input field, including the https:// prefix. Click "Fetch links" to allow Chatbase to crawl your website.
The crawler automatically extracts information from your pages, including services, pricing, FAQ content, product descriptions, and support documentation. This creates the foundational knowledge base for your new agent. The crawl typically takes 2-5 minutes depending on site size.
Alternative data sources: PDF and document upload for existing training materials, Notion pages for teams already using Notion for documentation, custom Q&A pairs for specific questions you want answered the same way every time, and text paste for quick entry of short content.
Step 3: Review and Refine Your Data Before Training
After the crawl completes, review what was captured. Check for missing pages, outdated information, or irrelevant content that should be excluded. The most common mistake is training the AI on everything without curation. If your website has blog posts from three years ago with outdated product information, the AI will learn from that too.
Key tip: Custom Q&A pairs are remarkably useful for questions you want answered consistently every time. Define the answer once, and the assistant sticks to it. No hallucinations, no improvising. This is also the fastest way to handle edge cases you spot after launch without rebuilding the entire knowledge base.
Step 4: Write Instructions That Give the AI a Real Identity
Once your data is solid, writing instructions determines how the AI behaves. A few specific practices make the biggest difference in response quality.
Be specific about scope. Not just "you are a helpful assistant" but exactly what this assistant handles, what it does not handle, and what tone it uses. Example: "You are a customer support agent for Acme Company. You answer questions about products, pricing, shipping, and returns. You do not answer questions about competitors, investments, or topics outside Acme. Your tone is friendly, professional, and concise." The more specific, the less the AI wanders.
Set temperature low for accuracy. Higher temperature settings make the AI more creative but also cause it to fill gaps too confidently, adding details that were not in the source material. Lower settings keep responses grounded in the actual data. For customer support and factual Q&A, keep temperature between 0.1 and 0.3.
Define fallback behavior. Tell the AI what to say when it does not know an answer. Example: "If you cannot find the answer in the knowledge base, say 'I don't have that information yet. Please contact support@example.com for help.'" This prevents the AI from inventing answers when it lacks information.
Add suggested questions. Without starter prompts, people open the chat and freeze. With them, users immediately understand what the assistant does and start asking better questions from the first message. Include 3-5 starter questions visible in the chat window.
Step 5: Test Your Agent in the Playground
After training and instruction configuration, navigate to the "Playground" or testing environment. This is a sandbox where you can test your agent's responses before making it live.
Ask questions that a typical visitor might ask. Examples: "What services do you offer?" "How much does this cost?" "What is your return policy?" "Do you ship internationally?" See how the AI replies. If an answer is wrong, check whether your source data contains the correct information. If the tone is off, adjust the system prompt.
The Playground also shows you how the AI handles edge cases. Ask questions that are slightly outside your scope to see if the fallback behavior works correctly. Testing before launch saves you from embarrassing moments in front of real users.
Step 6: Configure Lead Collection (Optional but Powerful)
Your AI agent can capture leads while answering questions. Navigate to the "Actions" tab and click "Create action." Scroll down and select the "Collect leads" option.
Configure a form that appears during conversation. Enable fields for Name, Email, and Phone Number. Make them required or optional as needed. In the "Messages" section, customize the success message users see after submitting their information. Once configured, click "Enable" at the top right to activate lead capture.
When a visitor asks a question, the AI can naturally transition to collecting their contact information before providing detailed answers or sending follow-up materials.
Step 7: Automate Leads with Zapier (Optional)
To complete your workflow, automatically send collected leads to Google Sheets, your CRM, or email marketing platforms through Zapier integration.
In a new Zapier workflow, set your trigger to "Chatbase" with the event "Form Submission." Provide your unique Chatbot ID, found in your agent's "Settings" tab. For the action, choose your application such as "Google Sheets" and select "Create Spreadsheet Row." Map the name, email, and phone number data from Chatbase directly into corresponding columns. This creates a fully automated lead management system.
Step 8: Embed the Agent on Your Website
Once satisfied with performance, go to the "Connect" tab to find embedding options.
Chat bubble embed (recommended): Copy the code snippet and paste it into your website's HTML before the closing body tag. This creates a seamless pop-up chat bubble in the corner of your site. Most website builders including WordPress, Wix, Squarespace, Shopify, and Framer support this through custom code injection or dedicated plugin options.
Direct iframe embed: Place the agent within a specific section of a page, such as a dedicated support page or FAQ section. Copy the iframe code and paste it where you want the chat window to appear.
Shareable link: Share a direct URL to the standalone chat interface via email, social media, or SMS. No embedding required. The link works on any device with a browser.
Step 9: Monitor, Iterate, and Improve
After launch, the real work begins. Chatbase shows real conversation data in the "Activity" tab. Review this regularly. It reveals exactly where the assistant is weak and what content needs improvement.
Watch for patterns. If multiple users ask the same question that the AI answered poorly, update your knowledge base or adjust the prompt. If users abandon conversations at a specific point, investigate what happened in that exchange.
Add custom Q&A pairs for gaps. When you spot an edge case after launch, add it as a custom Q&A pair. This is faster than rebuilding the entire knowledge base and ensures consistency for that specific question going forward.
Pricing and Plans for Your AI Agent
| Plan | Monthly Price | Message Credits | Best For |
|---|---|---|---|
| Free | $0 | 50/month | Testing and evaluation |
| Hobby | $19/month or $190/year | 2,000/month | Small websites, startups |
| Standard | $99/month or $990/year | 10,000/month | Growing businesses, e-commerce |
| Unlimited | $399/month or $3,990/year | 40,000/month | High-volume support teams |
Annual billing saves approximately 17% with two free months. Add-ons include extra message credits at $15 per 1,000 credits, custom domains at $199/month, and white labeling to remove "Powered by Chatbase" branding. Start with the free plan.
Pro Tips for Better Results
Spend more time on data than prompting
If there is one rule to follow, it is this one. Most people do it backwards. Clean, organized, up-to-date data produces accurate answers. Messy data produces messy answers regardless of prompt quality. Review your website content before training.
Use custom Q&A pairs for critical answers
For anything that must always be consistent — return policies, pricing disclaimers, legal information — do not leave it to the AI's interpretation. Lock it in with custom Q&A pairs. This removes hallucination risk entirely for those specific questions.
Watch what people actually ask after launch
The conversation data in Chatbase is more useful than guessing upfront. It reveals exactly where the assistant is weak and what content needs improving. Review weekly and update your knowledge base accordingly.
Design for distribution from the start
If your assistant only works on your own account, it is a personal tool, not a real assistant. Chatbase allows sharing via direct link, embed code, Slack, WhatsApp, and Messenger. Use these channels to put your AI where your customers already are.
Frequently Asked Questions
Do people using my chatbot need a subscription?
No. This is the main advantage over Custom GPTs. You build and host the assistant. Anyone can use it through a direct link, website embed, or messaging app integration. No ChatGPT Plus subscription required, no paywall, no friction for your customers.
How accurate is the AI when trained on my website?
Accuracy depends entirely on your source content. Clean, organized, up-to-date website content produces highly accurate answers. The GPT-4 model combined with your specific data creates responses that are consistently reliable for customer support, sales, and FAQ applications.
What file formats does Chatbase support?
PDF, DOCX, DOC, TXT, website URLs, Notion pages, pasted text, and custom Q&A pairs. For Google Docs, export them as PDFs before uploading.
Can the AI answer in multiple languages?
Yes. Chatbase supports 95+ languages. The AI detects the user's language and responds accordingly, based on your knowledge base content. If your website has multilingual content, the AI will answer in the language asked.
Does Chatbase offer human handoff?
Yes. You can configure the AI to escalate conversations to human agents. Chatbase includes a built-in inbox where human agents receive and respond to transferred conversations. No external helpdesk required.
Start Training Your AI Agent Today
Your website already contains the answers your customers need. Those answers are just locked behind navigation menus, search bars, and the patience of visitors willing to click around. An AI chatbot surfaces those answers instantly, at the moment of need, without requiring customers to find them themselves.
Chatbase makes this possible in under 30 minutes, without developers, without subscriptions for your customers, and without ongoing maintenance headaches. The free tier gives you 50 message credits to test the platform and build your first bot.
Start your Chatbase free trial. No credit card required. Your customers will thank you.