Q: Training Sources
My question is about training sources:
I signed up for the trial, but could only find manual ways of copying and pasting articles into the knowledge base. And I wonder whether the sources are restricted to that, or whether I could upload PDF/Word documents, Google documents, or use URLs to train the knowledge base.
In other words:
1.) existing web URL articles on my site that could be scraped and made part of the knowledge base
2.) an XML sitemap so that my chatbot knows everything that has been published on my website.
3.) Integrations with MS-Word, Google Docs or PDF files
4.) WordPress or other CMS integration — where a certain category or post types would comprise the articles to scrape and include in the knowledge base.
Hope that make sense, thank you!

Yehia
Apr 25, 2025A: Hi Juergen, Sure we are launching support for documents as a data source this month (PDF/Word documents/PPT). Websites will follow and will be released in May/June max. As for (4) please suggest it on our feature board.
Hello Juergen, Just letting you know that we just released DOCX, PPTX, HTML, TXT, XML support as a training data source

When you say HTML support, do you mean we can provide a URL to an existing webpage? Or do you mean we have to upload content still manually as a document in HMO format?
currently you can upload a document in DOCX, PPTX, HTML, TXT, XML.Giving a URL as an input is still coming this quarter