Questions

Question

Hi,\u000a\u000aI\u0027m interested in the app and have a few questions.\u000a\u000a1. Is there a way to make the agent answer only on the training data and not generic knowledge?\u000a\u000a2. Is there a way to train URLs and possibly sitemaps? If so, does it have auto\u002Dretrain functionality?

SeanP_AgenticFlowAI · Answer

Hey there!\u000a\u000aGreat questions – these are key to building effective and reliable AI agents!\u000a\u000a1. Agent Answering ONLY on Training Data (Restricting Generic Knowledge):\u000a\u000aYes, this is crucial and achievable through careful prompting. In your Agent\u0027s System Prompt (its core instructions), you need to be very explicit:\u000a\u000a\u002D \u0022You are an assistant for [My Company/Product]. Your role is to answer questions exclusively based on the information provided in your knowledge base (the documents and website content you have been given).\u0022\u000a\u000a\u0022Do NOT use your general knowledge or information from outside these provided sources to answer user queries.\u0022\u000a\u000a\u0022If you cannot find an answer to a question within your provided knowledge base, politely state that you don\u0027t have that specific information and offer to [e.g., connect them to support / provide a contact email / search the web if you\u0027ve given it that tool explicitly].\u0022\u000a\u000aThis technique, combined with Retrieval\u002DAugmented Generation (RAG) where the agent first searches your documents, significantly helps in keeping answers grounded in your data. While no LLM can be 100% guaranteed to never access its base training, strong prompting makes a huge difference.\u000a\u000a2. Training on URLs and Sitemaps \u0026 Auto\u002DRetrain:\u000a\u000aTraining on URLs:\u000a\u000aYes. When you create an Agent using our 1\u002Dclick widget in Templates page, you can directly paste URLs, and AgenticFlow will attempt to crawl and index the content from those pages for the agent\u0027s knowledge base.\u000a\u000aYou can also use workflow nodes like Web Scraping or the Firecrawl MCP (https://agenticflow.ai/mcp/firecrawl) to fetch content from URLs and then process/add that content to a dataset your agent can reference.\u000a\u000aTraining on Sitemaps:\u000aAgenticFlow doesn\u0027t have a direct \u0022input sitemap.xml\u0022 feature for agent creation at this moment.\u000aWorkaround: You can use a workflow:\u000a\u002D Fetch the sitemap.xml (e.g., using the Web Scraping node to get its content or the Firecrawl Map node if it can process sitemaps).\u000a\u002D Parse the XML to extract all the individual page URLs.\u000a\u002D Then, loop through those URLs, scrape each one, and compile the content into a dataset or feed it to an agent for knowledge ingestion (e.g., by updating a Table Dataset programmatically via API, or soon, direct knowledge base updates via API).\u000a\u000aThis is a great feature request for more direct sitemap support! Please add it to our roadmap: https://agenticflow.featurebase.app/\u000a\u000aAuto\u002DRetrain Functionality:\u000aNot fully automatic in the background yet. Currently, if your website content or uploaded documents change, you would typically need to:\u000a\u002D Re\u002Dcrawl the URLs (if using the agent\u0027s URL knowledge source and there\u0027s a \u0022re\u002Dsync\u0022 option, which we\u0027re improving).\u000a\u002D Re\u002Dupload updated files to the Agent\u0027s Knowledge Base.\u000a\u002D Or, re\u002Drun the workflow that populates its Table Dataset.\u000a\u000aScheduled Re\u002DTraining (Roadmap/Workaround): True \u0022auto\u002Dretrain on a schedule\u0022 (e.g., \u0022re\u002Dcrawl these URLs every week and update the agent\u0027s knowledge\u0022) is a more advanced feature we\u0027re planning. For now, you could build a scheduled workflow (triggered by external cron + API call) that re\u002Dscrapes key URLs and updates a Table Dataset that your agent uses for RAG.\u000a\u000aWe\u0027re continuously working on making knowledge ingestion and updates more seamless and automated. Your feedback helps us prioritize!\u000a\u000aHope this helps!\u000a— Sean

AgenticFlow

Share AgenticFlow

Related questions