Built-in website crawling tools
Web Scraper (Built-in)
When a Site has a Website URL, two built-in tools are enabled automatically:
- list_site_pages: Reads the site's sitemap to list available pages
- fetch_page_content: Fetches and cleans a page's main content for the AI
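As a rough illustration of what "fetches and cleans" could mean, the sketch below strips scripts, styles, and markup from an HTML string and keeps only the visible text. This is not the built-in tool's actual implementation; `VisibleTextExtractor` and `clean_html` are hypothetical names, and the code relies only on Python's standard `html.parser` module.

```python
from html.parser import HTMLParser

# Hypothetical sketch: NOT the built-in tool's real implementation.
# Elements whose contents are never shown to a reader as page text.
SKIPPED_TAGS = {"script", "style", "noscript"}


class VisibleTextExtractor(HTMLParser):
    """Collects text that sits outside script/style/noscript elements."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside a skipped element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def clean_html(html: str) -> str:
    """Return the visible text of an HTML document as one string."""
    parser = VisibleTextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

Text that only appears after JavaScript runs never shows up in the raw HTML, so an extractor of this kind would not see it.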
How it works
- You set Website URL on the Site
- The crawler and sitemap reader become available to the AI
- Results are cached (sitemap ~12h, pages ~2h) for speed
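The caching behavior can be pictured as a simple time-to-live (TTL) cache. The sketch below is only an assumption about how such a cache might work, not the product's code; `TTLCache`, `get_or_fetch`, and the two constants are hypothetical names, and the durations are taken from the approximate figures above.

```python
import time

# Approximate durations from the description above (assumed, not exact).
SITEMAP_TTL_SECONDS = 12 * 60 * 60  # ~12h
PAGE_TTL_SECONDS = 2 * 60 * 60      # ~2h


class TTLCache:
    """Hypothetical cache that reuses a value until its TTL has elapsed."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock   # injectable so the cache can be tested
        self._entries = {}    # key -> (stored_at, value)

    def get_or_fetch(self, key, fetch):
        """Return the cached value for key, calling fetch(key) if stale."""
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]   # still fresh: no new request
        value = fetch(key)    # missing or expired: fetch again
        self._entries[key] = (now, value)
        return value
```

Under this model, a page edited on your site could take up to about two hours to show up in answers, and a newly added page could take up to about twelve hours to appear in the page list.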
Tips
- Ensure your site serves a sitemap
- Keep important content in the main HTML (avoid heavy JS-rendered text)
- Update Website URL if you move domains; the cache will refresh
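To check that a sitemap lists the pages you expect, you can parse it yourself. The sketch below assumes the sitemap follows the standard sitemaps.org XML format; `list_sitemap_urls` is a hypothetical helper for your own checks, not part of the product.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard sitemaps.org sitemap files.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def list_sitemap_urls(xml_text: str) -> list[str]:
    """Return every <loc> URL listed in a standard <urlset> sitemap."""
    root = ET.fromstring(xml_text)
    locs = root.findall("sm:url/sm:loc", SITEMAP_NS)
    return [loc.text.strip() for loc in locs if loc.text]


# Example input: a tiny two-page sitemap (made-up URLs).
SAMPLE_SITEMAP = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/pricing</loc></url>
</urlset>
""".strip()

if __name__ == "__main__":
    for url in list_sitemap_urls(SAMPLE_SITEMAP):
        print(url)
```

If an important page is missing from the output for your own sitemap, the page-listing tool may not be able to find it either.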
When to disable
If you need answers drawn strictly from your knowledge base or tools, leave Website URL empty to disable crawling.