1. On the 'Sources' page, click on 'Scrape new domain'. A dialog box will appear with two options: enter a website URL directly, or enter the sitemap URL of the website.
  2. Enter the website URL or the sitemap URL and click submit. The platform will begin to scrape all the links of the website or sitemap.

<aside> 💡 For optimal performance, it is recommended to add server-side websites as these are currently better supported by the platform. The platform also supports GitBook, Intercom articles, and various resource centers

</aside>

  1. After the scraping process, all links from the website or sitemap will be displayed in a table. This table will show details like page URL, character count, the last trained on, and status.

<aside> 💡 Note: Scraping process might take some time depending on the size of the website or the number of links present in the sitemap. The larger the site or the more extensive the sitemap, the longer it will take to fully scrape the domain.

</aside>

Once scraping done, the platform will list down the links in a table. This table includes columns for the page URL, character count, the last trained on, and status.

<aside> 💡 Remember, you can add as many domains as you like until you reach the page limit set by your chosen plan. With each new domain, your bot gains a wider knowledge base to pull from, enhancing its ability to assist your users

</aside>

How status works?

Each source used for training your bot displays a status indicating its current state in the training process. This guide will help you understand what each status means, enabling you to manage and monitor your sources effectively.

Here's a breakdown of what each status signifies:

<aside> 💡 Note The 'Skipped' status is only applicable at the domain page level and not for each individual source.

</aside>

Knowing what each status means is crucial for understanding your bot's training process. This knowledge provides insights into what your bot has learned, what it is currently learning, and what it will potentially learn in the future.