Learn how to generate responses from data pulled from your website with our web-scraping tool!
How to Use the Web Scraping Tool
To add new web-scraped data, follow the steps below:
Open your AI Chat
If you haven't upgraded to Context LLM yet, you need to go to the Dashboard -> Popups section and drag and drop the *LLM popup to your bookmark list. For more information, check out the upgrade section below!
Log in using the !login command
Hit 'Get Started'
Select the 'Add this URL' or 'Add new URL' command
Done!
Update/Delete Existing Web-Scraped Data
To update web-scraped data, follow the steps below:
Open your AI Chat
Log in using the !login command
Hit 'Get Started'
Click 'Add this URL' or 'Add new URL' to rescrape the page with the updated information
To delete web-scraped data, follow the steps below:
While in the Satisfi Dashboard, go to Studio -> NLP Manager -> Responses
Locate the response you'd like to delete
Click the delete button (trash can icon)
Having an issue locating the response you want to update? Follow the steps below!
Determine the topic/keywords you are searching for (ex. door times, exhibit name, etc.)
Use the “content” search within the response library to locate the response(s) requiring an update
Tip: Use the filter option to double-check that you are working within an LLM volume. If you are looking for a web-scraped response, the response name will start with "url_"
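If you keep a copy of your responses outside the dashboard, the same lookup can be sketched in a few lines of Python. This is only an illustration: the list of dictionaries and its "name" and "content" fields are assumptions made for the example, not an export format or API provided by the dashboard. The only detail taken from the tip above is that web-scraped response names start with "url_".

```python
def find_scraped_responses(responses, keyword):
    """Return web-scraped responses whose content mentions the keyword.

    `responses` is a hypothetical list of dicts with "name" and "content"
    keys; the dashboard itself is not queried here.
    """
    keyword = keyword.lower()
    return [
        r for r in responses
        # Web-scraped responses are named with a "url_" prefix.
        if r["name"].startswith("url_") and keyword in r["content"].lower()
    ]


# Made-up sample data for illustration only.
responses = [
    {"name": "url_plan-your-visit", "content": "Door times: gates open at 5 PM."},
    {"name": "parking_faq", "content": "Door times vary by event."},
    {"name": "url_exhibits", "content": "The dinosaur exhibit is on level 2."},
]

matches = find_scraped_responses(responses, "door times")
print([r["name"] for r in matches])
```

The second sample entry mentions door times but is skipped because its name lacks the "url_" prefix, which mirrors filtering the library down to web-scraped responses before searching by content.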
Our Recommendations
Create Hidden Webpages
Utilize the power of our web scraping tool without having to surface information publicly on your website. By creating hidden pages on your website, you create a controlled environment that serves as a great data resource for your AI Chat to train from and curate content.
Real-World Examples: Tampa Bay Buccaneers, The Jockey Club
Create a List of Your URLs
We suggest making a list in advance of the URLs you want to scrape and train on. Once you locate the popup on the first URL, keep your list handy and add more URLs by clicking the 'Add new URL' button. This way, you won't need to switch between pages, and you can keep track of the URLs you've already scraped.
Focus on What is Important
Don't attempt to scrape every web page you have! Focus your scraping efforts on web pages where your customers typically learn key information such as A-Z Guides, Things to Do, Directories, etc.
What to Avoid
Avoid scraping:
Web pages that are frequently updated (schedules, rosters, stats, etc.). In these cases, we recommend using a prewritten response.
Any third-party web pages you do not manage.
Web pages lacking rich content such as image-heavy pages, landing pages, etc.
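If your URL list is long, a quick pre-check can flag pages that fall under the first two "avoid" rules above before you start scraping. The sketch below is an assumption-heavy illustration: the function name, the keyword hints ("schedule", "roster", "stats"), and the example.com addresses are all made up for the example, and it cannot judge whether a page lacks rich content, which still needs a manual look.

```python
from urllib.parse import urlparse

# Hypothetical path keywords that suggest a frequently updated page.
VOLATILE_HINTS = ("schedule", "roster", "stats")


def triage_urls(urls, own_domain):
    """Split candidate URLs into ones to scrape and ones to skip.

    Returns (keep, skip), where skip holds (url, reason) pairs.
    """
    keep, skip = [], []
    for url in urls:
        parsed = urlparse(url)
        host = (parsed.hostname or "").lower()
        path = parsed.path.lower()
        if host != own_domain and not host.endswith("." + own_domain):
            # Not your domain or one of its subdomains.
            skip.append((url, "third-party page"))
        elif any(hint in path for hint in VOLATILE_HINTS):
            skip.append((url, "frequently updated"))
        else:
            keep.append(url)
    return keep, skip


# Made-up URLs for illustration only.
urls = [
    "https://www.example.com/a-z-guide",
    "https://www.example.com/team/roster",
    "https://tickets.example.org/events",
]
keep, skip = triage_urls(urls, "example.com")
print(keep)
print(skip)
```

Matching on path keywords is a rough heuristic, so treat the "skip" list as a prompt for review instead of a final decision.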
Inspect for Quality
We always recommend that you not only review results from our scraping tool within the response library but also see how responses are generated and exposed within your chat.
If the information you scraped into the dashboard is not being understood by the LLM, this may be due to poor formatting within the response. To fix this:
Find the corresponding URL-labeled response in the library (usually named after the web page it was scraped from)
Click Edit
Ensure that:
There is a header description related to the data's topic in each section
All sections are separated by a line of five hyphens (“-----”)
No section is very short or extremely long
Note: Manual edits can be made within web-scraped data responses in the dashboard. However, if a page is rescraped, those manual edits will be overwritten.
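The formatting checklist above can also be expressed as a small script if you want to sanity-check response text before pasting it back into the editor. This is a minimal sketch under stated assumptions: it treats the first non-empty line of each section as the header description, and the length limits (40 and 1,500 characters) are arbitrary placeholders, since no exact thresholds are defined for "very short" or "extremely long".

```python
SEPARATOR = "-----"  # Five hyphens on their own line separate sections.


def check_response_format(text, min_chars=40, max_chars=1500):
    """Return a list of formatting issues found in a scraped response.

    min_chars and max_chars are assumed limits chosen for illustration.
    """
    # Split the text into sections at each separator line.
    sections, current = [], []
    for line in text.splitlines():
        if line.strip() == SEPARATOR:
            sections.append(current)
            current = []
        else:
            current.append(line)
    sections.append(current)

    issues = []
    for number, lines in enumerate(sections, start=1):
        non_empty = [line.strip() for line in lines if line.strip()]
        body = "\n".join(non_empty)
        # Assumption: a section needs a header line followed by content.
        if len(non_empty) < 2:
            issues.append(f"section {number}: needs a header line plus content")
        if len(body) < min_chars:
            issues.append(f"section {number}: very short ({len(body)} chars)")
        elif len(body) > max_chars:
            issues.append(f"section {number}: very long ({len(body)} chars)")
    return issues


# Made-up response text for illustration only.
sample = (
    "Door Times\n"
    "Gates open 90 minutes before each event starts.\n"
    "-----\n"
    "Parking\n"
    "-----\n"
    "Bag Policy\n"
    "Clear bags up to 12 x 6 x 12 inches are allowed."
)
for issue in check_response_format(sample):
    print(issue)
```

In this sample, only the "Parking" section is flagged, because it has a header but no content beneath it. Adjust the limits to whatever works well in your own chat testing.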
FAQs