LogoLogo
Click Here For More Support
  • 📍Get Started
    • Welcome
    • Who We Are
    • Our Product
  • 🤖AI Agent Engine
    • AI Agent Engine Basics
      • Get to Know Your AI Agent Engine
      • Glossary
      • Dashboard Overview
    • Manage Your Content
      • Response Types
        • Prewritten Plus Responses
          • Add New Prewritten Responses
          • Edit Prewritten Responses
          • Enrich Prewritten Responses
          • Response Refiner
        • Generated Responses
          • Web-Scraped Data
            • Troubleshooting
          • Documented Data
          • Enrich Generated Responses
        • Fallback Responses
      • Enrichments
        • Buttons
        • Quick Replies
        • Videos
        • Images & GIFs
          • Best Practices
      • Input Director
        • Copy & Paste Input Training
      • Best Practices
        • Locate Responses/Data
        • Revise Responses
        • Unpublish Responses
        • Content Formatting
        • Suppress Thumbs Up/Down
        • Content Groups
      • Upgrade to Context LLM
    • Install Your Chat
      • Web Installation
        • Advanced Pop-Up Options
      • InApp Webview Installation
        • App Provider Product Overview
        • iOS Advanced Install Guide
        • Android Advanced Install Guide
        • Passing Customer Identifiers
    • Enhance the Chat Experience
      • Pages
        • Add/View Chat Pages
        • Maintain State
        • Avatars
        • Chat Header Image
        • Input Container
        • Chat Background
      • Popups
        • Editor Field Definitions
        • Add/Edit Popups
        • Auto-Open
        • Prompt Bubble
        • Popup Button Image
        • Best Practices
      • Page Themes
        • Field Definitions
        • Adjust Page Themes
      • Activation & Drivers
        • Embed Webpages
        • Link Other Chats
        • Welcome Messages
      • Placement
        • Digital Placement
          • Chat Tile
          • Chat in Chat
        • In-Person Placement
      • Sponsor Inclusion
    • Features & Integrations
      • Channels
        • Facebook Messenger
          • Maintenance & Best Practices
        • Inbound SMS
      • Integrations
        • Ticket Commerce
          • Installation/Maintenance
          • Best Practices
        • Simpleview Integration
          • Listings
          • Events
          • Search Logic Summary
        • Zapier
          • Installation Guide
            • Connect to Salesforce
            • Connect to Google Sheets
        • FEVO
          • Installation Guide
          • Best Practices
        • Carbonhouse Integration
      • Features
        • Multi-Language Functionality
        • Mobile Ticketing Guide
        • Food & Beverage Finder
          • Installation/Maintenance
          • Best Practices
        • Weather Feature
        • Consent Form Feature
        • Satisfaction Score
        • 'Notify Subscription' Feature
    • User Management
      • User Types
      • Adjust User Type & Access
      • Add/Remove a User
      • Update Your Password
      • Unlock Your Account
      • Multi-Factor Authentication
    • Reporting & Analytics
      • Analytics Dashboard
        • Overview
        • AI Chat Performance Dashboard
        • Intent Explorer Dashboard
        • Intent Trends Dashboard
        • Intent Deep-Dive Dashboard
        • Bridge Agent Performance Dashboard
        • Mobile Ticketing Guide Dashboard
      • Data Reports
        • Conversation Transcripts
        • AI Resolutions Log
        • URL Link-Outs
        • Bridge Escalations
        • CSAT Entries
        • Ticket Commerce Records
        • End User Contacts
        • NLP Traffic Log
        • Input and Response Log
        • List URL Subscriptions
        • Get Response Feedback
        • Knowtifi Subscriptions
      • API Report Options
      • External IDs
      • UTM Tracking
      • Security Alerts
  • 📬Marketing Engine
    • Marketing Engine Basics
      • Get to Know Your Marketing Engine
      • Glossary
      • Login to Your Account
    • Marketing Calendar
      • How to Use the Marketing Calendar
    • Segments
      • Segments Overview
      • Uploading Contacts Manually
      • Uploading Contacts in Bulk
      • Importing a File into Unsubscribe List
      • Manage Contact Lists
      • Create Custom Object & Custom Fields for Contact Data Management
      • Contact Engagement Rules
      • Folders
      • Audience
        • Audience Dashboard
        • Audience Filters/Segments
        • Create a Meta Audience
        • Create a Audience Using Marketing Engine
      • Best Practices
        • Use Send Time Optimization
    • Templates & Landing Pages
      • Email Templates
        • Create & Preview an Email Template with Dynamic Content
      • SMS/MMS Template
      • WhatsApp Template
        • WhatsApp Carousel Template
        • WhatsApp LTO Template
        • WhatsApp Product Message Templates
      • Webpush Template
      • Mobilepush Template
      • Landing Page Template with a Form
      • Social Conversation Templates
    • Campaigns
      • Messaging Campaigns
        • Select Your SMS/MMS Sender Number
        • How to Create an SMS/MMS Campaign
        • Enable Double Opt-In
        • Messaging Campaign Report
        • Best Practices
        • SMS Regulations
          • SMS Regulations for India
          • SMS Regulations for Canada
          • SMS/MMS Regulations for USA
      • Social Campaigns
      • Email Campaigns
        • Create an Email Campaign
        • Update an Email Campaign
        • Email Campaign Report
        • Smart AI Tools
        • Best Practices
          • Reduce SPAM rate in emails
          • Craft Compelling Email Pre-Header Text
          • Add "View This Email in Your Browser" Link
    • Journeys
      • Overview
      • Journey Blocks
      • Journey Templates
      • Create a Journey
      • Variant Testing
      • Conversion Tracking
      • Campaign or Journey Metadata
      • Advanced Filters for Sorting Campaigns/ Journeys
      • WhatsApp Journey
        • How to Use WhatsApp for Commerce
        • How to Send a WhatsApp One-Way Notification
        • Send a WhatsApp Audio Message
        • Automated WhatsApp Welcome Journey
        • WhatsApp Journey Report
      • Best Practices
        • Email Journey Strategy for Ticket Buyers
        • Journey Examples
    • Settings & Integrations
      • Integrations
        • Ticketing Integrations
          • Ticketmaster Integration
            • Best Practices
          • SeatGeek Integration
          • Glitnir Ticketing Integration
            • Maximize Marketing Engine with Glitnir Integration:
        • E-Commerce Integrations
          • Shopify
        • Website Tracking
          • Track Your Website
          • JavaScript Tracking Client
          • Track WordPress Websites
          • Track Shopify Stores
          • Enable User ID Tracking
        • Zapier Integration
      • User Management
        • Account Types
        • Add/Remove a User
        • Adjust Role & Access
        • Update Your Password
        • Two-Factor Authentication
      • Settings
        • Add Email Sender Addresses to Launch Campaigns
        • Connect Your Email Domain with the Marketing Engine
        • Connect Your Social Accounts
        • Connect your Shopify Store
        • Integrate Webpush Notifications
        • Setup Segment-based Engagement Rules
    • Factcubes
      • Fan Maturity Model
  • đź’¬Live Agent Engine
    • Get to Know Your Live Agent Engine
    • System Configuration
      • Installation Guide
      • Escalation Schedule
      • Multiple Team Involvement
      • Leave a Message & Contact Collection
      • Conversation Labels
      • Additional Channels & Placement
        • Bridge Placement
        • Bridge Email
          • Installation Guide
        • Facebook Messenger
    • Agent Overview
      • Logging In
      • Dashboard Overview
      • Profile Setup
      • Notification Preferences
      • Set Your Availability
      • Conversation Status
      • Assign a Conversation
      • Reply in a Conversation
      • Canned Responses
        • Add/Use a Canned Response
        • Modify/Delete a Canned Response
        • Best Practices
      • Participate in a Conversation
      • Private Notes & Mentions
      • Mute/Block a User
      • Prioritize Conversations
      • Resolve a Conversation
      • Export Transcripts
      • Macros/Automation
      • Keyboard Shortcuts
      • Ending Your Shift
      • Troubleshooting
    • Admin Overview
      • Adjust Team Assignments
      • Update Agent Status
      • Additional System Controls
    • Data & Reporting (Bridge)
    • Bridge Mobile App
  • SYSTEM UPDATES & SUPPORT
    • Help Center
    • Release Notes
      • Archived Product Updates
Powered by GitBook
On this page
  • How to Use the Web Scraping Tool
  • Edit/Delete Web-Scraped Data
  • Update Web-Scraped Data
  • Delete Web-Scraped Data
  • Our Recommendations
  • Create Hidden Webpages
  • Create a List of Your URLs
  • Focus on What is Important
  • What to Avoid
  • Inspect for Quality
  • FAQs

Was this helpful?

Last updated 7 months ago

Was this helpful?

How to Use the Web Scraping Tool

To add new web-scraped data, follow the steps below:

  1. Open your AI Chat

  2. Login using !login command

  3. Click 'Get Started'

  4. Click Commands 'Add this URL' or 'Add new URL'

  5. Go to Studio -> NLP Manager -> Responses to ensure the data is carried over and formatted correctly.


Edit/Delete Web-Scraped Data

Update Web-Scraped Data

There are two ways to update web-scraped data:

Rescrape Your Webpage

  1. Open your AI Chat

  2. Login using !login command

  3. Hit 'Get Started'

  4. Click Add this URL or Add New URL to rescrape the page with the updated information

  5. Go to Studio -> NLP Manager -> Responses to ensure the data is carried over and formatted correctly

Manually Update Your Content

  1. Locate the response you'd like to edit

  2. Once you click on the response name, select the three dots in the right-hand corner of the window

  3. Uncheck the box next to Content Subscription.

  4. Click the pencil icon

  5. Make any necessary edits

  6. Hit Save and Publish or just Save to keep your changes in draft mode

Why Would You Manually Update Scraped Data?

  • Add Topic Headers (ex. Topic: Group Ticket Perks)

  • Update information that hasn't been updated on your website yet

  • Improve the chunking of data

  • Removing unnecessary scraped data (webpage alerts, footer text, etc.)

  • Adding redacted information such as emails/phone numbers when applicable

Delete Web-Scraped Data

  1. Locate the response you'd like to delete

  2. Click the delete button (trash can icon)


Our Recommendations

Create Hidden Webpages

Utilize the power of our web scraping tool without having to surface information publicly on your website. By creating hidden pages on your website, you create a controlled environment that serves as a great data resource for your AI Chat to train from and curate content.

Create a List of Your URLs

We suggest making a list of the URLs you want to scrape and train in advance. Once you locate the popup on the first URL, keep your list handy and add more URLs by clicking the 'Add new URL' button. This way, you won't need to switch between pages and have complete control over the URLs you've already scraped.

Focus on What is Important

Don't attempt to scrape every web page you have! Focus your scraping efforts on web pages where your customers typically learn key information such as A-Z Guides, Things to Do, Directories, etc.

What to Avoid

Avoid scraping:

  • Web pages that are frequently updated (schedules, rosters, stats, etc.). In these cases, we recommend using a prewritten response.

  • Any third-party web pages you do not manage.

  • Web pages lacking rich content such as image-heavy pages, landing pages, etc.

Inspect for Quality

We always recommend that you not only review results from our scraping tool within the response library but also see how responses are generated and exposed within your chat.

If the information you scraped into the dashboard is not being understood by the LLM, this may be due to poor formatting within the response. To fix this:

  • Find the corresponding URL labeled response in the library (usually named after the web page it was scraped from)

  • Click Edit

  • Ensure that:

    • There is a header description related to the data's topic in each section

    • All sections are separated by a 5 “-----”

    • No section is very short or extremely long

    Note: Manual edits can be made within web-scraped data responses in the dashboard. However, if a page is rescraped, those manual edits will be overwritten.

FAQs

I'm not seeing my scraped responses and/or LLM volume in the dashboard

In the responses library, ensure that the filter is set to “company name LLM.” From there, data should change and reflect everything within the new experience

Is there any specific formatting required for emails/phone numbers in scraped data?

No! If your contact information is in a text format and written clearly (ex. email@email.com or #-###-###-####) it will be surfaced to users in generated responses.

Can I scrape text from PDFs, images and/or web banners?

Unfortunately, this content cannot be scraped. If you'd like to include this data to create generated responses, we suggest creating a documented response

Can multiple people from my team scrape the website using the same popup?

Yes! If the page is already scraped, a message will trigger that it’s already listed, and a page refresh will be triggered

How many pages can I scrape?

You can scrape as many web pages as you need; however, we typically recommend scraping between 10-30 total depending on your website

Does the chat ever pull in information from beyond our scraped webpages?

No! Responses are only generated from information scraped from websites by you

Can I scrape any webpage I want?

Avoid scraping web pages that you do not manage, as their information may go against your organization's policies and procedures

My chat is not prompting me to login when I enter !login

You may already be logged in! Check for the asterisk in your chat's text container. If it's there, that means you're logged in. Type !commands and continue scraping.

If you still experience an issue logging into admin mode, reach out to our Product Support team by clicking the Click Here For More Support button in the header and asking for a live agent!

The popup doesn’t load on a page I'm trying to scrape

Load the popup on a different page and perform the “Add New URL” command instead and enter the URL

I'm unable to scan my web page because the URL is not reachable

No worries! Submit a service request and we can assist!

If my website is updated, will it automatically update my scraped response?

As the information on your website changes, this is not automatically reflected in your scraped data. Rescrape the updated webpage to ensure your chat is properly trained and creating up-to-date responses for users

What if I have pages on other languages, should I scrape them?

No, you need to only scrape pages in English language. If you have other languages as add-ons, we will make sure to set the content in the second language for you

How can I check the exact URL the content is scraped from?

Locate the url_ response and click on Notes button

When I try to scrape my website, I get an http 500 error

No worries! Submit a service request and we can assist!

While in the , go to Studio -> NLP Manager -> Responses

While in the , go to Studio -> NLP Manager -> Responses

Real-World Examples: ,

Satisfi Dashboard
Satisfi Dashboard
Tampa Bay Buccaneers
The Jockey Club
How to Add, Update and Delete Web-Scraped Data
Update Web-Imported Data
Page cover image