WebCrawl

Overview:

A web crawler, also known as a "spider" or "bot," is an automated program that systematically browses the internet to discover, index, and store information from webpages. Used mainly by search engines like Google, it builds a massive, searchable index of the web, helping users find relevant content quickly.

Prerequisites:

How to create an Assistant using WebCrawl in iX Hello:

  1. On the Home Screen, click on the Quick Start App Button.

  1. You will be navigated to Quick Start Apps screen and then click on "Add a New Assistant" button.

  1. You will see the Quick Start Apps Library, which helps you to create an app using ready to use AI apps.

  2. To create a new assistant using the Add a New Assistant button and choose Quick RAG-App.

  1. Fill out the Configuration screen of General Information Tab.

  2. The table below explains each of the inputs shown on this screen.

Input

Information

Name

This is the display name of the assistant.

For Ex: WebCrawl Test

Description

Explains what the assistant is designed to do.

For Ex: WebCrawl Test

Welcome Message

This is the first message the assistant sends when a conversation starts.

Goodbye Message

This is the closing message when the conversation ends.

Persona and Behavior (Sample)

Defines the assistant’s tone, personality, and response style.

For Ex: I am a concise and focused assistant, providing accurate information based solely on the provided Content files from URL. I aim to deliver clear and relevant answers to your queries from Content files from URL provided

  1. Fill out the Configuration screen of Knowledge Base Data Tab,

Input

Information

SharePoint Upload Section

Displays files and folders from the selected SharePoint site.

Instructional Text

Connect to SharePoint, browse folders/files, and select content for AI training.

  • Supported File Types: .pdf, .docx, .csv, .xls, .xlsx, .txt, .ppt, .pptx, .gif, .jpg, .jpeg, .png, .webp.

  • Upload Limits:

    • Max per file: 200MB

    • Total: 1GB

Action Buttons

Click Save Button to save all the information updated.

  1. The bot created will appear under My Assistant and enable the PII Redaction by clicking on the warning "Click Here" and hit the Save button on the pop-up.

  2. To know more on PII Redaction refer to PII-Management

  1. Click on "Activate" button to activate the bot. You can click on "Chat Now" and start the conversation with the bot created.

  2. Based on the content in the file, which is uploaded, you can query the bot accordingly:

  1. You can access the same bot through Custom Apps.

circle-info

Using the Quick Start App to create this assistant ensures that all the necessary tabs in custom apps are automatically generated or updated. This includes the AI-Content tab, Methods Tab, Intents, Utterance, Slot, and Input.

  1. The list of contents provided while creating the assistant are as below

a. Preview Content

  • Let's you view the data that has been extracted or crawled from the specified URL.

  • Helps verify that the correct content was fetched.

  • Ensures the data is relevant and clean before using it for AI training.

b. Refresh Content

  • Updates the stored content by re-fetching data from the original URL.

  • Keeps training data current when the source website changes.

  • Ensures the AI model uses the latest information.

c. Delete

  • Removes the selected training URL and its associated content from the system.

  • Cleans up outdated or irrelevant sources.

  • Prevents unnecessary data from influencing the AI model.

Let's test the assistant

When the user clicks on the provided links, the user will be able to view the content screen associated to the link provided with in the Chat Widget.

Last updated

Was this helpful?