WebCrawl
Overview:
A web crawler, also known as a "spider" or "bot," is an automated program that systematically browses the internet to discover, index, and store information from webpages. Used mainly by search engines like Google, it builds a massive, searchable index of the web, helping users find relevant content quickly.
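The discover-and-store loop described above can be illustrated with a minimal sketch. This is not how iX Hello implements WebCrawl; the names `LinkExtractor`, `fetch_html`, and `crawl` are hypothetical, and the sketch assumes a breadth-first crawl that stays on the starting host and stops after `max_pages` pages.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch_html(url):
    """Default fetcher (illustrative): download a page and decode it as text."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, fetch=fetch_html, max_pages=10):
    """Breadth-first crawl restricted to the start URL's host.

    Returns a dict mapping each visited URL to its raw HTML.
    """
    host = urlparse(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    pages = {}
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        try:
            html = fetch(url)
        except Exception:
            continue  # skip pages that fail to load
        pages[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop "#fragment" suffixes.
            link = urldefrag(urljoin(url, href)).url
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return pages
```

Passing `fetch` as a parameter lets the crawl logic be tried against an in-memory set of pages without network access. A production crawler would also honor robots.txt and rate-limit its requests.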
Prerequisites:
An iX Hello account. For steps to create one, follow the steps at
Use this link to sign up for the iX Hello platform.
To create an app using Quick Start App, refer to this
How to create an Assistant using WebCrawl in iX Hello:
On the Home Screen, click on the Quick Start App Button.

You are taken to the Quick Start Apps screen. Click the "Add a New Assistant" button.

You will see the Quick Start Apps Library, which helps you create an app from ready-to-use AI apps.
To create a new assistant, click the Add a New Assistant button and choose Quick RAG-App.

Fill out the Configuration screen of the General Information tab.
The table below explains each of the inputs shown on this screen.

| Input | Information |
| --- | --- |
| Name | This is the display name of the assistant. For example: WebCrawl Test |
| Description | Explains what the assistant is designed to do. For example: WebCrawl Test |
| Welcome Message | This is the first message the assistant sends when a conversation starts. |
| Goodbye Message | This is the closing message when the conversation ends. |
| Persona and Behavior (Sample) | Defines the assistant's tone, personality, and response style. For example: I am a concise and focused assistant, providing accurate information based solely on the provided Content files from URL. I aim to deliver clear and relevant answers to your queries from Content files from URL provided. |


Fill out the Configuration screen of the Knowledge Base Data tab.

| Input | Information |
| --- | --- |
| SharePoint Upload Section | Displays files and folders from the selected SharePoint site. |
| Instructional Text | Connect to SharePoint, browse folders/files, and select content for AI training. Supported File Types: .pdf, .docx, .csv, .xls, .xlsx, .txt, .ppt, .pptx, .gif, .jpg, .jpeg, .png, .webp. Upload Limits: Max per file: 200MB; Total: 1GB |
| Action Buttons | Click the Save button to save all the updated information. |
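
The file types and upload limits listed above can be checked locally before selecting content. The sketch below is only an illustration and does not call any iX Hello API; `check_uploads` is a hypothetical helper, and treating 1 MB as 1024 * 1024 bytes is an assumption, since the source does not say how the limits are measured.

```python
import os

# Extensions and limits copied from the list above.
SUPPORTED_EXTENSIONS = {
    ".pdf", ".docx", ".csv", ".xls", ".xlsx", ".txt", ".ppt",
    ".pptx", ".gif", ".jpg", ".jpeg", ".png", ".webp",
}
# Assumption: 1 MB = 1024 * 1024 bytes and 1 GB = 1024 MB.
MAX_FILE_BYTES = 200 * 1024 * 1024
MAX_TOTAL_BYTES = 1024 * 1024 * 1024


def check_uploads(files):
    """Check (name, size_in_bytes) pairs; return a list of problems (empty if OK)."""
    problems = []
    total = 0
    for name, size in files:
        ext = os.path.splitext(name)[1].lower()
        if ext not in SUPPORTED_EXTENSIONS:
            problems.append(f"{name}: unsupported file type '{ext}'")
        if size > MAX_FILE_BYTES:
            problems.append(f"{name}: exceeds the 200MB per-file limit")
        total += size
    if total > MAX_TOTAL_BYTES:
        problems.append("selection exceeds the 1GB total limit")
    return problems
```

For example, `check_uploads([("guide.pdf", 5 * 1024 * 1024)])` returns an empty list, while a `.mp4` file would be reported as an unsupported type.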

The bot you created appears under My Assistant. To enable PII Redaction, click "Click Here" in the warning, then click the Save button in the pop-up.
To learn more about PII Redaction, refer to PII-Management.


Click the "Activate" button to activate the bot. You can then click "Chat Now" to start a conversation with the bot you created.
You can query the bot based on the content that was uploaded:

You can access the same bot through Custom Apps.
Using the Quick Start App to create this assistant ensures that all the necessary tabs in custom apps are automatically generated or updated. This includes the AI-Content tab, Methods Tab, Intents, Utterance, Slot, and Input.

The list of content provided while creating the assistant is shown below.

a. Preview Content
Lets you view the data that has been extracted or crawled from the specified URL.
Helps verify that the correct content was fetched.
Ensures the data is relevant and clean before using it for AI training.
b. Refresh Content
Updates the stored content by re-fetching data from the original URL.
Keeps training data current when the source website changes.
Ensures the AI model uses the latest information.
c. Delete
Removes the selected training URL and its associated content from the system.
Cleans up outdated or irrelevant sources.
Prevents unnecessary data from influencing the AI model.
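
The three actions above can be modeled with a small in-memory store. This is a hypothetical sketch to show how preview, refresh, and delete relate to each other, not the platform's implementation; `TrainingContentStore` and its `fetch` callback are assumed names introduced for illustration.

```python
class TrainingContentStore:
    """Hypothetical in-memory model of crawled training URLs and their content."""

    def __init__(self, fetch):
        # fetch: a callable that takes a URL and returns its extracted text.
        self._fetch = fetch
        self._content = {}

    def add(self, url):
        """Crawl a URL once and store what was extracted."""
        self._content[url] = self._fetch(url)

    def preview(self, url, max_chars=200):
        """Preview Content: show the start of the stored text for a URL."""
        return self._content[url][:max_chars]

    def refresh(self, url):
        """Refresh Content: re-fetch the URL; return True if the content changed."""
        if url not in self._content:
            raise KeyError(url)
        latest = self._fetch(url)
        changed = latest != self._content[url]
        self._content[url] = latest
        return changed

    def delete(self, url):
        """Delete: remove the URL and its associated content."""
        del self._content[url]

    def urls(self):
        """List the URLs currently stored."""
        return list(self._content)
```

Returning a changed flag from `refresh` is a design choice of this sketch; it makes it easy to tell whether the source page was actually updated since the last crawl.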
Let's test the assistant

When the user clicks a link provided in a response, the content screen associated with that link opens within the Chat Widget.
