Read Website

On this page

Options
Outputs
Tips

Read website is a simple step that extracts the text content from a provided URL. We are able to read most public website pages (pages which don’t require authentication), but websites with sophisticated bot protection (ie. CAPTCHAs) or those that require authentication may not be able to be read. Read Website step

The text content returned from this step may not be formatted for readability, and it will include all text (even hidden text), making it quite verbose in many cases. In most cases, you’ll want to process this data using a Generate Text, Extract Fields, or similar step directly afterwards.

Options

Name	Type	Description
URL	URL	The URL of the website page you want to extract text content from.

Outputs

Name	Type	Description
Website Contents	Plain Text	The full text content of the website page.

Tips

If you receive an error about not being able to access the website page, it may be blocked. Unfortunately there’s not much we can do in most cases, though feel free to reach out to the Respell team to see if we have any workarounds in mind.

Research Agent Search Google

Flow Tools

Text Tools

File Tools

Web & Code Tools

Integrations

Options

Outputs

Tips

Flow Tools

Text Tools

File Tools

Web & Code Tools

Integrations

​Options

​Outputs

​Tips

Options

Outputs

Tips