Category:Scraping Use Cases
Web Scraper for Alteryx The Ultimate Plug-and-Play Template (Powered by Scrape.do)

Growth
In the world of data analytics and growth hacking, getting your hands on reliable web data is half the battle. You need data to fuel your Alteryx workflows, but building scrapers that can bypass anti-bot walls, CAPTCHAs, and proxy bans is a massive headache.
That’s why we built the Scrape.do Universal Web Scraper Template for Alteryx.
We’ve done the heavy lifting. We packaged the world’s most powerful scraping API (Scrape.do) into a simple, custom-branded Alteryx Macro (.yxmc). All you have to do is plug it in, feed it URLs, and watch the data flow.
Here is your step-by-step guide on how to use this template to vacuum data from any website on the internet.
Step 1: Install the Engine and Add Your Key
First, download the ScrapeDo_Universal_Scraper.yxmc file to your computer.
- Open a blank workflow in Alteryx Designer.
- Right-click anywhere on the white canvas, select Insert, and then click Macro...
- Browse your computer and select the
.yxmcfile you just downloaded. You’ll see our custom Scrape.do logo drop onto your canvas.
Crucial First Step: Add Your API Token Before you can scrape, you need to authorize the tool.
- Right-click the Scrape.do macro on your canvas and select Open Macro.
- Inside the macro, look for the green Text Input Tool at the very beginning.
- Click it, and in the configuration window, replace the text
ENTER_YOUR_TOKEN_HEREwith your actual Scrape.do API Token (you can grab this from your Scrape.do dashboard). - Hit Save (
Ctrl + S) and close the macro tab. You only have to do this once!
Step 2: Feed the URLs
Now, jump back to your main Alteryx workflow.
- Drag a standard Text Input tool onto the canvas and place it to the left of the Scrape.do macro.
- Create a column named
urland paste the list of target websites you want to scrape (e.g., product pages, real estate listings, directories). - Connect the output of your Text Input tool to the input anchor of the Scrape.do macro.
Step 3: Run the Scraper (Getting the Raw HTML)
Add a Browse tool to the output of the Scrape.do macro and hit the blue Run button.
What just happened? The macro took your URLs, routed them through Scrape.do's massive pool of rotating residential proxies, bypassed any target website security, and brought back the goods.
In your results window, you will see a column named DownloadData. This column contains the raw, unblocked HTML source code of the entire webpage.
Step 4: The Magic Trick - Extracting Data with RegEx
Having the raw HTML is great, but you usually want specific data points—like a product title, a price, or an email address. Because this template is universal, it doesn't parse the data for you; it lets you decide what to extract using Alteryx’s built-in Regular Expression (RegEx) tool.
Here is how to slice up that HTML:
- Drag Alteryx’s RegEx tool onto the canvas and connect it to the output of the Scrape.do macro.
- In the RegEx configuration panel, select the
DownloadDatacolumn as your Field to Parse. - Choose Parse as your Output Method.
Example: Extracting the Page Title
Let’s say you want to extract the title of the webpage. In the Regular Expression box, you would type: <title>(.\*?)</title>
What does this do? It tells Alteryx to look inside the HTML, find the <title> tag, and extract everything trapped inside the parentheses (.\*?) into a brand new, clean column.
Example: Extracting a Price
If you are scraping an e-commerce site and notice the price is always wrapped in a specific HTML class like <span class="price">99.99</span>, your RegEx would look like this: class="price">([^<]+)
You can string as many RegEx tools together as you need to build out a massive, highly structured dataset from the raw HTML our template provides.
Ready to Scrape?
With this template, you no longer need to write Python scripts or pay for expensive, rigid scraping software. You have a universal skeleton key that integrates natively into your existing Alteryx data pipelines.
Download the Scrape.do Universal Alteryx Template Here
Grab your token, fire up Alteryx, and let the data flow!

Growth

