
Browser Requests

Making web automation requests has never been so simple.

Browser Requests allow you to send the Gaffa API a URL and a list of actions you want carried out, including any outputs you want from the page. We carry out the request on our cloud browsers and return the response to you, with no need to worry about proxies, IP rotation, web automation frameworks or scaling.

There's absolutely zero configuration needed and you can interact with Gaffa from any program that can send web requests. We think it's by far the simplest way to automate simple web tasks and the good news is, we're just getting started and have much more planned.

Example request

Running a new browser request is as simple as sending the following request. Below, you can see the url and a list of actions which instruct Gaffa to wait for a table to load and print the page to PDF.
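A minimal Python sketch of such a request follows. The endpoint URL, the `X-API-Key` header, the target page, and the `selector` and `timeout` parameter names are assumptions made for illustration, not confirmed values, so check them against the API reference. The sketch only builds the HTTP request so that it can be inspected before anything is sent.

```python
import json
import urllib.request

# Assumed endpoint; confirm it (and the header name below) in the API reference.
GAFFA_ENDPOINT = "https://api.gaffa.dev/v1/browser/requests"


def build_browser_request(api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST that waits for a table and prints to PDF."""
    payload = {
        "url": "https://example.com/report",  # placeholder page
        "actions": [
            # Every parameter other than "type" is a hypothetical name.
            {"type": "wait", "selector": "table", "timeout": 5000},
            {"type": "print"},
        ],
    }
    return urllib.request.Request(
        GAFFA_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "X-API-Key": api_key},
        method="POST",
    )


request = build_browser_request("YOUR_API_KEY")
body = json.loads(request.data)
print(json.dumps(body, indent=2))
```

Passing `request` to `urllib.request.urlopen` would send it; that step is left out here so the sketch runs without network access or a real key.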

Actions

You can specify a list of actions for us to carry out on the requested web page. These actions conform to the following format:

Universal Parameters

All actions have the following parameters:

Name | Type | Required | Description

Block DOM Removals

Type: block_dom_removals

This action prevents the page from removing elements from the DOM. This is useful when you are scraping a JavaScript-based web application that removes items once they are out of view, which can make grabbing data difficult.

Using this action will block DOM removals for the rest of the browser request.
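As a rough illustration of where this action might sit in a request, the sketch below places it first so that everything loaded during a later scroll stays in the DOM for capture. The `percentage` parameter on the scroll action is an assumed name; only the action types come from this page.

```python
import json

# Hypothetical action list: block removals first, then scroll, so items that
# leave the viewport are still present when the DOM is captured.
actions = [
    {"type": "block_dom_removals"},
    {"type": "scroll", "percentage": 100},  # "percentage" is an assumed parameter
    {"type": "capture_dom"},
]

print(json.dumps(actions, indent=2))
```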

Parameters

See universal parameters.

Usage

Block DOM removals on the current page

Capture Cookies

Type: capture_cookies

This action captures the browser cookies currently saved for the web page you are on and returns them as a JSON object of key/value pairs.

Parameters

See universal parameters.

Usage

Capture the cookies of the current page
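Because the cookies are described as a key/value JSON object, they should be easy to reuse in a follow-up HTTP call. The sketch below shows the action entry and turns a made-up cookie object into a `Cookie` header value. The sample cookie names and values are invented, and the wrapper around the cookies in the real response is not modelled here.

```python
capture_cookies_action = {"type": "capture_cookies"}


def to_cookie_header(cookies: dict) -> str:
    """Join key/value cookies into a single Cookie request header value."""
    return "; ".join(f"{name}={value}" for name, value in cookies.items())


# Invented sample output, shaped like the key/value object described above.
sample_cookies = {"session_id": "abc123", "theme": "dark"}
cookie_header = to_cookie_header(sample_cookies)
print(cookie_header)
```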

Capture DOM

Type: capture_dom

This action captures and returns the raw DOM of the page, which you can then extract data from on your end.

For common AI scenarios you may find this returns too much data, so we have provided the generate_simplified_dom action, which distills the DOM to only the important elements.

Parameters

See universal parameters.

Capture Screenshot

Type: capture_screenshot

Takes a screenshot of the current page. You can choose to take a full-height screenshot showing the whole page or capture just the current view.

Parameters

Name | Type | Required

Capture Element

Type: capture_element

Returns the contents of a particular element on the page. Use this when you are only interested in that one element rather than the whole page.

Parameters

Name | Type | Required

Capture Snapshot

Type: capture_snapshot

This output type returns an HTML file which captures a static version of the page state. The page will load offline and can be saved to your local machine.

This will:

Load and embed all images on the page.
Embed all CSS files.

Currently, JavaScript is disabled and interactivity might not work as expected, but this feature should be useful for preserving the page state as it was and allowing you to view it offline.

Click

Type: click

Request that the browser clicks a particular element on the page.

Parameters

Name | Type | Required | Description

Download File

Type: download_file

Request a copy of the most recent file viewed in the browser.

Parameters

Name | Type | Required | Description

Generate Markdown

Type: generate_markdown

The markdown output format exports the content of the page (an article, a table, etc.) in a human- and LLM-readable format, removing unnecessary styling data and other "junk" that is only needed for the site to work properly.

Gaffa exports GitHub flavoured markdown with comments removed and unknown tags ignored.

Parameters

See universal parameters.

Usage

The following converts the current page to markdown:

Example Output

Generate Simplified DOM

Type: generate_simplified_dom

When you're looking at the DOM of a web page, there's a lot of unnecessary data that can be discarded if you are only interested in the page's elements or want to pass the data to an LLM. The generate_simplified_dom output format processes the HTML in the following way:

Removes all links in the head
Removes all script elements

Print

Type: print

Request that the browser prints the page to a PDF.

Parameters

Name | Type | Required | Description

Parse Table

Type: parse_table

Finds a table on the page with a given selector and then converts the table data into a JSON object.

This action first finds the table headers and converts them into property names by converting them to lower case and replacing non-alphanumeric characters with underscores. It then processes each table row and, for each cell, extracts the contents and saves them as a value. At the moment, all values are strings.
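The conversion described above can be approximated locally. This is a sketch of the stated rules, not Gaffa's implementation: it assumes each non-alphanumeric character becomes its own underscore and that no trimming or de-duplication happens, neither of which is confirmed here.

```python
import re


def normalise_header(header: str) -> str:
    """Lower-case a header and replace each non-alphanumeric character with "_"."""
    return re.sub(r"[^a-z0-9]", "_", header.lower())


def table_to_records(headers: list, rows: list) -> list:
    """Pair each cell with its normalised header; every value stays a string."""
    keys = [normalise_header(h) for h in headers]
    return [dict(zip(keys, (str(cell) for cell in row))) for row in rows]


records = table_to_records(
    ["Product Name", "Unit-Price"],
    [["Widget", "9.99"], ["Gadget", "24.50"]],
)
print(records)
```

Note that the prices stay as the strings "9.99" and "24.50"; any numeric conversion is left to the caller.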

Parameters

Name | Type | Required | Description

See universal parameters.

Usage

Extract a table on the page

The following code will wait up to 1 second for the .large_table element to appear and return a JSON file with the headers and rows converted.
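One plausible shape for that action is sketched below as a Python dictionary. The `selector` and `timeout` parameter names, and the use of milliseconds for the timeout, are assumptions, since the parameter table is not reproduced here.

```python
import json

# Hypothetical parse_table action: one entry carrying both the table selector
# and how long to wait for it (assumed to be in milliseconds).
parse_table_action = {
    "type": "parse_table",
    "selector": ".large_table",
    "timeout": 1000,
}

print(json.dumps({"actions": [parse_table_action]}, indent=2))
```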

Scroll

Type: scroll

Request that the browser scrolls to a certain point on the page or, in the case of pages with infinite scrolling, scrolls for a particular amount of time.

Parameters

Name | Type | Required | Description

Type

Type: type

Request that the browser types a particular piece of text into a field.

Parameters

Name | Type | Required | Description

See universal parameters.

Sites that use more advanced bot detection often use keyboard events to detect unusual activity. Rather than immediately dropping all characters of the text into a field, our platform types the text in a human-like manner.

Usage

Type into a text box

The following action will type into a particular text field.

Wait for an element to appear before typing

The following code will wait a maximum of 10 seconds for the email input field to appear and then type in the provided email.
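Both usages can be sketched with one small helper. The `selector`, `text` and `timeout` parameter names, the millisecond unit, and the example selectors are assumptions for illustration.

```python
from typing import Optional


def type_action(selector: str, text: str, timeout_ms: Optional[int] = None) -> dict:
    """Build a hypothetical "type" action; the timeout is included only when given."""
    action = {"type": "type", "selector": selector, "text": text}
    if timeout_ms is not None:
        action["timeout"] = timeout_ms  # assumed to be milliseconds
    return action


# Type straight into a text box.
plain = type_action("#search", "blue widgets")

# Wait up to 10 seconds for the email input to appear, then type.
with_wait = type_action("input[type='email']", "jane@example.com", timeout_ms=10_000)

print(plain)
print(with_wait)
```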

Wait

Type: wait

Request that the browser waits a given amount of time or for a particular item to appear on the page.

Parameters

Name | Type | Required | Description

API Playground Examples

On the following pages you can view the pre-built requests we've created to show what is possible with the Gaffa web automation API.

You can start using these in the API Playground once you've created an account.

Infinitely Scroll an Ecommerce Site

An example request that uses Gaffa to infinitely scroll down a simulated ecommerce site whilst recording the interaction.

The following example is a request we've pre-built to show you Gaffa's capabilities against our demo site. You can run this request right now in the Gaffa API Playground.

Gaffa automates infinite scrolling on dynamic pages like e-commerce storefronts. Set a duration, and Gaffa will capture all content as it scrolls. Each session can be recorded as a video for playback, letting you debug or review the interaction.

API Request

The request below uses the POST endpoint to open the demo site's ecommerce simulator, which has an infinitely scrolling storefront. It waits for and dismisses a dialog box, waits for a product to load and then scrolls down the page for a maximum of 20 seconds, continuing to scroll whenever new items load.
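A sketch of what such a payload could look like is given below. The URL, every selector, the `record_request` setting and the `max_scroll_time` parameter are placeholders or assumed names, not the values used by the actual playground example.

```python
import json

payload = {
    "url": "https://example.com/ecommerce?infinite=true",  # placeholder, not the demo URL
    "settings": {"record_request": True},  # assumed name for the video-recording switch
    "actions": [
        # Wait for the dialog, then dismiss it (selectors are invented).
        {"type": "wait", "selector": "[role='dialog']", "timeout": 10000},
        {"type": "click", "selector": "[role='dialog'] button"},
        # Wait for the first product before scrolling.
        {"type": "wait", "selector": ".product", "timeout": 5000},
        # Keep scrolling for at most 20 seconds (assumed to be in milliseconds).
        {"type": "scroll", "max_scroll_time": 20000},
    ],
}

print(json.dumps(payload, indent=2))
```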

Actions

Response

Here's a video showing Gaffa scrolling the page for 20 seconds as more items load.


Capture a Full Height Screenshot

An example request that uses Gaffa to dismiss a modal, scroll to the bottom of a page and then capture a full height screenshot.

The following example is a request we've pre-built to show you Gaffa's capabilities against our demo site. You can run this request right now in the Gaffa API Playground.

Gaffa can also capture screenshots at any point during your interaction, for use in your app or just to work out exactly what was being shown at a given point in time. You can capture just what is shown, as if you were looking at the screen, or the full height of the page.

API Request

The request below uses the POST endpoint to open the demo site on the ecommerce page with 20 items, wait for and dismiss the dialog, scroll to the bottom of the page and capture a full height screenshot.
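The action sequence could be assembled along these lines. The close-button selector, the `percentage` scroll parameter and the `size` option on the screenshot are assumed names chosen for illustration.

```python
import json


def full_height_screenshot_actions(close_selector: str) -> list:
    """Dismiss a dialog, scroll to the bottom, then capture the whole page."""
    return [
        {"type": "wait", "selector": close_selector, "timeout": 10000},
        {"type": "click", "selector": close_selector},
        {"type": "scroll", "percentage": 100},  # assumed: 100 means bottom of page
        {"type": "capture_screenshot", "size": "fullscreen"},  # assumed option
    ]


actions = full_height_screenshot_actions("button.close-dialog")
print(json.dumps(actions, indent=2))
```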

Automated Form Filling

An example request that uses Gaffa to automate the completion of a form and waits for a success modal to appear.

The following example is a request we've pre-built to show you Gaffa's capabilities against our demo site. You can run this request right now in the Gaffa API Playground.

Filling in forms is tedious. Gaffa can fill out a form in a human-like manner so you can spend your time on more interesting things.

API Request

The request below uses the POST endpoint to open the demo site on the form simulator page with some sections pre-filled (for speed). After typing in the required information and clicking submit, Gaffa waits for the success dialog to show before returning a video of the interaction.
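One way to generate the actions for a form like this is to map each field selector to the text that should be typed into it. The helper below is a sketch; the selectors, the parameter names and the 10-second timeout are all assumptions.

```python
def form_fill_actions(fields: dict, submit_selector: str, success_selector: str) -> list:
    """Type into each field in order, click submit, then wait for the success dialog."""
    actions = [
        {"type": "type", "selector": selector, "text": text}
        for selector, text in fields.items()
    ]
    actions.append({"type": "click", "selector": submit_selector})
    actions.append({"type": "wait", "selector": success_selector, "timeout": 10000})
    return actions


actions = form_fill_actions(
    {"#first-name": "Jane", "#email": "jane@example.com"},  # invented selectors
    submit_selector="button[type='submit']",
    success_selector=".success-dialog",
)
print([action["type"] for action in actions])
```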

Parse JSON

Paid Action: This action will consume credits based on the amount of content being parsed, see more below.

Beta Feature: This feature is currently in beta and restricted to approved users. If you're interested in trying it, please contact us and we can enable this feature for your account.

Type: parse_json

The parse_json action extracts data from web pages and online PDFs. It uses AI to parse web content from text into a pre-defined data schema and return it as a JSON object.

The action allows you to convert unstructured content such as academic papers, forms, and webpages into JSON objects, which you can use in automations, analysis, or further processing.

This feature currently works for online PDFs and web page text.

Parameters

Name | Type | Required | Description

See universal parameters.

Defining Data Schemas

A data schema tells the model exactly what JSON structure to produce.

You can define schemas in two ways:

Inline schemas (defined directly inside the action)
Reusable schemas (created via the Schema API and referenced by ID in your requests)

Schema Structure

A schema has:

Property | Type | Description

Each field in the fields array has:

Supported Field Types

Type | Description

Inline Schema Example

This example shows:

Simple fields (string, datetime) for basic data
Object fields for grouped related data with nested fields
Array fields for lists of items with nested fields defining each item's structure
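A schema combining those three kinds of field might look like the sketch below, written as a Python dictionary with a small checker for the nesting rule. The `name` property, the field names and the checker are illustrative assumptions. Only `fields` and the four type names mentioned above come from this page, so the checker rejects any other type even if the API supports it.

```python
SIMPLE_TYPES = {"string", "datetime"}
NESTED_TYPES = {"object", "array"}


def check_fields(fields: list) -> None:
    """Raise ValueError if an object/array field lacks nested fields or a type is unknown."""
    for field in fields:
        kind = field["type"]
        if kind in NESTED_TYPES:
            if not field.get("fields"):
                raise ValueError(f"{field['name']}: {kind} fields need nested fields")
            check_fields(field["fields"])
        elif kind not in SIMPLE_TYPES:
            raise ValueError(f"{field['name']}: unsupported type {kind!r}")


article_schema = {
    "name": "Article",  # assumed property name
    "fields": [
        {"name": "title", "type": "string"},
        {"name": "published_at", "type": "datetime"},
        {"name": "author", "type": "object", "fields": [
            {"name": "full_name", "type": "string"},
        ]},
        {"name": "sections", "type": "array", "fields": [
            {"name": "heading", "type": "string"},
            {"name": "summary", "type": "string"},
        ]},
    ],
}

check_fields(article_schema["fields"])  # passes silently for a well-formed schema
```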

Schema Operations

Instead of defining schemas inline every time, they can be saved to your Gaffa account and be reused across multiple requests. This makes your actions more readable, easier to maintain, and ensures consistency when parsing similar content.

Creating a Saved Schema

Use the endpoint to create a reusable schema:

Response:

Save the id returned in the response. You'll use it to reference the schema in your requests.

Managing Schemas

List all schemas:

Allows you to view all schemas saved to your account:

Endpoint:

Update a schema:

Allows you to modify an existing schema by its ID:

Endpoint:

Delete a schema:

Removes a schema from your account:

Endpoint:

Common Schema Patterns

Simple List Extraction

Nested Objects

Pricing

The number of credits this action uses depends on the model used. Here are the currently supported models and their pricing:

Model | Input Token Cost | Output Token Cost
