Web scraping and automation dictionary

Confused by terminology? Use our dictionary to quickly understand what you need to know.

22 articles
David Barton avatarNatasha Lekh avatar
Written by David Barton and Natasha Lekh

What is an API?

An application programming interface (API) makes it easy for computer programs to talk to each other
David Barton avatar
Written by David Barton. Updated over a week ago

What is a proxy?

Proxy servers act as a shield protecting your digital identity from the websites you visit
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is a residential proxy?

A residential proxy is the most reliable way to scrape data while using proxies.
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is a datacenter proxy?

A datacenter proxy is the fastest and most affordable way to scrape data while using proxies.
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is a SERP?

SERP is an abbreviation for Search Engine Result Pages
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is JavaScript?

JavaScript is the main programming language of the Internet
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is a session?

A session is reusing the same IP address during web scraping
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is a request?

A request is your command for an actor to open a specific webpage to further scrape it for data
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is a request queue?

A request queue is a list of web pages lined up for your actor to visit and scrape for data within one run.
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is key-value store?

A key-value store is a space for storing important files
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is Node.js?

Node.js is an open-source environment that executes JavaScript code without the need for a browser
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is JSON?

JSON is a format for storing and transporting data
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is a dataset?

A dataset is a table that acts as a log of your results from web scraping activities.
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is a web scraper?

A web scraper is a program made to extract data from the web
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is Apify SDK?

Apify SDK is your number one resource for creating your own Apify actor
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is web crawling?

Web crawling is a process of browsing webpages systematically
Natasha Lekh avatar
Written by Natasha Lekh. Updated over a week ago

What is web scraping?

Web scraping is extracting data and content from websites automatically
David Barton avatar
Written by David Barton. Updated over a week ago

What is RPA?

RPA stands for robotic process automation
David Barton avatar
Written by David Barton. Updated over a week ago

What is Puppeteer?

Puppeteer is a way for you to control the Chrome browser using JavaScript
David Barton avatar
Written by David Barton. Updated over a week ago

What is Cheerio?

Cheerio is an implementation of jQuery designed specifically for servers
David Barton avatar
Written by David Barton. Updated over a week ago

What is a headless browser?

A headless browser is a web browser with no graphical user interface
David Barton avatar
Written by David Barton. Updated over a week ago

What is an Actor?

An Apify Actor is a program that can carry out any task in a web browser
David Barton avatar
Written by David Barton. Updated this week