Website and RSS feed Python scraping

Oct 18, 2022 8 minutes

OUR EXPERT

Matt Holder has worked in IT support for over a decade, and is keen to utilise Linux alongside other installed systems.

Before we begin, a word of warning. Web scraping can be viewed as a negative endeavour – even a type of hacking – if what you’re trying to do is take somebody else’s intellectual property. When taking the ideas in this article further, be sure to consider any legal implications.

The web scraping that we’ll carry out in this article uses purely fictional data, so this won’t be a problem. In essence, web scraping is a way to take information from a rendered web page, store it in variables, lists and other data types, and then use it for another purpose.

Okay, let’s get on to the article proper. In this tutorial we’ll be using Python and the lxml and Beautiful Soup modules to scrape information from a website. As previously stated, the data used is purely fictional, but once the concepts have been learned they can be applied to many different purposes. Once we’ve scraped data from the web page, we’ll use it to calculate some statistics.
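To give a flavour of where we’re heading, here is a minimal sketch of the scrape-then-calculate idea. It is not the code from this tutorial: the HTML snippet, the scores table and the scrape_scores() function are all made up for illustration, and the page is held in a string rather than downloaded. It assumes Beautiful Soup is installed (the bs4 package) and uses Python’s built-in html.parser; if lxml is installed, passing "lxml" as the parser should work in the same way.

```python
from statistics import mean

from bs4 import BeautifulSoup

# Hypothetical, purely fictional page content. A real script would
# download the HTML first; here it is a string to keep the sketch
# self-contained.
HTML = """
<html><body>
  <table id="scores">
    <tr><th>Player</th><th>Score</th></tr>
    <tr><td>Alice</td><td>42</td></tr>
    <tr><td>Bob</td><td>17</td></tr>
    <tr><td>Carol</td><td>31</td></tr>
  </table>
</body></html>
"""


def scrape_scores(html, parser="html.parser"):
    """Return a list of (player, score) tuples from the fictional table."""
    soup = BeautifulSoup(html, parser)
    table = soup.find("table", id="scores")
    results = []
    for row in table.find_all("tr"):
        cells = row.find_all("td")
        if len(cells) != 2:
            # The header row uses <th> cells, so it has no <td> cells.
            continue
        player = cells[0].get_text(strip=True)
        score = int(cells[1].get_text(strip=True))
        results.append((player, score))
    return results


# Store the scraped data in a list, then calculate some statistics.
scores = scrape_scores(HTML)
values = [score for _, score in scores]
print(f"Highest: {max(values)}, lowest: {min(values)}, mean: {mean(values)}")
```

The pattern is the same however large the page gets: parse the HTML, pick out the elements of interest, convert their text into Python data types, and only then do the sums.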

X marks the spot

The first concept to
