Category Archives: media

Cleaning Transcript Data with Python

Performing an analysis of text data or using text data to train machine learning models oftentimes requires a lot of data. Usually people look to Wikipedia for large amounts of text data, but occasionally scholars will make use of less traditional sources of data, like movie reviews for performing sentiment analysis on sentences or Ubuntu IRC chat… Read More »

How-to: Scrape Data with Python’s BeautifulSoup

Hello. In this post I’ll show how to scrape semi-structured data from a target webpage with Python’s BeautifulSoup module. BeautifulSoup is indeed beautiful. It is the go-to package for scraping data and working with HTML. We’ll also use requests to grab the HTML from the target URL. The page I use in my example should… Read More »
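
As a rough illustration of the workflow this excerpt describes, here is a minimal sketch, not the post's actual code. The helper names `fetch_html` and `extract_rows` and the sample table are hypothetical, and it assumes the `beautifulsoup4` and `requests` packages are installed.

```python
from bs4 import BeautifulSoup


def fetch_html(url, timeout=10):
    """Download a page and return its HTML text (hypothetical helper)."""
    import requests  # imported lazily so the parsing helper also works offline

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # fail loudly on 4xx/5xx responses
    return response.text


def extract_rows(html):
    """Return every non-empty table row as a list of cell strings."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for tr in soup.find_all("tr"):
        cells = [cell.get_text(strip=True) for cell in tr.find_all(["th", "td"])]
        if cells:
            rows.append(cells)
    return rows


# Made-up sample markup standing in for a real target page.
SAMPLE_HTML = """
<table>
  <tr><th>Title</th><th>Year</th></tr>
  <tr><td>Example A</td><td>2015</td></tr>
  <tr><td>Example B</td><td>2016</td></tr>
</table>
"""

rows = extract_rows(SAMPLE_HTML)
for row in rows:
    print(row)
```

For a live page, the same parser could be fed with `extract_rows(fetch_html(url))`, where `url` is whatever target page you choose.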

The Raspberry Pi 2 as a Dedicated VPN (round 2)

Previously, I had tested the Pi Zero as a dedicated VPN. And while that was a great way to securely connect to my home network and access files, the Zero definitely struggled with some tasks (like streaming media). Raspberry Pis are ideal for 24/7 use because they barely draw a single watt, compared to desktop computers where the… Read More »

Raspberry Pi + Plex = RasPlex

I’m going to share with you a little gem I discovered recently that relieved me of all my digital media player headaches. That gem is, of course, RasPlex – a Linux-based distribution that is optimized to run Plex client software on the Raspberry Pi, a single-board computer. For the layman, this is more or… Read More »

Can the Raspberry Pi 2 be a HTPC?

After having flipped through several Linux and Raspberry Pi magazines, it became apparent to me that there is no shortage of things to do with a Pi. While certain projects approach what some might consider gimmicky (a portable briefcase retro gaming kit), others have recognizable utility that will only become more apparent with time, like a… Read More »