Category Archives: Project

How-to: Scrape Data with Python’s BeautifulSoup

Hello. In this post I’ll show how to scrape semi-structured data from a target webpage with Python’s BeautifulSoup module. BeautifulSoup is indeed beautiful. It is the go-to package for scraping data and working with HTML. We’ll also use requests to grab the HTML from the target URL. The page I use in my example should… Read More »

100 Organic Google Search Results, milestone

Today marks the first day that got 100 Google search results in a single day (save for the 450 anomaly). Meaning 100 people searched something and clicked on this site’s search result, also known as an organic search result. Why is this important? Organic search results are considered one of the strongest indicators of website health… Read More »

How to Scrape Data from Webpages with Python’s Scrapy

In this post I’ll show how to gather unstructured information that exists on webpages using Python’s open source web crawling framework, Scrapy. Web crawlers have been around since the conception of the internet, in fact Google started out by visiting links from Stanford’s homepage until all 10 million of them had been explored. In the… Read More »

The Art of the Sub $200 NAS Build

For decades building your own computer has been a central experience among computer enthusiasts. There’s an indescribable satisfaction that comes with picking out the parts and assembling them into a full computer all by yourself. That same satisfaction extends itself when people decide to build their own network attached storage (NAS) if they’re not simply repurposing… Read More »

How to Build a Budget NAS Machine

Intel build AMD build Other parts, operating systems, and more reading Bonus build A small bit on RAID The failure debate Conclusion Similar to how building your own computer can be cheaper, building (or repurposing) a network attached storage (NAS) can save you a few hundred dollars as well. A NAS machine, or a computer… Read More »

6 Fascinating Distributed Computing Projects

In this post, I’m going to cover some scientific distributed computing projects coordinated through the BOINC and @home distributed networks. For an introduction to what distributed computing is, read this post and maybe the Wikipedia page. Essentially, the BOINC software sends your computer work units to complete, which are sent back to headquarters and combined… Read More »

My Quest for a Quieter Cooler

Introduction Whether you have an Intel or an AMD processor the fact remains that most stock coolers suck (not just air). They come with low quality pre-applied thermal paste, they’re noisy, and often times they just don’t dissipate heat well enough. For these reasons, I decided to upgrade my cooler. It’s not an expensive upgrade… Read More »