Extract thousands of Google Results into your Google Sheet


Scraping, i.e. the extraction of data from websites, is one of the most widely used techniques on the web. The uses are innumerable: competitor analysis, SEO positioning, content generation, influencer tracking, etc. Once the data is extracted, you have to process it, and for that, nothing beats our good old spreadsheet… In […]

Read More…

Useful regex rules to clean up your imported data


Numbers
Transformation              Regex rule              Replacement
10,94 -> 10.94              \,                      .
10.94 -> 10,94              \.                      ,
10,999,999 -> 10999999      \d*

Currencies
Transformation              Regex rule              Replacement
180 EUR -> 180              \d+
$180 -> 180                 \d+
$10.94 -> 10,94             \$(.*?)(\.|$)(.*)       $1,$200
€10,94 -> 10.94             \€(.*?)(,|$)(.*)        $1.$200

Dates
Transformation                      Regex rule              Replacement
month/day/year -> day/month/year    (\d*)\/(\d*)\/(\d*)     $2/$1/$3
day/month/year -> […]

Read More…

Google Sheets: How to organize your data correctly


In the corporate world, spreadsheets are everywhere and used, to varying degrees, by everyone. In my experience as an analyst, I have been able to study why most people have trouble extracting meaningful information, and it all comes down to a lack of clean-up and organization of their datasets. There are dos and […]

Read More…

Google Sheets: The few shortcuts you absolutely need


That said, I have no idea why Google Sheets developers make us juggle between Ctrl, ⌘, ⌥ and ⇧, which doesn’t make the combinations easy to remember. However, it doesn’t take long before the combinations become reflexes. As a side note: these shortcuts apply to US keyboards; however, I show you […]

Read More…

Are Grand Slam tournaments for left-handed players?


Rafael Nadal is perhaps the most famous left-handed player on the courts right now. But the competition from right-handers is fierce: Roger Federer and Novak Djokovic, without doubt among the most formidable opponents, are right-handed. In this little exercise we will use Google Sheets and the add-on function IMPORTFROMWEB() to see who is […]

Read More…

Find an XPath with little HTML knowledge


XPath is a syntax for selecting nodes in an XML document. Because an HTML page has the same tree structure as an XML document, knowing the basics of XPath is a requirement for scraping content from an HTML page. This means that we can use an XPath to retrieve any element of an HTML page. Let’s take a basic example of an HTML […]

Read More…