You might have heard of the Advent of Code,
a 25-day challenge involving a programming puzzle a day, to be solved
with the language of your choice. I’ve noted the popularity of this
activity in my Twitter timeline but also in my GitHub timeline where
I’ve seen the creation of a few advent-of-code or so repositories.
AoC is largely an exercise in figuring how to write your favourite language as if were C or C++ 😁, which can be fun ... in moderation
If I were to participate one year, I’d probably use R. Jenny Bryan’s
tweet above inspired me to try and gauge the popularity of languages
used in the Advent of Code. To do that, in this post, I shall use the
search endpoint of GitHub V3 API to identify Advent of Code 2018 repos.
It’s October, time for spooky Twitter names! If you’re on this social
media platform, you might have noticed some of your friends switching
their names to something spooky and punny. Last year I was “Maelstrom
Salmon”, which I find scary but is arguably not that funny. Anyhow, what
if you want to switch your name but have no inspiration? In this post,
we shall explore R’s abilities to help us with that with the help of
webscraping, phonetic spelling and string distance algorithms, and the
magic of randomness!
What can a kaka, a kakapo, an European rabbit and a grey heron have in
common? Well, they might co-habit in the bookshelf of an R user, since
they’re all animals on the covers of popular R books: “R
Packages”, “R for Data
Science”, “Text mining with
R” and “Efficient R
Their publisher, O’Reilly, has now based its brand on covers featuring
beautiful gravures of animals.
Recently, while wondering what the name of R for Data Science bird was
again (I thought it was a kea!), I was thrilled to find the whole
O’Reilly menagerie, i.e. a list of
books and corresponding animals! The website also features a link to “A
short history of the O’Reilly
that was an amazing read. In it was noted that “The animals are in
trouble.”, with a few examples of endangered species. It inspired me to
actually try and assess the conservation status of O’Reilly animals
using responsible webscraping, taxonomic name resolving and IUCN Redlist
I was until recently subscribed to an email list, ALLSTAT, “A UK-based worldwide e-mail broadcast system for the statistical community, operated by ICSE for HEA Statistics.” created in 1998. That’s how I saw the ad for my previous job in Barcelona! Now, I dislike emails more and more so I unsubscribed, but I’d still check out the archives any time I need a job, since many messages are related to openings. Nowadays, I probably identify more as a research software engineer or data scientist than a statistician… which made me wonder, when did ALLSTAT start featuring data scientist jobs? How do their frequency compare to those of statisticians?
In this post, I’ll webscrape and analyse meta-data of ALLSTAT emails. It’ll also be the occasion for me to take the wonderful new polite package for a ride, that helps respectful webscraping!