We are pleased to announce the full line-up for this year’s Shiny in Production conference! Don’t miss out on this excellent set of talks and workshops - head over to the conference we...
Ahh, blogging. I think we can all agree it’s probably one of the greatest forms of written communication to have ever existed.
What’s that you say? You’d like to set up your own blog? And you say you want to use a dead simple, data science friendl...
Clustering: Clustering is one of the most popular applications of machine learning, and it is the most common unsupervised learning technique. Clustering usually relies on a distance metric, which defines how close things are to each other. The most popular distance metric, by ...
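To make the idea of a distance metric concrete, here is a minimal sketch in plain Python. Euclidean and Manhattan distance are used purely as illustrative choices, and the function names and sample points are assumptions made for this example, not taken from the post.

```python
import math


def euclidean(a, b):
    """Straight-line distance between two points of equal dimension."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of absolute coordinate differences (city-block distance)."""
    return sum(abs(x - y) for x, y in zip(a, b))


# Hypothetical sample points, chosen only for illustration.
p = (0.0, 0.0)
q = (3.0, 4.0)

print(euclidean(p, q))  # 5.0
print(manhattan(p, q))  # 7.0
```

The same pair of points is "closer" or "farther" depending on the metric, which is why the choice of metric can change which cluster a point ends up in.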
The Royal Statistical Society International conference is next week from 4-7 September 2023, hosted in Harrogate. Jumping Rivers are exhibiting at the conference, as well as delivering workshops a...
In a previous post I explored some “quick wins” for wrangling and analyzing data in Python that would be otherwise difficult to do in Excel. That post largely avoided data visualization, so this post will be on a similar topic — visualizations that would be difficult to build in regular Excel ...
Enterprise Data Warehouse – Definition: An enterprise data warehouse is a centralized digital repository. It gathers, polishes, and stores vast amounts of data from every department of an enterprise. With a data warehouse, all the data is right there, always ready for analysis. So, instead of rummaging through multiple, disjointed databases ...
As a long-time Python enthusiast who has frequently recommended that Excel users delve into Python (and even authored a book on the subject), I often hear one question from hesitant Excel users: What can I achieve in Python that I can’t in Excel? With the recent integration of Python ...
Look what touched down in my mailbox today… 🏈 _Football Analytics with Python & R_ by Eric Eager and Richard Erickson. I had the pleasure to be a tech reviewer for this book and recommend it to anyone interested in, as the subtitle suggests, learning data science through the lens of sports. ...
I’m excited to share that “pandas Analytics for Excel users,” a course I produced with Madecraft, is now available on LinkedIn Learning. View the course with a one-month free trial here. Also be sure to check your local library, university or workplace for access. Course description: Python is one ...