A tabular dataset is arranged in two dimensions -- what in Excel we would call rows and columns, but a fancier statistical way of saying this is observations and variables. …
Continue Reading about Variables vary… and why that matters →
Teaching analytics with modern Excel
posted on
A tabular dataset is arranged in two dimensions -- what in Excel we would call rows and columns, but a fancier statistical way of saying this is observations and variables. …
Continue Reading about Variables vary… and why that matters →
posted on
Anscombe's quartet is a well-known statistical parlor trick with a powerful message. Fortunately, with the help of the seaborn package as well as pandas, it can be demonstrated in Python with just a …
Continue Reading about How to visualize Anscombe’s quartet in Python →
posted on
Don't listen to the haters. Excel is a fantastic way to learn data analytics and a valued slice of the data analytics stack. That said, you may need to make a couple of tweaks from Excel's "default …
Continue Reading about Every dataset in Excel should include these two features →
posted on
I'm a big believer that data analysts will derive far more value from their tools when they understand the underlying philosophy and worldview of those tools. For example, the way that open source …
Continue Reading about Here’s how R and Python think differently about data →
posted on
I like to step off the beaten tutorial path with blog posts. In fact, I much prefer posts that explore the culture and philosophy behind being a data analyst rather than pure "how-to" posts where I …
Continue Reading about I stopped speaking at meetups… and when you should too →
posted on
For a long time, many data analysts worked firmly ensconsed in the Microsoft stack -- definitely Excel, maybe some Access, PowerPoint or VBA serving as the automative glue. If it wasn't Microsoft, it …
Continue Reading about What data analysts should know about open source →