Data science

Including ggplot2 Plots in Python Notebooks

September 25, 2023 | jmount

For an article on A/B testing that I am preparing, I asked my partner Dr. Nina Zumel if she could do me a favor and write some code to produce the diagrams. She prepared an excellent parameterized diagram generator. However, being the author of the book Practical Data Science ...

Live from London! Global Excel Summit 2024

September 21, 2023 | George Mount

After a pandemic-induced hiatus and a move to a fully virtual format, the Global Excel Summit is BACK in hybrid format in February 2024, live from London, UK. I’m super excited to participate in person at this year’s event. Would you like to join me? Use promo code GEORGE at checkout to get 10% ...

XGBoost for Regression in Python

September 17, 2023 | Matt Bowers

In this post I’m going to show you my process for solving regression problems with XGBoost in Python, using either the native xgboost API or the scikit-learn interface. This is a powerful methodology that can produce world-class results in a short...

Schemas for Python Data Frames

September 12, 2023 | John Mount

The Pandas data frame is probably the most popular tool used to model tabular data in Python. For in-memory data, Pandas serves a role that might normally fall to a relational database. However, Pandas data frames are typically manipulated through methods rather than with a relational query language. One can […]

Asynchronous SqlAlchemy and multiple databases

September 12, 2023 | Mateusz Głowiński

In today's rapidly evolving technological landscape, the need to handle multiple databases is more pressing than ever. Whether the goal is to enhance redundancy, optimize performance, test the correctness of a new solution, or cater to varied data storage requirements, employing multiple databases can offer significant advantages.