Data science

Day 27: Enhancement

December 3, 2024 | OSM

On Day 26, we extended the comparative error analysis to the original, 12-by-12 strategy and showed how results were similar to the unadjusted strategy relative to the adjusted one. The main observation that emerged was that the adjusted strategy pe...
[...Read more...]

Scraping and Not Modified Responses

December 3, 2024 | Python - datawookie

In a previous post I looked at the HTTP request headers used to manage browser caching. In this post I’ll look at a real world example. It’s a rather deep dive into something that’s actually quite simple. However, I find it helpful for my understanding to pick ...
[...Read more...]

Understanding Why Sklearn PCA Differs from Scratch Implementations

December 2, 2024 | Andrea Rekasi

Data scientists often face a puzzling challenge. Their carefully crafted Principal Component Analysis (PCA) implementation gives different results compared to scikit-learn’s version. These differences create confusion and make debugging harder, even when both implementations appear mathematically sound. PCA is a significant dimensionality reduction technique in machine learning that helps ...
[...Read more...]

Day 26: Adjusted vs. Original

December 2, 2024 | OSM

The last five days! On Day 25, we compared the peformance of the adjusted vs. unadjusted strategy for different prediction scenarios: true and false positives and negatives. For true positives and false negatives, the adjusted strategy performed bet...
[...Read more...]

How to calculate Z-Scores in Python

November 28, 2024 | Ponne, Bruno

If you’ve worked with statistical data, you’ve likely encountered z-scores. A z-score measures how far a data point is from the mean, expressed in terms of standard deviations. It helps identify outliers and compare data distributions, making it a vital tool in data science. In this guide, we’...
[...Read more...]

Big-Scale Data Dashboards With Observable Framework

November 27, 2024 | Filip Stachura

If your organization is looking to build dynamic and interactive dashboards with a strong emphasis on performance or large scale, the Observable Framework could be an ideal fit. This article introduces the framework, its core features, and real-world use cases to illustrate how it can help your data science team ...
[...Read more...]
1 17 18 19 20 21 145