Random Forest Classification with Python

March 31, 2019 | Dr. Darrin

Random forest is a type of machine learning algorithm in which the algorithm makes multiple decision trees that may use different features and subsample to making as many trees as you specify. The trees then vote to determine the class of an example. This approach helps to deal with the ...
[...Read more...]

Top 8 Docker Images for Data Science

March 1, 2019 | Nagdev Amruthnath

Dockerizing Data Science: Introduction PreReqs: Docker, images, and containers Dockerizing data science packages have become more relevant these days mainly because you can isolate your data science projects without breaking anything. Dockerizing data science projects also make most of your projects portable and sharable and not worrying about installing right ... [...Read more...]

Top 8 Docker Images for Data Science

March 1, 2019 | Nagdev

Dockerizing Data Science: Introduction PreReqs: Docker, images, and containers Dockerizing data science packages have become more relevant these days mainly because you can isolate your data science projects without breaking anything. Dockerizing data science projects also make most of your projects portable and sharable and not worrying about installing right ... [...Read more...]

Gartner’s 2019 Take on Data Science Software

February 26, 2019 | Bob Muenchen

I've just updated The Popularity of Data Science Software to reflect my take on Gartner's 2019 report, Magic Quadrant for Data Science and Machine Learning Platforms. To save you the trouble of digging through all 40+ pages of my report, here's just the updated section: Continue reading →
[...Read more...]

Data Exploration Case Study: Credit Default

February 21, 2019 | Dr. Darrin

Exploratory data analysis is the main task of a Data Scientist with as much as 60% of their time being devoted to this task. As such, the majority of their time is spent on something that is rather boring compared to building models. This post will provide a simple example of ...
[...Read more...]

RANSAC Regression in Python

February 7, 2019 | Dr. Darrin

RANSAC is an acronym for Random Sample Consensus. What this algorithm does is fit a regression model on a subset of data that the algorithm judges as inliers while removing outliers. This naturally improves the fit of the model due to the removal of some data points. The process that ...
[...Read more...]
1 41 42 43 44 45