Top 8 Docker Images for Data Science

March 1, 2019 | Nagdev

Dockerizing Data Science: Introduction PreReqs: Docker, images, and containers Dockerizing data science packages have become more relevant these days mainly because you can isolate your data science projects without breaking anything. Dockerizing data science projects also make most of your projects portable and sharable and not worrying about installing right ... [...Read more...]

Gartner’s 2019 Take on Data Science Software

February 26, 2019 | Bob Muenchen

I've just updated The Popularity of Data Science Software to reflect my take on Gartner's 2019 report, Magic Quadrant for Data Science and Machine Learning Platforms. To save you the trouble of digging through all 40+ pages of my report, here's just the updated section: Continue reading →
[...Read more...]

Data Exploration Case Study: Credit Default

February 21, 2019 | Dr. Darrin

Exploratory data analysis is the main task of a Data Scientist with as much as 60% of their time being devoted to this task. As such, the majority of their time is spent on something that is rather boring compared to building models. This post will provide a simple example of ...
[...Read more...]

RANSAC Regression in Python

February 7, 2019 | Dr. Darrin

RANSAC is an acronym for Random Sample Consensus. What this algorithm does is fit a regression model on a subset of data that the algorithm judges as inliers while removing outliers. This naturally improves the fit of the model due to the removal of some data points. The process that ...
[...Read more...]

Combining Algorithms for Classification with Python

January 20, 2019 | Dr. Darrin

Many approaches in machine learning involve making many models that combine their strength and weaknesses to make more accuracy classification. Generally, when this is done it is the same algorithm being used. For example, random forest is simply many decision trees being developed. Even when bagging or boosting is being ...
[...Read more...]

Gradient Boosting Regression in Python

January 13, 2019 | Dr. Darrin

In this  post, we will take a look at gradient boosting for regression. Gradient boosting simply makes sequential models that try to explain any examples that had not been explained by previously models. This approach makes gradient boosting superior to AdaBoost. Regression trees are mostly commonly teamed with boosting. There ...
[...Read more...]

Gradient Boosting Classification in Python

January 8, 2019 | Dr. Darrin

Gradient Boosting is an alternative form of boosting to AdaBoost. Many consider gradient boosting to be a better performer than adaboost. Some differences between the two algorithms is that gradient boosting uses optimization for weight the estimators. Like adaboost, gradient boosting can be used for most algorithms but is commonly ...
[...Read more...]

AdaBoost Regression with Python

January 6, 2019 | Dr. Darrin

This post will share how to use the adaBoost algorithm for regression in Python. What boosting does is that it makes multiple models in a sequential manner. Each newer model tries to successful predict what older models struggled with. For regression, the average of the models are used for the ...
[...Read more...]

AdaBoost Classification in Python

January 1, 2019 | Dr. Darrin

Boosting is a technique in machine learning in which multiple models are developed sequentially. Each new model tries to successful predict what prior models were unable to do. The average for regression and majority vote for classification are used. For classification, boosting is commonly associated with decision trees. However, boosting ...
[...Read more...]

Recommendation Engine with Python

December 25, 2018 | Dr. Darrin

Recommendation engines make future suggestion to a person based on their prior behavior. There are several ways to develop recommendation engines but for purposes, we will be looking at the development of a user-based collaborative filter. This type of filter takes the ratings of others to suggest future items to ...
[...Read more...]
1 22 23 24 25 26