A case study on predicting house prices in Ames, Iowa
Find a file
2021-05-25 08:00:15 +02:00
data Update data files due to rounding 2020-06-29 01:09:58 +02:00
static Update the overall project info 2021-05-25 07:55:52 +02:00
.gitignore Update .gitignore for pyenv 2020-06-29 01:07:37 +02:00
1_data_cleaning.ipynb Update rendered versions of the notebooks 2020-06-29 01:10:19 +02:00
2_pairwise_correlations.ipynb Update rendered versions of the notebooks 2020-06-29 01:10:19 +02:00
3_descriptive_visualizations.ipynb Update rendered versions of the notebooks 2020-06-29 01:10:19 +02:00
4_predictive_models.ipynb Update rendered versions of the notebooks 2020-06-29 01:10:19 +02:00
LICENSE.txt Add txt extension and update copyright year 2021-05-25 07:37:37 +02:00
poetry.lock Add pylint to the dev dependencies 2021-05-25 08:00:15 +02:00
pyproject.toml Add pylint to the dev dependencies 2021-05-25 08:00:15 +02:00
README.md Update the overall project info 2021-05-25 07:55:52 +02:00
requirements.txt Add requirements.txt 2020-06-29 01:08:40 +02:00
utils.py Add a simple version of predictive models 2018-09-05 22:30:42 +02:00

Ames Housing

This repository is a case study of applying various machine learning models to the problem of predicting house prices.

The dataset is publicly available and can be downloaded, for example, at Kaggle.

The case study is based on this research paper.

A video presentation of the case study is available on YouTube .

Table of Contents

The analyses are presented in four notebooks that may be interactively worked with by following these links:

Objective

The main goal is to show students how Python can be used to solve a typical data science task.

Prerequisites

To be suitable for beginners, there are no formal prerequisites. It is only expected that the student has:

  • a solid understanding of the English language and
  • knowledge of basic mathematics from high school.

Some background knowledge in Python is still helpful. To learn about Python and programming in detail, this introductory course is recommended.

Getting started & Installation

To follow this workshop, an installation of Python 3.8 or higher is expected.

A popular and beginner friendly way is to install the Anaconda Distribution that not only ships Python itself but also comes pre-packaged with a lot of third-party libraries including Python's scientific stack.

Detailed instructions can be found here .

Contributing

Feedback is highly encouraged and will be incorporated. Open an issue in the issues tracker or initiate a pull request if you are familiar with the concept. Simple issues that anyone can help fix are, for example, spelling mistakes or broken links. If you feel that some topic is missing entirely, you may also mention that. The materials here are considered a permanent work-in-progress.

About the Author

Alexander Hess is a PhD student at the Chair of Logistics Management at WHU - Otto Beisheim School of Management where he conducts research on urban delivery platforms and teaches coding courses based on Python in the BSc and MBA programs.

Connect with him on LinkedIn.