An intro to data science for absolute beginners
Find a file
2024-07-15 12:13:33 +02:00
00_python_in_a_nutshell Run black on all the notebooks 2024-07-15 12:12:51 +02:00
01_scientific_stack Run black on all the notebooks 2024-07-15 12:12:51 +02:00
03_classification Run black on all the notebooks 2024-07-15 12:12:51 +02:00
static/link Rework the introduction on numpy 2021-10-05 01:27:41 +02:00
.gitignore Ignore poetry's .venv/ folder 2020-09-16 23:29:55 +02:00
LICENSE.txt Update the copyright year info 2021-05-23 18:19:28 +02:00
poetry.lock Add invoke to the dev dependencies 2024-07-15 12:12:49 +02:00
pyproject.toml Add invoke to the dev dependencies 2024-07-15 12:12:49 +02:00
README.md Add requirements.txt as an alternative to poetry.lock 2024-07-15 12:12:48 +02:00
requirements.txt Add requirements.txt as an alternative to poetry.lock 2024-07-15 12:12:48 +02:00
tasks.py Add tasks to install and remove kernels 2024-07-15 12:12:49 +02:00

An Introduction to Data Science

This project is an introductory workshop in Data Science in the programming language Python . To learn about Python and programming in detail, this introductory course is recommended.

Table of Contents

Objective

The main goal is to show students how Python can be used to solve typical data science tasks.

Prerequisites

To be suitable for beginners, there are no formal prerequisites. It is only expected that the student has:

  • a solid understanding of the English language and
  • knowledge of basic mathematics from high school.

Getting started & Installation

To follow this workshop, an installation of Python 3.9 or higher is expected.

A popular and beginner friendly way is to install the Anaconda Distribution that not only ships Python itself but also comes pre-packaged with a lot of third-party libraries including Python's scientific stack.

Detailed instructions can be found here .

If you are not using the Anaconda Distribution, you must install the third-party libraries via the command pip install -r requirements.txt (or something equivalent) before working with the notebook files.

Contributing

Feedback is highly encouraged and will be incorporated. Open an issue in the issues tracker or initiate a pull request if you are familiar with the concept. Simple issues that anyone can help fix are, for example, spelling mistakes or broken links. If you feel that some topic is missing entirely, you may also mention that. The materials here are considered a permanent work-in-progress.

About the Author

Alexander Hess is a PhD student at the Chair of Logistics Management at WHU - Otto Beisheim School of Management where he conducts research on urban delivery platforms and teaches coding courses based on Python in the BSc and MBA programs.

Connect with him on LinkedIn.