An intro to Python & programming for wanna-be data scientists
Find a file
Alexander Hess e24cf31104 Restructure *.ipynb files
- split review and exercises into files on their own
- update the contents overviews to include links to
  reviews and exercises
2020-01-28 18:36:36 +01:00
sample_package Add the sample_package folder to the project 2019-09-22 20:26:58 +02:00
static Add initial version of notebook 08 2019-11-20 11:01:43 +01:00
.gitignore Add a simple .gitignore file 2019-09-16 00:14:56 +02:00
00_start_up_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
00_start_up_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
00_start_up_20_exercises.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
01_elements_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
01_elements_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
01_elements_20_exercises.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
02_functions_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
02_functions_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
02_functions_20_exercises.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
03_conditionals_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
03_conditionals_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
03_conditionals_20_exercises.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
04_iteration_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
04_iteration_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
04_iteration_20_exercises.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
05_numbers_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
05_numbers_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
05_numbers_20_exercises.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
06_text_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
06_text_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
07_sequences_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
07_sequences_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
07_sequences_20_exercises.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
08_mappings_00_content.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
08_mappings_10_review.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
08_mappings_20_exercises.ipynb Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
LICENSE Initial commit 2019-09-15 12:20:18 +02:00
lorem_ipsum.txt Streamline previous content 2019-10-30 11:04:59 +01:00
poetry.lock Add numpy to the dependencies 2019-09-22 20:19:28 +02:00
pyproject.toml Bump version number 2019-11-20 11:05:04 +01:00
README.md Restructure *.ipynb files 2020-01-28 18:36:36 +01:00
requirements.txt Add numpy to the dependencies 2019-09-22 20:19:28 +02:00
sample_module.py Add sample_module.py to the project 2019-09-22 20:25:40 +02:00
stream.py Add exercise on packing/unpacking with functions 2019-11-20 11:01:19 +01:00

Important: The notebooks are being updated and amended throughout the spring semester of 2020!

An Introduction to Python and Programming

The purpose of this repository is to serve as an interactive "book" for a thorough introductory course on programming in the Python language.

The course's main goal is to prepare the student for further studies in the "field" of data science.

The "chapters" are written in Jupyter notebooks which are a de-facto standard for exchanging code and results among data science professionals and researchers. They can be viewed in a plain web browser with the help of nbviewer:

However, it is recommended that students install Python and Jupyter locally and run the code in the notebooks on their own. This way, the student can play with the code and learn more efficiently. Precise installation instructions are either in the 00th notebook or further below.

Feedback is encouraged and will be incorporated. Open an issue in the issues tracker or initiate a pull request if you are familiar with the concept.

Prerequisites

To be suitable for total beginners, there are no formal prerequisites. It is only expected that the student has:

  • a solid understanding of the English language,
  • knowledge of basic mathematics from high school,
  • the ability to think conceptually and reason logically, and
  • the willingness to invest 2-4 hours a day for a month.

Installation

To follow this course, a working installation of Python 3.7 or higher is expected.

A popular and beginner friendly way is to install the Anaconda Distribution that not only ships Python but comes pre-packaged with a lot of third-party libraries from the so-called "scientific stack". Just go to the download section and install the latest version (i.e., 2019-10 with Python 3.7 at the time of this writing) for your operating system.

Then, among others, you will find an entry "Jupyter Notebook" in your start menu like below. Click on it and a new tab in your web browser will open where you can switch between folders as you could in your computer's default file browser.

To download the course's materials as a ZIP file, click on the green "Clone or download" button on the top right on this website. Then, unpack the ZIP file into a folder of your choosing (ideally somewhere within your personal user folder so that the files show up right away).

Alternative Installation

Python can also be installed in a "pure" way as obtained from its core development team (i.e., without any third-party packages installed). However, this is somewhat too "advanced" for a beginner as it involves working with a terminal emulator, which looks similar to the picture below and is used without a mouse by typing commands into it.

Assuming that you already have a working version of Python 3.7 or higher installed (cf., the official download page), the following summarizes the commands to be typed into a terminal emulator to get the course materials up and running on a local machine without Anaconda. You are then responsible for understanding the concepts behind them.

First, the git command line tool is a more professional way of "cloning" the course materials as compared to downloading them in a ZIP file.

  • git clone https://github.com/webartifex/intro-to-python.git

This creates a new folder intro-to-python with all the materials of this repository in it.

Inside this folder, it is recommended to create a so-called virtual environment with Python's venv module. This must only be done the first time. A virtual environment is a way of isolating the third-party packages installed by different projects, which is considered a best practice.

  • python -m venv venv

The second venv is the environment's name and by convention often chosen to be venv. However, it could be another name as well.

From then on, each time you want to resume work, go back into the intro-to-python folder inside your terminal and "activate" the virtual environment (venv is the name chosen before).

  • source venv/bin/activate

This may change how the terminal's command prompt looks.

poetry and virtualenvwrapper are popular tools to automate the described management of virtual environments.

After activation for the first time, you must install the project's dependencies (= the third-party packages needed to run the code), most notably Jupyter in this project (the "python -m" is often left out but should not be; if you have poetry installed, you may just type poetry install instead).

  • python -m pip install -r requirements.txt

With everything installed, you can now do the equivalent of clicking the "Jupyter Notebook" entry in your start menu.

  • jupyter notebook

This opens a new tab in your web browser just as above.

About the Author

Alexander Hess is a PhD student at the Chair of Logistics Management at the WHU - Otto Beisheim School of Management where he conducts research on urban delivery platforms and teaches an introductory course on Python (cf., Fall Term 2019, Spring Term 2020).

Connect him on LinkedIn.