2020-09-30 16:00:05 +02:00
# An Introduction to Python and Programming
2020-10-12 21:32:57 +02:00
This project is a *thorough* introductory course
in programming with ** [Python <img height="12" style="display: inline-block" src="static/link/to_py.png"> ](https://www.python.org/ )**.
2020-09-30 16:00:05 +02:00
2020-10-12 21:38:27 +02:00
2020-10-12 21:42:29 +02:00
### Table of Contents
2020-10-19 12:00:32 +02:00
The following is a high-level overview of the contents.
For a more *detailed version* with **clickable links**
see the [CONTENTS.md ](CONTENTS.md ) file.
2020-10-12 21:42:29 +02:00
2020-10-12 21:56:04 +02:00
- *Chapter 0*: Introduction
2020-10-13 12:08:58 +02:00
- **Part A: Expressing Logic**
- *Chapter 1*: Elements of a Program
2020-10-13 23:19:17 +02:00
- *Chapter 2*: Functions & Modularization
2020-10-15 15:29:58 +02:00
- *Chapter 3*: Conditionals & Exceptions
2020-10-15 18:12:43 +02:00
- *Chapter 4*: Recursion & Looping
2020-10-15 23:32:24 +02:00
- **Part B: Managing Data and Memory**
- *Chapter 5*: Numbers & Bits
2020-10-19 11:00:39 +02:00
- *Chapter 6*: Text & Bytes
2020-10-19 18:35:13 +02:00
- *Chapter 7*: Sequential Data
2020-10-21 16:58:20 +02:00
- *Chapter 8*: Map, Filter, & Reduce
2020-10-22 18:19:34 +02:00
- *Chapter 9*: Mappings & Sets
2024-04-08 16:53:41 +02:00
- *Chapter 10*: Arrays & Dataframes
2020-10-26 16:26:32 +01:00
- *Chapter 11*: Classes & Instances
2020-10-12 21:56:04 +02:00
2020-10-12 21:42:29 +02:00
2020-10-13 11:10:37 +02:00
#### Videos
2024-04-08 17:45:13 +02:00
Presentations of the chapters are available on this [YouTube playlist <img height="12" style="display: inline-block" src="static/link/to_yt.png"> ](https://www.youtube.com/playlist?list=PL-2JV1G3J10kRUPgP7EwLhyeN5lOZW2kH ).
2020-10-13 11:10:37 +02:00
The recordings are about 25 hours long in total
and were made in spring 2020
after a corresponding in-class Bachelor course was cancelled due to Corona.
2020-10-12 21:38:27 +02:00
### Objective
2020-09-30 16:00:05 +02:00
The **main goal** is to **prepare** students
2020-10-12 21:38:27 +02:00
for **further studies** in the "field" of **data science** ,
including but not limited to topics such as:
- algorithms & data structures
- data cleaning & wrangling
- data visualization
- data engineering (incl. SQL databases)
- data mining (incl. web scraping)
- linear algebra
- machine learning (incl. feature generation & deep learning)
- optimization & (meta-)heuristics (incl. management science & operations research)
- statistics & econometrics
- quantitative finance (e.g., option valuation)
- quantitative marketing (e.g., customer segmentation)
- quantitative supply chain management (e.g., forecasting)
- web development (incl. APIs)
2020-10-12 12:02:40 +02:00
### Prerequisites
To be suitable for *beginners* , there are *no* formal prerequisites.
It is only expected that the student has:
- a *solid* understanding of the **English** language,
- knowledge of **basic mathematics** from high school,
- the ability to **think conceptually** and **reason logically** , and
- the willingness to **invest** around **90-120 hours** on this course.
2020-10-12 15:21:39 +02:00
## Getting started
If you are a total beginner,
follow the instructions in the "Installation" section next.
If you are familiar with
the [git ](https://git-scm.com/ )
and [poetry ](https://python-poetry.org/docs/ ) command-line tools,
you may want to look at the "Alternative Installation" section further below.
### Installation
2024-04-08 17:45:13 +02:00
To follow this course, an installation of **Python 3.11** or higher is expected.
2020-10-12 15:21:39 +02:00
A popular and beginner friendly way is
2024-04-08 17:45:13 +02:00
to install the [Anaconda Distribution ](https://www.anaconda.com/download )
2020-10-12 15:21:39 +02:00
that not only ships Python itself
but also comes pre-packaged with a lot of third-party libraries.
< img src = "static/anaconda_download.png" width = "50%" >
2024-04-08 17:45:13 +02:00
Scroll down to the "Anaconda Installers" section
2020-10-12 15:21:39 +02:00
and install the latest version for your operating system
2024-04-08 17:45:13 +02:00
(i.e., *2024-02* with Python 3.11 at the time of this writing).
2020-10-12 15:21:39 +02:00
After installation,
2024-04-08 17:45:13 +02:00
you find an entry "[Anaconda Navigator](https://docs.anaconda.com/free/navigator/)"
2020-10-12 15:21:39 +02:00
in your start menu.
Click on it.
< img src = "static/anaconda_start_menu.png" width = "50%" >
A window opens giving you several options to start various applications.
In the beginning, we will work mostly with [JupyterLab ](https://jupyterlab.readthedocs.io/en/stable/ ).
Click on "Launch".
< img src = "static/anaconda_navigator.png" width = "50%" >
A new tab in your web browser opens:
The website is "localhost" and some number (e.g., 8888).
This is the [JupyterLab ](https://jupyterlab.readthedocs.io/en/stable/ ) application
that is used to display the course materials.
On the left, you see the files and folders on your computer.
This file browser works like any other.
In the center, you see several options to launch (i.e., "create") new files.
< img src = "static/jupyter_lab.png" width = "50%" >
To check if your Python installation works,
double-click on the "Python 3" tile under the "Notebook" section.
That opens a new [Jupyter notebook ](https://jupyter-notebook.readthedocs.io/en/stable/ )
named "Untitled.ipynb".
< img src = "static/jupyter_notebook_blank.png" width = "50%" >
Enter some basic Python in the **code cell** , for example, `1 + 2` .
Then, press the **Enter** key *while* holding down the **Control** key
(if that does not work, try with the **Shift** key)
to **execute** the snippet.
The result of the calculation, `3` in the example, shows up below the cell.
< img src = "static/jupyter_notebook_example.png" width = "50%" >
After setting up Python,
click on the green "Code" button on the top right on this website
to download the course materials.
As a beginner, choosing "Download ZIP" is likely the easiest option.
Then, unpack the ZIP file into a folder of your choice,
ideally somewhere within your personal user folder
so that the files show up right away in [JupyterLab ](https://jupyterlab.readthedocs.io/en/stable/ ).
< img src = "static/repo_download.png" width = "50%" >
2024-04-08 17:45:35 +02:00
### Alternative Installation (for Instructors using Linux)
2020-10-12 15:21:39 +02:00
Python can also be installed in a "pure" way
obtained directly from its core development team [here ](https://www.python.org/downloads/ ).
Then, it comes *without* any third-party packages,
which is *not* a problem at all.
Managing third-party packages can be automated to a large degree,
for example, with tools such as [poetry ](https://python-poetry.org/docs/ ).
However, this may be too "advanced" for a beginner
as it involves working with a [command-line interface <img height="12" style="display: inline-block" src="static/link/to_wiki.png"> ](https://en.wikipedia.org/wiki/Command-line_interface ) (CLI),
also called a **terminal** ,
which looks like the one below.
It is used *without* a mouse by typing commands into it.
The following instructions assume that
[git ](https://git-scm.com/ ), [poetry ](https://python-poetry.org/docs/ ),
and [pyenv ](https://github.com/pyenv/pyenv ) are installed.
< img src = "static/cli_install.png" width = "50%" align = "center" >
2020-10-15 15:16:11 +02:00
The screenshot above shows how this project can be set up in an alternative way
2020-10-12 15:21:39 +02:00
with the [zsh ](https://en.wikipedia.org/wiki/Z_shell ) CLI.
2020-10-15 15:16:11 +02:00
First, [git ](https://git-scm.com/ ) is used
2020-10-12 15:21:39 +02:00
to **clone** the course materials as a **repository**
into a new folder called "*intro-to-python*"
that lives under a "*repos*" folder.
- `git clone https://github.com/webartifex/intro-to-python.git`
The `cd` command is used to "change directories".
2020-10-15 15:16:11 +02:00
In the screenshot, [pyenv ](https://github.com/pyenv/pyenv ) is used
2020-10-12 15:21:39 +02:00
to set the project's Python version.
[pyenv ](https://github.com/pyenv/pyenv )'s purpose is
to manage *many* parallel Python installations on the same computer.
It is highly recommended for professional users;
however, any other way of installing Python works as well.
- `pyenv local ...`
2020-10-15 15:16:11 +02:00
On the contrary, [poetry ](https://python-poetry.org/docs/ )'s purpose is
2020-10-12 15:21:39 +02:00
to manage third-party packages within the *same* Python installation
and, more importantly, on a per-project basis.
So, for example,
whereas "Project A" may depend on [numpy ](https://numpy.org/ ) *v1.19*
from June 2020 be installed,
"Project B" may use *v1.14* from January 2018 instead
(cf., numpy's [release history ](https://pypi.org/project/numpy/#history )).
To achieve this per-project **isolation** ,
[poetry ](https://python-poetry.org/docs/ ) uses so-called **virtual environments**
behind the scenes.
While one could do that manually,
for example, by using Python's built-in
[venv <img height="12" style="display: inline-block" src="static/link/to_py.png"> ](https://docs.python.org/3/library/venv.html ) module,
it is more convenient and reliable to have [poetry ](https://python-poetry.org/docs/ )
automate this.
The following *one* command not only
creates a new virtual environment (manually: `python -m venv venv` )
and *activates* it (manually: `source venv/bin/activate` ),
it also installs the versions of the project's third-party dependencies
as specified in the [poetry.lock ](poetry.lock ) file
(manually: `python -m pip install -r requirements.txt`
if a [requirements.txt ](https://docs.python.org/3/tutorial/venv.html#managing-packages-with-pip )
file is used;
the `python -m` part is often left out [but should not be ](https://snarky.ca/why-you-should-use-python-m-pip/ )):
- `poetry install`
[poetry ](https://python-poetry.org/docs/ ) is also used
to execute commands in the project's (virtual) environment.
2020-10-15 15:16:11 +02:00
To do that, the command is prefixed with `poetry run ...` .
2020-10-12 22:47:50 +02:00
The project uses [nox ](https://nox.thea.codes/en/stable/ )
to manage various maintenance tasks.
After cloning the repository and setting up the virual environment,
it is recommended to run the initialization task.
That needs to be done only once.
- `poetry run nox -s init-project`
To do the equivalent of clicking "Launch" in the Anaconda Navigator:
2020-10-12 15:21:39 +02:00
- `poetry run jupyter lab`
This opens a new tab in your web browser just as above.
The command-line interface stays open in the background,
like in the screenshot below,
and prints log messages as we work in [JupyterLab ](https://jupyterlab.readthedocs.io/en/stable/ ).
2020-10-12 15:29:31 +02:00
< img src = "static/cli_jupyter_lab.png" width = "50%" align = "center" >
2020-10-12 21:40:46 +02:00
## Contributing
Feedback **is highly encouraged** and will be incorporated.
Open an issue in the [issues tracker <img height="12" style="display: inline-block" src="static/link/to_gh.png"> ](https://github.com/webartifex/intro-to-python/issues )
or initiate a [pull request <img height="12" style="display: inline-block" src="static/link/to_gh.png"> ](https://help.github.com/en/articles/about-pull-requests )
if you are familiar with the concept.
Simple issues that *anyone* can **help fix** are, for example,
**spelling mistakes** or **broken links** .
If you feel that some topic is missing entirely, you may also mention that.
The materials here are considered a **permanent work-in-progress** .
A "Show HN" post about this course was made on [Hacker News <img height="12" style="display: inline-block" src="static/link/to_hn.png"> ](https://news.ycombinator.com/item?id=22669084 )
and some ideas for improvement were discussed there.
2020-10-12 15:29:31 +02:00
## About the Author
Alexander Hess is a PhD student
at the Chair of Logistics Management at [WHU - Otto Beisheim School of Management ](https://www.whu.edu )
where he conducts research on urban delivery platforms
and teaches coding courses based on Python in the BSc and MBA programs.
2020-10-15 15:16:11 +02:00
Connect him on [LinkedIn ](https://www.linkedin.com/in/webartifex ).