Create notebook for the fourth application of tidying
This commit is contained in:
parent
442a541ad5
commit
d18f7133a8
3 changed files with 5001 additions and 2 deletions
|
|
@ -28,7 +28,7 @@
|
|||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"2018-08-26 00:55:18 CEST\n",
|
||||
"2018-08-26 14:39:56 CEST\n",
|
||||
"\n",
|
||||
"CPython 3.6.5\n",
|
||||
"IPython 6.5.0\n",
|
||||
|
|
@ -1026,7 +1026,7 @@
|
|||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Tidy Data\n",
|
||||
"### \"Tidy\" Data\n",
|
||||
"\n",
|
||||
"As before the *pd.melt* function is used to transform the data from \"wide\" to \"long\" form."
|
||||
]
|
||||
|
|
@ -1079,6 +1079,13 @@
|
|||
")"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Note that this dataset is not yet fully tidy as will be explained in notebook No. 4."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 16,
|
||||
|
|
@ -1313,6 +1320,24 @@
|
|||
"source": [
|
||||
"molten_billboard.head(15)"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Save the Data\n",
|
||||
"\n",
|
||||
"The above \"tidy\" billboard dataset is saved as input for notebook No. 4."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 17,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"molten_billboard.to_csv('data/billboard_cleaned.csv', index=False)"
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue