Commit f61b3a96 by Danyel Fisher

Updates to readme files

parent 8293554d
......@@ -3,3 +3,4 @@ node_modules/**
png/**
svg/**
scripts/*.bash
Chapters6and7/figures for book.zip
The ipython notebooks in this section are based on the versions of the IMDB data that was attainable by FTP as of mid-July 2017.
Since then, IMDB has changed to distributing this data via Amazon Web Services. I have not yet had an opportunity to update the notebooks. IMDB's license does not allow for redistribution of their data. My apologies for the rotten bits. -DAF
Since then, IMDB has changed to distributing this data via Amazon Web Services. I have not yet had an opportunity to update the notebooks. IMDB's license does not allow for redistribution of their data. My apologies for the rotten bits. -DAF 12/31/2017
http://www.imdb.com/interfaces/
https://s3-ap-southeast-2.amazonaws.com/scico-labs/docs/lab-jupyter-aws.pdf
# Vis examples
# Chapter 6 & 7 Visualization Examples
## Consumer complaints
12/31/2017: We are working on a website, http://makingdatavisual.github.io, which contains executable examples of the Chapter 6 & 7 figures. Check it out!
```bash
csvsql --db sqlite:///complaints.db Consumer_Complaints.csv -y 1000 --insert
```
The [data](data) subdirectory contains original CSV or JSON formats of the data source files; the readme in that folder describes where the files come from.
```sql
select "Date received", "Product", "Sub-product", "Company", "State", "ZIP code", "Consumer consent provided?", "Submitted via", "Company response to consumer", "Timely response?", "Consumer disputed?"
from Consumer_Complaints
where ("Date received" like '2012%' or "Date received" like '2013%' or "Date received" like '2014%' or "Date received" like '2015%' or "Date received" like '2016%') and ("Product" = 'Mortgage' or "Product" = 'Credit reporting' or "Product" = 'Debt collection' or "Product" = 'Credit card' or "Product" like 'Bank account%')
order by random()
limit 20000;
```
If you download this entire directory and run a local webserver, the index.html should correctly render the visualizations correctly.
```bash
sqlite3 complaints.db -header -csv < query.sql > complaints.cs
```
## CO2 and Temperature data
https://www.ncdc.noaa.gov/cag/time-series/global/globe/land_ocean/ytd/12/1880-2017
http://data.okfn.org/data/core/co2-ppm
```node http-server```
or
```python -m SimpleHTTPServer```
\ No newline at end of file
# Getting the data
These datasets come from a number of different places. These are some of our notes on reproducing our figures with original data from online sources.
## Automobile dataset
https://archive.ics.uci.edu/ml/datasets/Automobile
## Les Miserables
Knuth's "Les Miserables" dataset came from http://www-cs-staff.stanford.edu/~knuth/sgb.html and has been translated into a useful JSON format at
https://bost.ocks.org/mike/miserables/miserables.json
For the figures in the book, we truncated to the first 30 or so names
## Consumer complaints
The original source of the data is from https://catalog.data.gov/dataset/consumer-complaint-database
```bash
csvsql --db sqlite:///complaints.db Consumer_Complaints.csv -y 1000 --insert
```
```sql
select "Date received", "Product", "Sub-product", "Company", "State", "ZIP code", "Consumer consent provided?", "Submitted via", "Company response to consumer", "Timely response?", "Consumer disputed?"
from Consumer_Complaints
where ("Date received" like '2012%' or "Date received" like '2013%' or "Date received" like '2014%' or "Date received" like '2015%' or "Date received" like '2016%') and ("Product" = 'Mortgage' or "Product" = 'Credit reporting' or "Product" = 'Debt collection' or "Product" = 'Credit card' or "Product" like 'Bank account%')
order by random()
limit 20000;
```
```bash
sqlite3 complaints.db -header -csv < query.sql > complaints.cs
```
## CO2 and Temperature data
https://www.ncdc.noaa.gov/cag/time-series/global/globe/land_ocean/ytd/12/1880-2017
http://data.okfn.org/data/core/co2-ppm
## Income data
For images in the book, we trimmed some income levels and some unrendered regions (e.g. Puerto Rico)
\ No newline at end of file
......@@ -119,7 +119,7 @@
"fill": "hurricanes",
"orient": "none",
"padding": 20,
"title": "hurricanes per state",
"title": "Hurricanes per state",
"encode": {
"legend": {
"update": {"x": {"value": 320},"y": {"value": 420}}
......
## Example files for the title:
# Making Data Visual, by Miriah Meyer
# Making Data Visual, by Danyel Fisher and Miriah Meyer
[![Making Data Visual, by Miriah Meyer](http://akamaicovers.oreilly.com/images/9781491928400/cat.gif)](https://www.safaribooksonline.com/library/view/title/9781491960493//)
[![Making Data Visual](http://akamaicovers.oreilly.com/images/9781491928400/cat.gif)](https://www.safaribooksonline.com/library/view/title/9781491960493//)
This section contains example files for the book "Making Data Visual".
* [Chapter 3](Chapter3) contains iPython notebooks that carry out the "IMDB" example
* [Chapters 6 and 7](Chapters6and7) contains [Vega and Vega-Lite](http://vega.github.io) scripts that generate all figures in Chapter 6, and the interactive multi-views in Chapter 7.
* [Chapter9CaseStudy](Chapter9CaseStudy) contains the videos for Chapter 9 of the book that show how the Multeesum prototyped worked; the videos are in .mov format.
12/31/2017: We are working on a website, http://makingdatavisual.github.io, which contains executable examples of the Chapter 6 & 7 figures. Check it out!
## Fine Print
The following applies to example files from material published by O’Reilly Media, Inc. Content from other publishers may include different rules of usage. Please refer to any additional usage rights explained in the actual example files or refer to the publisher’s website.
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment