Commit 9d0aef9b authored by O'Reilly Media, Inc.

Initial commit

## Example files for the title:
# Data Analysis with Open Source Tools, by Philipp Janert
[![Data Analysis with Open Source Tools, by Philipp Janert](http://akamaicovers.oreilly.com/images/9780596802356/cat.gif)](https://www.safaribooksonline.com/library/view/title/9781449389802//)
The following applies to example files from material published by O’Reilly Media, Inc. Content from other publishers may include different rules of usage. Please refer to any additional usage rights explained in the actual example files or refer to the publisher’s website.
O'Reilly books are here to help you get your job done. In general, you may use the code in O'Reilly books in your programs and documentation. You do not need to contact us for permission unless you're reproducing a significant portion of the code. For example, writing a program that uses several chunks of code from our books does not require permission. Answering a question by citing our books and quoting example code does not require permission. On the other hand, selling or distributing a CD-ROM of examples from O'Reilly books does require permission. Incorporating a significant amount of example code from our books into your product's documentation does require permission.
We appreciate, but do not require, attribution. An attribution usually includes the title, author, publisher, and ISBN.
If you think your use of code examples falls outside fair use or the permission given here, feel free to contact us at <permissions@oreilly.com>.
Please note that the examples are not production code and have not been carefully tested. They are provided "as-is" and come with no warranty of any kind.
Chapter 02:
===========
Glass Identification:
http://archive.ics.uci.edu/ml/datasets/Glass+Identification
Chapter 03:
===========
Draft Lottery:
http://lib.stat.cmu.edu/DASL/Stories/DraftLottery.html
Marathon:
http://en.wikipedia.org/wiki/List_of_winners_of_the_Boston_Marathon
Sunspots:
http://www.ngdc.noaa.gov/stp/SOLAR/ftpsunspotnumber.html
Chapter 04:
===========
CO2:
http://robjhyndman.com/tsdldata/monthly/co2.dat
Furnace Exhaust:
http://robjhyndman.com/tsdldata/data/gas2.dat
Phone Call:
http://robjhyndman.com/tsdldata/data/9-13.dat
Air Traffic:
http://robjhyndman.com/tsdldata/data/airpass.dat
Chapter 05:
===========
Wine Quality:
http://archive.ics.uci.edu/ml/datasets/Wine+Quality
Chapter 06:
===========
CO2:
http://lib.stat.cmu.edu/datasets/visualizing.data.zip
Chapter 14:
===========
Wine Quality:
http://archive.ics.uci.edu/ml/datasets/Wine+Quality
Chapter 18:
===========
Iris:
http://archive.ics.uci.edu/ml/datasets/Iris
Teacher 66470 0.34047 0.340
Principal 22958 0.11759 0.458
Superintendent 12521 0.06413 0.522
Director 12202 0.06250 0.584
Secretary 4427 0.02267 0.607
Coordinator 3201 0.01639 0.623
Vice Principal 2771 0.01419 0.637
Program Director 1926 0.00986 0.647
Program Coordinator 1718 0.00880 0.656
Student 1596 0.00817 0.664
Consultant 1440 0.00737 0.672
Administrator 1169 0.00598 0.678
President 1114 0.00570 0.683
Program Manager 1063 0.00544 0.689
Supervisor 1009 0.00516 0.694
Professor 961 0.00492 0.699
Librarian 940 0.00481 0.704
Project Coordinator 880 0.00450 0.708
Project Director 866 0.00443 0.713
Office Manager 839 0.00429 0.717
Assistant Director 773 0.00395 0.721
Administrative Assistant 724 0.00370 0.725
Bookkeeper 697 0.00357 0.728
Intern 693 0.00354 0.732
Program Supervisor 602 0.00308 0.735
Lead Teacher 587 0.00300 0.738
Instructor 580 0.00297 0.741
Head Teacher 572 0.00292 0.744
Program Assistant 572 0.00292 0.747
Assistant Teacher 546 0.00279 0.749
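
The rows above appear to hold a label, a count, a fraction, and a running (cumulative) fraction. That column reading is inferred from the numbers; the file itself does not state it. Because labels such as "Vice Principal" contain spaces, a plain whitespace split would misalign the fields. A minimal Python sketch that splits from the right instead (the function name `parse_row` is hypothetical):

```python
def parse_row(line):
    """Split a row into (label, count, fraction, cumulative).

    Assumption: the last three whitespace-separated fields are numeric
    and everything before them is the label.
    """
    label, count, fraction, cumulative = line.rsplit(None, 3)
    return label, int(count), float(fraction), float(cumulative)


# Two rows copied from the listing above.
rows = [
    parse_row("Teacher 66470 0.34047 0.340"),
    parse_row("Vice Principal 2771 0.01419 0.637"),
]
for row in rows:
    print(row)
```

Splitting from the right keeps multi-word labels intact without needing a dedicated delimiter in the data file.
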
0.00498 -0.31232
0.00995 -0.24445
0.01493 -0.20398
0.01990 -0.13291
0.02488 -0.07582
0.02985 -0.01094
0.03483 -0.00865
0.03980 -0.00047
0.04478 0.06948
0.04975 0.07899
0.05473 0.08632
0.05970 0.09368
0.06468 0.10777
0.06965 0.14865
0.07463 0.15978
0.07960 0.17564
0.08458 0.19966
0.08955 0.27658
0.09453 0.31762
0.09950 0.33518
0.10448 0.33783
0.10945 0.37953
0.11443 0.38963
0.11940 0.39009
0.12438 0.39211
0.12935 0.42098
0.13433 0.42615
0.13930 0.43438
0.14428 0.45380
0.14925 0.45479
0.15423 0.47143
0.15920 0.48250
0.16418 0.48324
0.16915 0.49544
0.17413 0.50368
0.17910 0.53416
0.18408 0.54620
0.18905 0.55380
0.19403 0.55816
0.19900 0.57019
0.20398 0.57120
0.20896 0.57473
0.21393 0.57758
0.21891 0.58766
0.22388 0.59275
0.22886 0.59358
0.23383 0.60049
0.23881 0.60571
0.24378 0.61292
0.24876 0.61421
0.25373 0.61801
0.25871 0.62361
0.26368 0.62697
0.26866 0.65231
0.27363 0.65683
0.27861 0.65834
0.28358 0.66157
0.28856 0.67360
0.29353 0.69243
0.29851 0.71562
0.30348 0.73450
0.30846 0.74144
0.31343 0.77495
0.31841 0.79411
0.32338 0.79581
0.32836 0.79706
0.33333 0.79795
0.33831 0.80044
0.34328 0.82820
0.34826 0.84640
0.35323 0.84719
0.35821 0.85730
0.36318 0.86440
0.36816 0.86966
0.37313 0.87376
0.37811 0.87845
0.38308 0.88049
0.38806 0.88113
0.39303 0.88974
0.39801 0.89170
0.40299 0.89283
0.40796 0.90060
0.41294 0.92277
0.41791 0.93635
0.42289 0.94315
0.42786 0.95456
0.43284 0.95911
0.43781 0.97146
0.44279 0.97826
0.44776 0.97979
0.45274 0.97985
0.45771 0.98285
0.46269 0.98738
0.46766 0.98969
0.47264 0.99214
0.47761 1.00383
0.48259 1.02352
0.48756 1.03789
0.49254 1.04479
0.49751 1.04837
0.50249 1.04896
0.50746 1.04914
0.51244 1.05031
0.51741 1.05220
0.52239 1.06393
0.52736 1.08228
0.53234 1.10508
0.53731 1.10910
0.54229 1.11102
0.54726 1.12230
0.55224 1.12551
0.55721 1.13400
0.56219 1.13519
0.56716 1.13980
0.57214 1.14183
0.57711 1.14278
0.58209 1.15109
0.58706 1.15199
0.59204 1.15540
0.59701 1.16970
0.60199 1.17393
0.60697 1.17973
0.61194 1.18532
0.61692 1.20877
0.62189 1.22040
0.62687 1.22702
0.63184 1.22706
0.63682 1.22901
0.64179 1.25149
0.64677 1.26511
0.65174 1.27223
0.65672 1.27545
0.66169 1.30221
0.66667 1.32439
0.67164 1.33194
0.67662 1.33385
0.68159 1.34937
0.68657 1.35641
0.69154 1.35856
0.69652 1.36401
0.70149 1.36590
0.70647 1.36795
0.71144 1.36883
0.71642 1.38497
0.72139 1.39249
0.72637 1.39681
0.73134 1.40407
0.73632 1.40979
0.74129 1.41503
0.74627 1.44658
0.75124 1.44716
0.75622 1.46818
0.76119 1.47686
0.76617 1.48286
0.77114 1.48527
0.77612 1.50035
0.78109 1.50233
0.78607 1.50831
0.79104 1.51153
0.79602 1.51492
0.80100 1.52091
0.80597 1.53637
0.81095 1.53860
0.81592 1.55110
0.82090 1.56222
0.82587 1.56327
0.83085 1.56555
0.83582 1.56810
0.84080 1.57843
0.84577 1.59086
0.85075 1.62011
0.85572 1.62090
0.86070 1.62624
0.86567 1.63331
0.87065 1.64294
0.87562 1.64471
0.88060 1.64546
0.88557 1.67419
0.89055 1.68192
0.89552 1.68292
0.90050 1.69478
0.90547 1.70439
0.91045 1.78858
0.91542 1.79999
0.92040 1.84485
0.92537 1.85295
0.93035 1.86572
0.93532 1.88879
0.94030 1.90336
0.94527 1.95051
0.95025 1.96180
0.95522 1.97431
0.96020 2.04516
0.96517 2.04853
0.97015 2.06082
0.97512 2.07985
0.98010 2.09538
0.98507 2.12929
0.99005 2.39907
0.99502 2.42371
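
The first column of the two-column block above is consistent with rank-based positions i / (n + 1) for n = 200 points (1/201 ≈ 0.00498, 200/201 ≈ 0.99502), and the second column is in ascending order. This is an inference from the numbers, not something the file states. Under that assumption, a short Python sketch of how such pairs could be generated from raw values (the function name `plotting_positions` is hypothetical):

```python
def plotting_positions(values):
    """Pair each sorted value with its rank-based position i / (n + 1)."""
    ordered = sorted(values)
    n = len(ordered)
    return [(i / (n + 1), x) for i, x in enumerate(ordered, start=1)]


# First three values of the second column above, deliberately shuffled.
sample = [-0.20398, -0.31232, -0.24445]
for position, value in plotting_positions(sample):
    print(f"{position:.5f} {value:.5f}")
```

Dividing by n + 1 rather than n keeps every position strictly between 0 and 1, which matters when the positions are later mapped through an inverse distribution function.
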
46 46 Engine
35 81 Electrical System
12 93 Brakes
5 98 Air Conditioning
1 99 Transmission
1 100 Body Integrity
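
Each of the six rows above seems to carry a percentage, a running total, and a category name, which is the layout a Pareto-style chart needs. That interpretation is an assumption based on the values. A minimal Python sketch that rebuilds the running-total column from the first column:

```python
from itertools import accumulate

# Category names and first-column values copied from the rows above.
categories = ["Engine", "Electrical System", "Brakes",
              "Air Conditioning", "Transmission", "Body Integrity"]
shares = [46, 35, 12, 5, 1, 1]

# Running total of the shares, in the order listed.
running = list(accumulate(shares))
for share, total, name in zip(shares, running, categories):
    print(share, total, name)
```
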
1 Washington 94
2 Adams 48
3 Jefferson 96
4 Madison 96
5 Monroe 96
6 Adams 48
7 Jackson 96
8 Van Buren 48
9 Harrison 1
10 Tyler 47
11 Polk 48
12 Taylor 16
13 Fillmore 32
14 Pierce 48
15 Buchanan 48
16 Lincoln 49
17 Johnson 47
18 Grant 96
19 Hayes 48
20 Garfield 7
21 Arthur 41
22 Cleveland 48
23 Harrison 48
24 Cleveland 48
25 McKinley 54
26 Roosevelt 90
27 Taft 48
28 Wilson 96
29 Harding 29
30 Coolidge 67
31 Hoover 48
32 Roosevelt 146
33 Truman 92
34 Eisenhower 96
35 Kennedy 34
36 Johnson 62
37 Nixon 67
38 Ford 29
39 Carter 48
40 Reagan 96
41 Bush 48
42 Clinton 96
43 Bush 96
1 United States 0.34047 0.340
2 Brazil 0.11759 0.458
3 Japan 0.06413 0.522
4 India 0.06250 0.584
5 Germany 0.02267 0.607
6 United Kingdom 0.01639 0.623
7 Russia 0.01419 0.637
8 France 0.00986 0.647
9 Portugal 0.00880 0.656
10 Italy 0.00817 0.664
11 Mexico 0.00737 0.672
12 Spain 0.00598 0.678
13 Canada 0.00570 0.683
14 South Korea 0.00544 0.689
15 Indonesia 0.00516 0.694
16 Turkey 0.00492 0.699
17 Sweden 0.00481 0.704
18 Australia 0.00450 0.708
19 Taiwan 0.00443 0.713
20 Netherlands 0.00429 0.717
21 Poland 0.00395 0.721
22 Switzerland 0.00370 0.725
23 Argentina 0.00357 0.728
24 Thailand 0.00354 0.732
25 Philippines 0.00308 0.735
452.42
318.58
144.82
129.13
1216.45
991.56
1476.69
662.73
1302.85
1278.55
627.65
1030.78
215.23
44.50
133.410
619.96
970.97
1062.71
87.83
1068.87
157.42
492.97
381.01
1443.27
210.66
682.67
633.17
1236.97
1123.57
253.38
728.62
653.73
128.03
782.99
287.34
222.74
419.38
227.53
999.10
548.75
126.83
930.17
1399.19
820.88
2166.62
386.35
1915.44
1951.81
369.60
651.13
1041.65
848.52
812.15
131.27
2912.47
1099.05
536.90
158.68
2094.56
581.96
723.73
598.58
1773.79
218.55
594.83
2103.11
280.36
570.67
493.57
293.22
586.25
353.87
255.48
1367.96
721.66
360.59
673.46
1288.49
752.39
2527.11
1608.05
225.63
454.82
600.10
252.13
576.20
158.01
1576.44
687.30
1035.11
465.18
246.86
257.55
1300.21
869.90
2727.03
313.71
252.88
1640.32
1913.25
2000.17
562.12
1523.73
231.20
1275.59
546.98
353.40
955.06
719.15
1425.71
536.49
765.95
649.56
291.63
857.83
721.24
873.04
781.55
380.93
1368.84
745.09
756.59
1699.47
919.90
178.03
859.59
551.36
236.87
852.59
587.37
1993.22
354.37
688.57
195.55
702.82
705.11
921.86
421.14
944.19
130.57
2799.98
312.23
449.45
140.18
254.70
508.51
205.40
2362.88
1741.61
311.69
561.54
1603.72
383.11
2488.72
2722.94
1962.97
250.30
176.55
744.73
402.80
724.13
1638.88
794.95
2213.75
918.05
1786.72
284.57
595.27
512.54
297.53
617.06
310.80
418.10
1206.19
493.64
818.00
1136.76
1115.79
529.77
180.14
1063.00
304.78
1463.55
907.76
1301.04
492.20
286.37
697.97
2368.64
504.74
717.03
2499.45
195.90
354.85
2139.92
418.00
322.94
2459.05
233.99
457.74
193.66
752.21
1476.84
2106.03
233.46
1899.17
980.12
1873.61
899.63
1531.18
314.45
479.19
610.71
579.60
247.08
1645.14
1423.59
1412.39
207.25
1101.83
513.00
458.63
242.26
563.67
218.63
2340.79
2953.15
778.92
1906.94
759.89
542.88
661.30
1632.94
1123.42
111.83
287.99
590.73
994.96
1204.58
597.36
404.92
352.60
185.31
926.88
1001.26
1243.45
2790.89
215.82
589.09
138.78
729.17
246.78
1326.40
730.89
560.96
2000.48
1830.32
1410.95
346.26
566.51
101.40
295.59
872.86
294.74
415.80
2220.32
636.29
1668.21
753.74
706.75
1119.05
1549.08
592.78
2403.32
732.85
1047.00
701.49
508.81
570.88
939.32
237.01
308.16
497.31
567.55
487.54
197.96
786.83
625.44
157.11
929.10
632.80
321.09
866.24
846.95
301.36
1914.00
549.38
295.28
374.92
1868.64
540.91
1130.45
888.82
89.55
271.13
330.07
320.32
1202.39
502.77
1537.44
485.01
852.86
1568.39
856.32
129.70
203.95
298.33
407.60
695.60
1955.43
690.74
795.80
1442.01
299.73
482.30
1582.73
953.60
601.21
231.81
584.09
1008.54
648.55
1078.42
952.16
227.62
2819.79
453.60
1824.68
368.62
2181.53
106.55
1531.04
2695.54
1048.08
1076.04
118.56
846.40
424.04
1070.19
2580.90
963.94
893.70
428.10
1969.80
668.34
397.36
342.86
266.19
802.67
476.94
887.14
259.60
612.19
1125.39
258.48
1446.45
233.48
295.68
2883.59
2382.43
1417.73
180.45
599.98
182.84
555.72
1919.65
251.77
404.68
2242.30
1050.19
543.09
290.14
356.94
330.33
446.02
1428.40
621.57
2978.80