100 Best Data Science Books of All Time

We've researched and ranked the best data science books in the world, based on recommendations from world experts, sales data, and millions of reader ratings. Learn more

Featuring recommendations from Barack Obama, Malcolm Gladwell, Bill Gates, and 220 other experts.
1
Written by renowned data science experts Foster Provost and Tom Fawcett, Data Science for Business introduces the fundamental principles of data science, and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect. This guide also helps you understand the many data-mining techniques in use today.

Based on an MBA course Provost has taught at New York University over the past ten years, Data Science for Business provides examples of real-world business problems to illustrate these principles. You’ll...
more
Recommended by Kirk Borne, and 1 others.

Kirk BorneGreat book for Business Analytics and for building #AnalyticThinking >> “#DataScience for Business — What You Need to Know about #DataMining and Data-Analytic Thinking”: https://t.co/e9rAFnVYYQ #BigData #MachineLearning #DataStrategy #AnalyticsStrategy #Algorithms https://t.co/yEblfU2MZd (Source)

See more recommendations for this book...

2

An Introduction to Statistical Learning

With Applications in R

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree- based methods, support vector machines, clustering, and more. Color graphics and... more
Recommended by Roger D. Peng, and 1 others.

Roger D. PengThis book is written by a powerhouse of authors in the machine learning community, true authorities in the field. But beyond that, they’re also great writers. (Source)

See more recommendations for this book...

3
Don't simply show your data--tell a story with it! Storytelling with Data teaches you the fundamentals of data visualization and how to communicate effectively with data. You'll discover the power of storytelling and the way to make data a pivotal point in your story. The lessons in this illuminative text are grounded in theory, but made accessible through numerous real-world examples--ready for immediate application to your next graph or presentation.

Storytelling is not an inherent skill, especially when it comes to data visualization, and the tools at our disposal...
more
Recommended by Roger D. Peng, and 1 others.

Roger D. PengIt’s important to think in terms of what your audience needs, and what would be best for them among the many choices you could make when analysing data. (Source)

See more recommendations for this book...

4
Looking for complete instructions on manipulating, processing, cleaning, and crunching structured data in Python? The second edition of this hands-on guide--updated for Python 3.5 and Pandas 1.0--is packed with practical cases studies that show you how to effectively solve a broad set of data analysis problems, using Python libraries such as NumPy, pandas, matplotlib, and IPython.

Written by Wes McKinney, the main author of the pandas library, Python for Data Analysis also serves as a practical, modern introduction to scientific computing in Python for data-intensive...
more

See more recommendations for this book...

5
Data Science gets thrown around in the press like it's magic. Major retailers are predicting everything from when their customers are pregnant to when they want a new pair of Chuck Taylors. It's a brave new world where seemingly meaningless data can be transformed into valuable insight to drive smart business decisions.

But how does one exactly do data science? Do you have to hire one of these priests of the dark arts, the "data scientist," to extract this gold from your data? Nope.

Data science is little more than using straight-forward steps to process raw data into...
more

See more recommendations for this book...

6
To really learn data science, you should not only master the tools--data science libraries, frameworks, modules, and toolkits--but also understand the ideas and principles underlying them. Updated for Python 3.6, this second edition of Data Science from Scratch shows you how these tools and algorithms work by implementing them from scratch.

If you have an aptitude for mathematics and some programming skills, author Joel Grus will help you get comfortable with the math and statistics at the core of data science, and with the hacking skills you need to get started as a data...
more

Thorsten HellerThe Best #book to Start your #DataScience Journey - Towards #DataScience https://t.co/D8PlkkSxw6 by @benthecoder1 (Source)

See more recommendations for this book...

7
For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all—IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools.

Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing...
more
Recommended by Kirk Borne, and 2 others.

Kirk Borne✨🎉🌟Must see this >> Free #Python #DataScience Coding book series for #DataScientists ...via @DataScienceCtrl Go to https://t.co/To10VVZzIl ——————— #abdsc #BigData #MachineLearning #AI #DeepLearning #BeDataBrilliant #DataLiteracy https://t.co/Msuo1jiZSm (Source)

See more recommendations for this book...

8
One of Wall Street Journal's Best Ten Works of Nonfiction in 2012

New York Times Bestseller

"Not so different in spirit from the way public intellectuals like John Kenneth Galbraith once shaped discussions of economic policy and public figures like Walter Cronkite helped sway opinion on the Vietnam War…could turn out to be one of the more momentous books of the decade."
-New York Times Book Review

"Nate Silver's The Signal and the Noise is The Soul of a New Machine for the 21st century."
-Rachel Maddow, author of Drift

"A serious...
more
Recommended by Bill Gates, and 1 others.

Bill GatesAnyone interested in politics may be attracted to Nate Silver’s The Signal and the Noise: Why So Many Predictions Fail—but Some Don't. Silver is the New York Times columnist who got a lot of attention last fall for predicting—accurately, as it turned out–the results of the U.S. presidential election. This book actually came out before the election, though, and it’s about predictions in many... (Source)

See more recommendations for this book...

9
Factfulness: The stress-reducing habit of only carrying opinions for which you have strong supporting facts.

When asked simple questions about global trends—what percentage of the world’s population live in poverty; why the world’s population is increasing; how many girls finish school—we systematically get the answers wrong. So wrong that a chimpanzee choosing answers at random will consistently outguess teachers, journalists, Nobel laureates, and investment bankers.

In Factfulness, Professor of International Health and global TED phenomenon...
more

Barack ObamaAs 2018 draws to a close, I’m continuing a favorite tradition of mine and sharing my year-end lists. It gives me a moment to pause and reflect on the year through the books I found most thought-provoking, inspiring, or just plain loved. It also gives me a chance to highlight talented authors – some who are household names and others who you may not have heard of before. Here’s my best of 2018... (Source)

Bill GatesThis was a breakthrough to me. The framework Hans enunciates is one that took me decades of working in global development to create for myself, and I could have never expressed it in such a clear way. I’m going to try to use this model moving forward. (Source)

Nigel WarburtonIt’s an interesting book, it’s very challenging. It may be over-optimistic. But it does have this startling effect on the readers of challenging widely held assumptions. It’s a plea to look at the empirical data, and not just assume that you know how things are now. (Source)

See more recommendations for this book...

10
Longlisted for the National Book Award
New York Times Bestseller


A former Wall Street quant sounds an alarm on the mathematical models that pervade modern life -- and threaten to rip apart our social fabric

We live in the age of the algorithm. Increasingly, the decisions that affect our lives--where we go to school, whether we get a car loan, how much we pay for health insurance--are being made not by humans, but by mathematical models. In theory, this should lead to greater fairness: Everyone is judged according to the same rules, and bias is...
more
Recommended by Paula Boddington, Ramesh Srinivasan, and 2 others.

Paula BoddingtonHow the use of algorithms has affected people’s lives and occasionally ruined them. (Source)

Ramesh SrinivasanThis book is a really fantastic analysis of how quantification, the collection of data, the modelling around data, the predictions made by using data, the algorithmic and quantifiable ways of predicting behaviour based on data, are all built by elites for elites and end up, quite frankly, screwing over everybody else. (Source)

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
11
Foreword by Steven Pinker

Blending the informed analysis of The Signal and the Noise with the instructive iconoclasm of Think Like a Freak, a fascinating, illuminating, and witty look at what the vast amounts of information now instantly available to us reveals about ourselves and our world—provided we ask the right questions.

By the end of an average day in the early twenty-first century, human beings searching the internet will amass eight trillion gigabytes of data. This staggering amount of information—unprecedented in history—can tell us a great deal about who we...
more
Recommended by Jj. Omojuwa, Ron Fournier, and 2 others.

Jj. Omojuwa@SympLySimi Lol. Read this book. You’d love it. https://t.co/d2cLOyoiZ9 (Source)

Ron FournierJust finished, “Everybody Lies” by @SethS_D, which in addition to being a tremendous education on Big Data, includes the best conclusion to a non-fiction book I’ve ever read. Read it. -30- (Source)

See more recommendations for this book...

12
Moneyball is a quest for something as elusive as the Holy Grail, something that money apparently can't buy: the secret of success in baseball. The logical places to look would be the front offices of major league teams and the dugouts, perhaps even in the minds of the players themselves. Michael Lewis mines all these possibilities - his intimate and original portraits of big league ballplayers are alone worth the price of admission - but the real jackpot is a cache of numbers - numbers! - collected over the years by a strange brotherhood of amateur baseball enthusiasts: software... more

Carol DweckYou would think that the relationship between training and skill would be utterly obvious in sports, but apparently it isn’t. (Source)

David PapineauIt’s a parable of the disinclination of people in general to base their practices on evidence, a parable for evidence-based policy in general. (Source)

Ed SmithThis is about a guy using econometrics to predict which baseball players will do better in advancing wins, a remarkable use of economic thinking. (Source)

See more recommendations for this book...

13
Once considered tedious, the field of statistics is rapidly evolving into a discipline Hal Varian, chief economist at Google, has actually called “sexy.” From batting averages and political polls to game shows and medical research, the real-world application of statistics continues to grow by leaps and bounds. How can we catch schools that cheat on standardized tests? How does Netflix know which movies you’ll like? What is causing the rising incidence of autism? As best-selling author Charles Wheelan shows us in Naked Statistics, the right data and a few well-chosen statistical tools... more

See more recommendations for this book...

14
Through a series of recent breakthroughs, deep learning has boosted the entire field of machine learning. Now, even programmers who know close to nothing about this technology can use simple, efficient tools to implement programs capable of learning from data. This practical book shows you how.

By using concrete examples, minimal theory, and two production-ready Python frameworks-scikit-learn and TensorFlow-author Aurélien Géron helps you gain an intuitive understanding of the concepts and tools for building intelligent systems. You'll learn a range of techniques, starting with simple...

more
Recommended by Mark Tabladillo, and 1 others.

Mark TabladilloBook to Start You on Machine Learning - KDnuggets https://t.co/19fdX59b0d This book is “Hands-On Machine Learning with Scikit-Learn & TensorFlow”. each new revision has become an even better version of one of the best in-depth resources to learn Machine Learning by doing. https://t.co/ujyUH3xU3e (Source)

See more recommendations for this book...

15
The classic book on statistical graphics, charts, tables. Theory and practice in the design of data graphics, 250 illustrations of the best (and a few of the worst) statistical graphics, with detailed analysis of how to display data for precise, effective, quick analysis. Design of the high-resolution displays, small multiples. Editing and improving graphics. The data-ink ratio. Time-series, relational graphics, data maps, multivariate designs. Detection of graphical deception: design variation vs. data variation. Sources of deception. Aesthetics and data graphical displays.
This is the...
more
Recommended by Bret Victor, Michael Okuda, and 2 others.

Michael OkudaEdward Tufte's classic book, The Visual Display of Quantitative Information is a fascinating, surprisingly readable treatise for anyone interested in infographics. When I hired artists for the Star Trek graphics dept, I sometimes asked them to read it.https://t.co/cK4GQqBDxp (Source)

See more recommendations for this book...

16
A black swan is a highly improbable event with three principal characteristics: It is unpredictable; it carries a massive impact; and, after the fact, we concoct an explanation that makes it appear less random, and more predictable, than it was.

The astonishing success of Google was a black swan; so was 9/11. For Nassim Nicholas Taleb, black swans underlie almost everything about our world, from the rise of religions to events in our own personal lives.

Why do we not acknowledge the phenomenon of black swans until after they occur? Part of the answer, according to...
more
Recommended by Bill Gates, Jeff Bezos, Simon Sinek, and 22 others.

Bill Gates[On Bill Gates's reading list in 2012.] (Source)

Jeff Bezos[From the book "The Everything Store: and the Age of Amazon"] “The scholar argues that people are wired to see patterns in chaos while remaining blind to unpredictable events, with massive consequences. Experimentation and empiricism trumps the easy and obvious narrative,” Stone writes. (Source)

James AltucherAnd throw in “The Black Swan” and “Fooled by Randomness”. “Fragile” means if you hit something might break. “Resilient” means if you hit something, it will stay the same. On my podcast Nassim discusses “Antifragility” – building a system, even on that works for you on a personal level, where you if you harm your self in some way it becomes stronger. That podcast changed my life He discusses... (Source)

See more recommendations for this book...

17

Thinking, Fast and Slow

Major New York Times bestseller
Winner of the National Academy of Sciences Best Book Award in 2012
Selected by the New York Times Book Review as one of the best books of 2011
A Globe and Mail Best Books of the Year 2011 Title
One of The Economist's 2011 Books of the Year
One of The Wall Street Journal's Best Nonfiction Books of the Year 2011
2013 Presidential Medal of Freedom Recipient

In the international bestseller, Thinking, Fast and Slow, Daniel Kahneman, the renowned psychologist and winner of the Nobel...
more

Barack ObamaA few months ago, Mr. Obama read “Thinking, Fast and Slow,” by Daniel Kahneman, about how people make decisions — quick, instinctive thinking versus slower, contemplative deliberation. For Mr. Obama, a deliberator in an instinctive business, this may be as instructive as any political science text. (Source)

Bill Gates[On Bill Gates's reading list in 2012.] (Source)

Marc AndreessenCaptivating dive into human decision making, marred by inclusion of several/many? psychology studies that fail to replicate. Will stand as a cautionary tale? (Source)

See more recommendations for this book...

18
Which is more dangerous, a gun or a swimming pool? What do schoolteachers and sumo wrestlers have in common? Why do drug dealers still live with their moms? How much do parents really matter? What kind of impact did Roe v. Wade have on violent crime? Freakonomics will literally redefine the way we view the modern world.

These may not sound like typical questions for an economist to ask. But Steven D. Levitt is not a typical economist. He is a much heralded scholar who studies the stuff and riddles of everyday life -- from cheating and crime to sports and child rearing -- and whose...
more

Malcolm GladwellI don’t need to say much here. This book invented an entire genre. Economics was never supposed to be this entertaining. (Source)

Daymond JohnI love newer books like [this book]. (Source)

James Altucher[James Altucher recommended this book on the podcast "The Tim Ferriss Show".] (Source)

See more recommendations for this book...

19

In April 1956, a refitted oil tanker carried fifty-eight shipping containers from Newark to Houston. From that modest beginning, container shipping developed into a huge industry that made the boom in global trade possible. "The Box" tells the dramatic story of the container's creation, the decade of struggle before it was widely adopted, and the sweeping economic consequences of the sharp fall in transportation costs that containerization brought about.
Published on the fiftieth anniversary of the first container voyage, this is the first comprehensive history of the shipping...
more

Bill GatesI picked this one up after seeing it on a Wall Street Journal list of good books for investors. It was first published in 1954, but it doesn’t feel dated (aside from a few anachronistic examples—it has been a long time since bread cost 5 cents a loaf in the United States). In fact, I’d say it’s more relevant than ever. One chapter shows you how visuals can be used to exaggerate trends and give... (Source)

Tobi LütkeWe all live in Malcolm’s world because the shipping container has been hugely influential in history. (Source)

Jason ZweigThis is a terrific introduction to critical thinking about statistics, for people who haven’t taken a class in statistics. (Source)

See more recommendations for this book...

20

The Elements of Statistical Learning

Data Mining, Inference, and Prediction

During the past decade there has been an explosion in computation and information technology. With it have come vast amounts of data in a variety of fields such as medicine, biology, finance, and marketing. The challenge of understanding these data has led to the development of new tools in the field of statistics, and spawned new areas such as data mining, machine learning, and bioinformatics. Many of these tools have common underpinnings but are often expressed with different terminology. This book describes the important ideas in these areas in a common conceptual framework. While the...

more
Recommended by Nassim Nicholas Taleb, and 1 others.

Nassim Nicholas TalebVery comprehensive, sufficiently technical to get most of the plumbing behind machine learning. Very useful as a reference book (actually, there is no other complete reference book). The authors are the real thing (Tibshirani is the one behind the LASSO regularization technique). Uses some mathematical statistics without the burdens of measure theory and avoids the obvious but complicated... (Source)

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
21

Pattern Recognition and Machine Learning

Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science. However, these activities can be viewed as two facets of the same field, and together they have undergone substantial development over the past ten years. In particular, Bayesian methods have grown from a specialist niche to become mainstream, while graphical models have emerged as a general framework for describing and applying probabilistic models. Also, the practical applicability of Bayesian methods has been greatly enhanced through the development of a range of approximate inference... more

See more recommendations for this book...

22
Statistical methods are a key part of of data science, yet very few data scientists have any formal statistics training. Courses and books on basic statistics rarely cover the topic from a data science perspective. This practical guide explains how to apply various statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what's important and what's not.

Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you're familiar with the R programming language, and have some exposure to...
more

See more recommendations for this book...

23

Deep Learning

An introduction to a broad range of topics in deep learning, covering mathematical and conceptual background, deep learning techniques used in industry, and research perspectives.

Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts. Because the computer gathers knowledge from experience, there is no need for a human computer operator to formally specify all the knowledge that the computer needs. The hierarchy of concepts allows the computer to learn complicated concepts by...
more

Elon MuskWritten by three experts in the field, Deep Learning is the only comprehensive book on the subject. (Source)

Nassim Nicholas TalebVery clear exposition, does the math without getting lost in the details. Although many of the concepts of the introductory first 100 pages can be found elsewhere, they are presented with remarkable cut-to-the-chase clarity. (Source)

Satya NadellaElon Musk and Facebook AI chief Yann LeCun have praised this textbook on one of software’s most promising frontiers. After its publication, Microsoft signed up coauthor Bengio, a pioneer in machine learning, as an adviser (Source)

See more recommendations for this book...

24

Doing Data Science

Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that’s so clouded in hype? This insightful book, based on Columbia University’s Introduction to Data Science class, tells you what you need to know.

In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you’re...
more

See more recommendations for this book...

25

Lean Analytics

Use Data to Build a Better Startup Faster

Whether you’re a startup founder trying to disrupt an industry or an intrapreneur trying to provoke change from within, your biggest challenge is creating a product people actually want. Lean Analytics steers you in the right direction.

This book shows you how to validate your initial idea, find the right customers, decide what to build, how to monetize your business, and how to spread the word. Packed with more than thirty case studies and insights from over a hundred business experts, Lean Analytics provides you with hard-won, real-world information no entrepreneur...
more

Ola OlusogaLike Charlie Munger once said: “I’ve long believed that a certain system - which almost any intelligent person can learn - works way better than the systems most people use [to understand the world]. What you need is a latticework of mental models in your head. And, with that system, things gradually fit together in a way that enhances cognition. Just as multiple factors shape every system,... (Source)

See more recommendations for this book...

26
WARNING! To avoid buying counterfeit on Amazon, click on "See All Buying Options" and choose "Amazon.com" and not a third-party seller.

Concise and to the point — the book can be read during a week. During that week, you will learn almost everything modern machine learning has to offer. The author and other practitioners have spent years learning these concepts.

Companion wiki — the book has a continuously updated wiki that extends some book chapters with additional information: Q&A, code snippets, further reading, tools, and other relevant resources.
more
Recommended by Kirk Borne, and 1 others.

Kirk BorneRecent top-selling books in #AI & #MachineLearning: https://t.co/Ij9I7SzR4d ————— #BigData #DataScience #DataMining #Algorithms #PredictiveAnalytics #Python ————— ...in the TOP 10: 1)The Hundred-Page ML Book: https://t.co/dQ7nP6gwP0 2)Hands-on ML with...: https://t.co/Y0Iz3GbtGP https://t.co/72rAFN1FwW (Source)

See more recommendations for this book...

27

Applied Predictive Modeling

This text is intended for a broad audience as both an introduction to predictive models as well as a guide to applying them. Non- mathematical readers will appreciate the intuitive explanations of the techniques while an emphasis on problem-solving with real data across a wide variety of applications will aid practitioners who wish to extend their expertise. Readers should have knowledge of basic statistical ideas, such as correlation and linear regression analysis. While the text is biased against complex equations, a mathematical background is needed for advanced topics. Dr. Kuhn is a... more
Recommended by Kirk Borne, and 1 others.

Kirk BorneFind more than 40 useful #PredictiveModeling articles here at @DataScienceCtrl https://t.co/KdcvLRffRk #abdsc ———— #BigData #DataScience #AI #MachineLearning #Forecasting #Statistics #PredictiveAnalytics ——— +This is the best book on the subject: https://t.co/SmsepmniHi https://t.co/amBJHCJSHN (Source)

See more recommendations for this book...

28

Deep Learning with Python

Deep learning is applicable to a widening range of artificial intelligence problems, such as image classification, speech recognition, text classification, question answering, text-to-speech, and optical character recognition. It is the technology behind photo tagging systems at Facebook and Google, self-driving cars, speech recognition systems on your smartphone, and much more.

In particular, Deep learning excels at solving machine perception problems: understanding the content of image data, video data, or sound data. Here's a simple example: say you have a large collection of...
more

See more recommendations for this book...

29
A New York Times Bestseller

An audacious, irreverent investigation of human behavior—and a first look at a revolution in the making

 
Our personal data has been used to spy on us, hire and fire us, and sell us stuff we don’t need. In Dataclysm, Christian Rudder uses it to show us who we truly are.
 
For centuries, we’ve relied on polling or small-scale lab experiments to study human behavior. Today, a new approach is possible. As we live more of our lives online, researchers can finally observe us directly, in vast numbers, and...
more
Recommended by Elad Yom-Tov, and 1 others.

Elad Yom-TovChristian Rudder was the chief scientist of a dating website, OK Cupid. (Source)

See more recommendations for this book...

30

Algorithms to Live By

The Computer Science of Human Decisions

A fascinating exploration of how insights from computer algorithms can be applied to our everyday lives, helping to solve common decision-making problems and illuminate the workings of the human mind

All our lives are constrained by limited space and time, limits that give rise to a particular set of problems. What should we do, or leave undone, in a day or a lifetime? How much messiness should we accept? What balance of new activities and familiar favorites is the most fulfilling? These may seem like uniquely human quandaries, but they are not: computers, too, face the same...
more

Doug McMillonHere are some of my favorite reads from 2017. Lots of friends and colleagues send me book suggestions and it's impossible to squeeze them all in. I continue to be super curious about how digital and tech are enabling people to transform our lives but I try to read a good mix of books that apply to a variety of areas and stretch my thinking more broadly. (Source)

Sriram Krishnan@rabois @nealkhosla Yes! Love that book (Source)

Chris OliverThis is a great book talking about how you can use computer science to help you make decisions in life. How do you know when to make a decision on the perfect house? Car? etc? It helps you apply algorithms to making those decisions optimally without getting lost. (Source)

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
31
A comprehensive introduction to machine learning that uses probabilistic models and inference as a unifying approach.

Today's Web-enabled deluge of electronic data calls for automated methods of data analysis. Machine learning provides these, developing methods that can automatically detect patterns in data and then use the uncovered patterns to predict future data. This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach.

The coverage combines breadth and depth, offering...
more
Recommended by Kirk Borne, and 1 others.

Kirk Borne[Book] #MachineLearning — a Probabilistic Perspective: https://t.co/wAZwLoUFGF ———— #BigData #Statistics #DataScience #DeepLearning #AI #Algorithms #StatisticalLiteracy #Mathematics #abdsc ——— ⬇Get this brilliant 1100-page 28-chapter highly-rated book: https://t.co/Tm2zchpHSu https://t.co/jprUDdzkj8 (Source)

See more recommendations for this book...

32

Big Data

A Revolution That Will Transform How We Live, Work, and Think

A revelatory exploration of the hottest trend in technology and the dramatic impact it will have on the economy, science, and society at large.

Which paint color is most likely to tell you that a used car is in good shape? How can officials identify the most dangerous New York City manholes before they explode? And how did Google searches predict the spread of the H1N1 flu outbreak?

The key to answering these questions, and many more, is big data. “Big data” refers to our burgeoning ability to crunch vast collections of information, analyze it instantly, and draw...
more

See more recommendations for this book...

33
Link to the GitHub Repository containing the code examples and additional material: https://github.com/rasbt/python-machi...

Many of the most innovative breakthroughs and exciting new technologies can be attributed to applications of machine learning. We are living in an age where data comes in abundance, and thanks to the self-learning algorithms from the field of machine learning, we can turn this data into knowledge. Automated speech recognition on our smart phones, web search...
more

See more recommendations for this book...

34

Superforecasting

The Art and Science of Prediction

New York Times Bestseller

An Economist Best Book of 2015

"The most important book on decision making since Daniel Kahneman's Thinking, Fast and Slow."
Jason Zweig, The Wall Street Journal
 
Everyone would benefit from seeing further into the future, whether buying stocks, crafting policy, launching a new product, or simply planning the week’s meals. Unfortunately, people tend to be terrible forecasters. As Wharton professor Philip Tetlock showed in a landmark 2005 study, even...
more

Sheil KapadiaRead the book Superforecasting, had a great conversation with @bcmassey and came up with seven ideas for how NFL teams can try to find small edges during the draft process. Would love to hear feedback on this one. https://t.co/PdN1fKCagl (Source)

Julia Galef[Has] some good advice on how to improve your ability to make accurate predictions. (Source)

See more recommendations for this book...

35
Many forces affect software today: larger datasets, geographical disparities, complex company structures, and the growing need to be fast and nimble in the face of change.

Proven approaches such as service-oriented and event-driven architectures are joined by newer techniques such as microservices, reactive architectures, DevOps, and stream processing. Many of these patterns are successful by themselves, but as this practical ebook demonstrates, they provide a more holistic and compelling approach when applied together.

Author Ben Stopford explains how service-based...
more

See more recommendations for this book...

36

Envisioning Information

The celebrated design professor here tackles the question of how best to communicate real-life experience in a two-degree format, whether on the printed page or the computer screen. The Whole Earth Review called Envisioning Information a "passionate, elegant revelation." less

Kevin RoseThe master when it comes to taking complicated data and turning it into beautiful charts and graphs that are easy to understand. If you’re into graphic design, print design, web design, you name it, you’re going to get some really good information and how tos out of these books. He has a whole series of these books. (Source)

See more recommendations for this book...

37
"Mesmerizing & fascinating..." -- The Seattle Post-Intelligencer

"The Freakonomics of big data." --Stein Kretsinger, founding executive of Advertising.com

Award-winning - Used by over 30 universities - Translated into 9 languages

An introduction for everyone. In this rich, fascinating -- surprisingly accessible -- introduction, leading expert Eric Siegel reveals how predictive analytics (aka machine learning) works, and how it affects everyone every day. Rather than a "how to" for hands-on...
more

See more recommendations for this book...

38
Machine learning has become an integral part of many commercial applications and research projects, but this field is not exclusive to large companies with extensive research teams. If you use Python, even as a beginner, this book will teach you practical ways to build your own machine learning solutions. With all the data available today, machine learning applications are limited only by your imagination.

You'll learn the steps necessary to create a successful machine-learning application with Python and the scikit-learn library. Authors Andreas Muller and Sarah Guido focus on the...
more
Recommended by Francesco Marconi, and 1 others.

Francesco MarconiTop programming languages ranked by its annual search engine popularity. Python has gained momentum because of its importance to machine learning development. At @WSJ we are using it to build tools for journalists. Tip: this is a great book for anyone who wants to get started! https://t.co/ZsHjqB5gvC (Source)

See more recommendations for this book...

39

Data Science with R

Recommended by Tim @Realscientists, and 1 others.

Tim @RealscientistsIf you are interested in learning programming, there are lots of great tutorials. For data analysis, R and the R 4 data science book is a great way to go https://t.co/zezYpG0TRL, and for general R syntax, there is the swirl learning package https://t.co/Tzfpnlgo3O /20 (Source)

See more recommendations for this book...

40
No matter what your actual job title, you are--or soon will be--a data worker.
Every day, at work, home, and school, we are bombarded with vast amounts of free data collected and shared by everyone and everything from our co-workers to our calorie counters. In this highly anticipated follow-up to The Functional Art--Alberto Cairo's foundational guide to understanding information graphics and visualization--the respected data visualization professor explains in clear terms how to work with data, discover the stories hidden within, and share those stories with the world in...
more

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
41
Collecting data is relatively easy, but turning raw information into something useful requires that you know how to extract precisely what you need. With this insightful book, intermediate to experienced programmers interested in data analysis will learn techniques for working with data in a business environment. You'll learn how to look at data to discover what it contains, how to capture those ideas in conceptual models, and then feed your understanding back into the organization through business plans, metrics dashboards, and other applications.

Along the way, you'll experiment...
more

See more recommendations for this book...

42
A thought-provoking and wide-ranging exploration of machine learning and the race to build computer intelligences as flexible as our own
In the world's top research labs and universities, the race is on to invent the ultimate learning algorithm: one capable of discovering any knowledge from data, and doing anything we want, before we even ask. In The Master Algorithm, Pedro Domingos lifts the veil to give us a peek inside the learning machines that power Google, Amazon, and your smartphone. He assembles a blueprint for the future universal learner--the Master Algorithm--and...
more
Recommended by Vinod Khosla, and 1 others.

Vinod KhoslaIf you want speculation about what the master AI might need (one view). For a slightly more technical read, I’d suggest Ian Goodfellows Deep Learning. (Source)

See more recommendations for this book...

43
Imagine a world where your phone is too big for your hand, where your doctor prescribes a drug that is wrong for your body, where in a car accident you are 47% more likely to be seriously injured, where every week the countless hours of work you do are not recognised or valued. If any of this sounds familiar, chances are that you're a woman.

Invisible Women shows us how, in a world largely built for and by men, we are systematically ignoring half the population. It exposes the gender data gap – a gap in our knowledge that is at the root of perpetual, systemic discrimination against...
more

Konnie Huq@FenTiger697 @WokingAmnesty @CCriadoPerez @Hatchards @radioleary Brilliant book by the brilliant @CCriadoPerez 😍 (Source)

Feminist Next Door@Rockmedia Awesome book (Source)

Nigel ShadboltInvisible Women is an exposé of just how much of the world around us is designed around the default male. Deploying a huge range of data and examples, Caroline Criado Perez, who is a writer, broadcaster and award winning campaigner, presents on overwhelming case for change. Every page is full of facts and data that support her fundamental contention that in a world built for and by men, gender... (Source)

See more recommendations for this book...

44
Fooled by Randomness is a standalone book in Nassim Nicholas Taleb’s landmark Incerto series, an investigation of opacity, luck, uncertainty, probability, human error, risk, and decision-making in a world we don’t understand. The other books in the series are The Black Swan, Antifragile, and The Bed of Procrustes.

Now in a striking new hardcover edition, Fooled by Randomness is the word-of-mouth sensation that will change the way you think about business and the world. Nassim Nicholas Taleb–veteran trader, renowned risk expert, polymathic scholar,...
more

James AltucherAnd throw in “The Black Swan” and “Fooled by Randomness”. “Fragile” means if you hit something might break. “Resilient” means if you hit something, it will stay the same. On my podcast Nassim discusses “Antifragility” – building a system, even on that works for you on a personal level, where you if you harm your self in some way it becomes stronger. That podcast changed my life He discusses... (Source)

Howard MarksReally about how much randomness there is in our world. (Source)

Anant JainThe five-book series, "Incerto", by Nassim Nicholas Taleb has had a profound impact on how I think about the world. There’s some overlap across the books — but you'll likely find the repetition helpful in retaining the content better. (Source)

See more recommendations for this book...

45
Want to tap the power behind search rankings, product recommendations, social bookmarking, and online matchmaking? This fascinating book demonstrates how you can build Web 2.0 applications to mine the enormous amount of data created by people on the Internet. With the sophisticated algorithms in this book, you can write smart programs to access interesting datasets from other web sites, collect data from users of your own applications, and analyze and understand the data once you've found it.

Programming Collective Intelligence takes you into the world of machine learning...
more

See more recommendations for this book...

46
This book describes, simply and in general terms, the process of analyzing data. The authors have extensive experience both managing data analysts and conducting their own data analyses, and have carefully observed what produces coherent results and what fails to produce useful insights into data. This book is a distillation of their experience in a format that is applicable to both practitioners and managers in data science. less

See more recommendations for this book...

47

Advanced R

An Essential Reference for Intermediate and Advanced R Programmers

Advanced R presents useful tools and techniques for attacking many types of R programming problems, helping you avoid mistakes and dead ends. With more than ten years of experience programming in R, the author illustrates the elegance, beauty, and flexibility at the heart of R.



The book develops the necessary skills to produce quality code that can be used in a variety of circumstances. You will learn:

The fundamentals of R, including standard data types...
more

See more recommendations for this book...

49

The Book of Why

The New Science of Cause and Effect

A Turing Award-winning computer scientist and statistician shows how understanding causality has revolutionized science and will revolutionize artificial intelligence
"Correlation is not causation." This mantra, chanted by scientists for more than a century, has led to a virtual prohibition on causal talk. Today, that taboo is dead. The causal revolution, instigated by Judea Pearl and his colleagues, has cut through a century of confusion and established causality--the study of cause and effect--on a firm scientific basis. His work explains how we can know easy things,...
more
Recommended by D.a. Wallach, Kirk Borne, and 2 others.

D.a. Wallach@EricTopol @yudapearl @bschoelkopf @MPI_IS I love @yudapearl 's book so much! Profound, heterodox. (Source)

Kirk Borne.@yudapearl wrote the awesome "Book of Why", but he recommends this fun and less #mathematics-heavy read >> his #AI lecture given in 1999: https://t.co/kNYIoJ8qcY #DataScience #MachineLearning #Statistics #BookofWhy #Causalinference #Bayes https://t.co/CNQlKP8cU3 (Source)

See more recommendations for this book...

50
Your cell phone provider tracks your location and knows who’s with you. Your online and in-store purchasing patterns are recorded, and reveal if you're unemployed, sick, or pregnant. Your e-mails and texts expose your intimate and casual friends. Google knows what you’re thinking because it saves your private searches. Facebook can determine your sexual orientation without you ever mentioning it.

The powers that surveil us do more than simply store this information. Corporations use surveillance to manipulate not only the news articles and advertisements we each see, but also the...
more

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
51

Natural Language Processing with Python

This book offers a highly accessible introduction to natural language processing, the field that supports a variety of language technologies, from predictive text and email filtering to automatic summarization and translation. With it, you'll learn how to write Python programs that work with large collections of unstructured text. You'll access richly annotated datasets using a comprehensive range of linguistic data structures, and you'll understand the main algorithms for analyzing the content and structure of written communication.

Packed with examples and exercises, Natural...
more

See more recommendations for this book...

52

Learning From Data

A Short Course

Machine learning allows computational systems to adaptively improve their performance with experience accumulated from the observed data. Its techniques are widely applied in engineering, science, finance, and commerce. This book is designed for a short course on machine learning. It is a short course, not a hurried course. From over a decade of teaching this material, we have distilled what we believe to be the core topics that every student of the subject should know. We chose the title `learning from data' that faithfully describes what the subject is about, and made it a point to cover... more

See more recommendations for this book...

53
Dr. Kai-Fu Lee—one of the world’s most respected experts on AI and China—reveals that China has suddenly caught up to the US at an astonishingly rapid and unexpected pace.   In AI Superpowers, Kai-fu Lee argues powerfully that because of these unprecedented developments in AI, dramatic changes will be happening much sooner than many of us expected. Indeed, as the US-Sino AI competition begins to heat up, Lee urges the US and China to both accept and to embrace the great responsibilities that come with significant technological power. Most experts already say that AI will have a devastating... more

Yuval Noah HarariA superb and very timely survey of the impact of AI on the geopolitical system, the job market and human society. (Source)

Arianna HuffingtonKai-Fu Lee's experience as an AI pioneer, top investor, and cancer survivor has led to this brilliant book about global technology. AI Superpowers gives us a guide to a future that celebrates all the benefits that AI will bring, while cultivating what is unique about our humanity. It’s one of those books you read and think, ‘Why are people reading any other book right now when this is so clearly... (Source)

Satya NadellaKai-Fu Lee's smart analysis on human-AI coexistence is clear-eyed and a must-read. We must look deep within ourselves for the values and wisdom to guide AI's development. (Source)

See more recommendations for this book...

54
Information theory and inference, often taught separately, are here united in one entertaining textbook. These topics lie at the heart of many exciting areas of contemporary science and engineering - communication, signal processing, data mining, machine learning, pattern recognition, computational neuroscience, bioinformatics, and cryptography. This textbook introduces theory in tandem with applications. Information theory is taught alongside practical communication systems, such as arithmetic coding for data compression and sparse-graph codes for error-correction. A toolbox of inference... more

See more recommendations for this book...

55
Practical data design tips from a data visualization expert of the modern age Data doesn't decrease; it is ever-increasing and can be overwhelming to organize in a way that makes sense to its intended audience. Wouldn't it be wonderful if we could actually visualize data in such a way that we could maximize its potential and tell a story in a clear, concise manner? Thanks to the creative genius of Nathan Yau, we can. With this full-color book, data visualization guru and author Nathan Yau uses step-by-step tutorials to show you how to visualize and tell stories with data. He explains... more
Recommended by Roger D. Peng, and 1 others.

Roger D. PengThis book is about how best to present data to other people, what are the tools that you can use, and the types of visualizations that you can make. (Source)

See more recommendations for this book...

56
Unlike any time before in our lives, we have access to vast amounts of free information. With the right tools, we can start to make sense of all this data to see patterns and trends that would otherwise be invisible to us. By transforming numbers into graphical shapes, we allow readers to understand the stories those numbers hide. In this practical introduction to understanding and using information graphics, you'll learn how to use data visualizations as tools to see beyond lists of numbers and variables and achieve new insights into the complex world around us. Regardless of the kind... more

See more recommendations for this book...

57
Superintelligence asks the questions: what happens when machines surpass humans in general intelligence? Will artificial agents save or destroy us? Nick Bostrom lays the foundation for understanding the future of humanity and intelligent life.

The human brain has some capabilities that the brains of other animals lack. It is to these distinctive capabilities that our species owes its dominant position. If machine brains surpassed human brains in general intelligence, then this new superintelligence could become extremely powerful--possibly beyond our control. As the fate of the...
more

Elon MuskWorth reading Superintelligence by Bostrom. We need to be super careful with AI. Potentially more dangerous than nukes. (Source)

Maria RamosRamos will take the summer to examine some of the questions weighing more heavily on humankind as we contemplate our collective future: what happens when we can write our own genetic codes, and what happens when we create technology that is meaningfully more intelligent than us. The Gene: An Intimate History—Siddhartha Mukherjee Superintelligence: Paths, Dangers, Strategies—Nick Bostrom The... (Source)

Will MacAskillI picked this book because the possibility of us developing human-level artificial intelligence, and from there superintelligence—an artificial agent that is considerably more intelligent than we are—is at least a contender for the most important issue in the next two centuries. Bostrom’s book has been very influential in effective altruism, lots of people work on artificial intelligence in order... (Source)

See more recommendations for this book...

58
Even the smartest among us can feel inept as we fail to figure out which light switch or oven burner to turn on, or whether to push, pull, or slide a door. The fault, argues this ingenious—even liberating—book, lies not in ourselves, but in product design that ignores the needs of users and the principles of cognitive psychology. The problems range from ambiguous and hidden controls to arbitrary relationships between controls and functions, coupled with a lack of feedback or other assistance and unreasonable demands on memorization. The Design of Everyday Things shows that good, usable... more

Marius Ciuchete Pauneval(ez_write_tag([[250,250],'theceolibrary_com-large-mobile-banner-2','ezslot_5',164,'0','1'])); Question: Was there a moment, specifically, when something you read in a book helped you? Answer: Yes there was. In fact, I can remember two separate sentences from two different books: The first one comes from “The Design of Everyday Things” by Don Norman. It says: “great design will help... (Source)

Grey BakerI mainly read to decompress and change my state of mind, so it’s hard to point to an insight I read that helped me. Reading fiction has pulled me out of a bad mood more times than I can count, though, and always reenergises me to attack problems that had stumped me again. That said, I read and loved Norman Norman’s “The Design of Everyday Things”, and it’s helped me think through design problems... (Source)

Kaci LambeThese three books are about how people actually use design in their lives. They helped me understand this very basic idea: There are no dumb users, only bad designers. Take the time to create based on how your design will be interacted with. Test it. Iterate. That's how you become a good designer. (Source)

See more recommendations for this book...

59

Artificial Intelligence

A Modern Approach

For one or two-semester, undergraduate or graduate-level courses in Artificial Intelligence. The long-anticipated revision of this best-selling text offers the most comprehensive, up-to-date introduction to the theory and practice of artificial intelligence. *NEW-Nontechnical learning material-Accompanies each part of the book. *NEW-The Internet as a sample application for intelligent systems-Added in several places including logical agents, planning, and natural language. *NEW-Increased coverage of material - Includes expanded coverage of: default reasoning and truth maintenance systems,... more

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
61

Antifragile

Things That Gain from Disorder

From the bestselling author of The Black Swan and one of the foremost philosophers of our time, Nassim Nicholas Taleb, a book on how some systems actually benefit from disorder.

In The Black Swan Taleb outlined a problem; in Antifragile he offers a definitive solution: how to gain from disorder and chaos while being protected from fragilities and adverse events. For what he calls the "antifragile" is one step beyond robust, as it benefits from adversity, uncertainty and stressors, just as human bones get stronger when subjected to stress and tension.

Taleb stands...
more

James AltucherYou ask about success. To be successful you have to avoid being “fragile” – the idea that if something hurts you, you let collapse completely. You also have to avoid simply being resilient. Bouncing back is not enough. Antifragile is when something tries to hurt you and you come back stronger. That is real life business. That is real life success. Nassim focuses on the economy. But when I read... (Source)

Marvin Liaoeval(ez_write_tag([[250,250],'theceolibrary_com-leader-2','ezslot_7',164,'0','1'])); My list would be (besides the ones I mentioned in answer to the previous question) both business & Fiction/Sci-Fi and ones I personally found helpful to myself. The business books explain just exactly how business, work & investing are in reality & how to think properly & differentiate yourself. On... (Source)

Vlad TenevThe general concept is applicable to many fields beyond biology, for instance finance, economics and monetary policy. (Source)

See more recommendations for this book...

62
Everyone knows that abuse of statistics is rampant in popular media. Politicians and marketers present shoddy evidence for dubious claims all the time. But smart people make mistakes too, and when it comes to statistics, plenty of otherwise great scientists--yes, even those published in peer-reviewed journals--are doing statistics wrong.

"Statistics Done Wrong" comes to the rescue with cautionary tales of all-too-common statistical fallacies. It'll help you see where and why researchers often go wrong and teach you the best practices for avoiding their mistakes.

In this...
more

See more recommendations for this book...

63
Since Don’t Make Me Think was first published in 2000, over 400,000 Web designers and developers have relied on Steve Krug’s guide to help them understand the principles of intuitive navigation and information design.

In this 3rd edition, Steve returns with fresh perspective to reexamine the principles that made Don’t Make Me Think a classic-–with updated examples and a new chapter on mobile usability. And it’s still short, profusely illustrated…and best of all–fun to read.

If you’ve read it before, you’ll rediscover what made Don’t Make Me Think so essential to Web...
more

Chris GowardHere are some of the books that have been very impactful for me, or taught me a new way of thinking: [...] Don't Make Me Think. (Source)

Nicolae AndronicI’m a technical guy. I studied the IT field and did software development for a long time until I discovered the business world. So the path for me is to slowly adapt from the clear, technical world, to the fuzzy, way more complex, business world. All the books that I recommend help this transition. “Don’t Make Me Think” - Steve Krug: for seeing software with the eyes of the user. (Source)

Nick GanjuAbout usability and making software and user interfaces that are friendly to people. (Source)

See more recommendations for this book...

64
Want to tap the tremendous amount of valuable social data in Facebook, Twitter, LinkedIn, and Google+? This refreshed edition helps you discover who’s making connections with social media, what they’re talking about, and where they’re located. You’ll learn how to combine social web data, analysis techniques, and visualization to find what you’ve been looking for in the social haystack—as well as useful information you didn’t know existed.

Each standalone chapter introduces techniques for mining data in different areas of the social Web, including blogs and email. All you need to...
more

See more recommendations for this book...

65

Practical Data Science with R

Simply put, data science is the discipline of extracting meaning from data. While it can involve deep knowledge of statistics, mathematics, machine learning, and computer science, for most non-academics, data science looks like applying analysis techniques to answer key business questions.

Practical Data Science with R lives up to its name. It explains basic principles without the theoretical mumbo-jumbo and jumps right to the real use cases faced while collecting, curating, and analyzing the data crucial to the success of businesses. Readers will apply the R programming...
more

See more recommendations for this book...

66
There is an explosion of interest in Bayesian statistics, primarily because recently created computational methods have finally made Bayesian analysis tractable and accessible to a wide audience. Doing Bayesian Data Analysis, A Tutorial Introduction with R and BUGS, is for first year graduate students or advanced undergraduates and provides an accessible approach, as all mathematics is explained intuitively and with concrete examples. It assumes only algebra and rusty calculus. Unlike other textbooks, this book begins with the basics, including essential concepts of probability and random... more

See more recommendations for this book...

67
With more than 200 practical recipes, this book helps you perform data analysis with R quickly and efficiently. The R language provides everything you need to do statistical work, but its structure can be difficult to master. This collection of concise, task-oriented recipes makes you productive with R immediately, with solutions ranging from basic tasks to input and output, general statistics, graphics, and linear regression.

Each recipe addresses a specific problem, with a discussion that explains the solution and offers insight into how it works. If you're a beginner, R...
more

See more recommendations for this book...

68

Introduction To Algorithms

Der "Cormen" bietet eine umfassende und vielseitige Einfuhrung in das moderne Studium von Algorithmen. Es stellt viele Algorithmen Schritt fur Schritt vor, behandelt sie detailliert und macht deren Entwurf und deren Analyse allen Leserschichten zuganglich. Sorgfaltige Erklarungen zur notwendigen Mathematik helfen, die Analyse der Algorithmen zu verstehen. Den Autoren ist es dabei gegluckt, Erklarungen elementar zu halten, ohne auf Tiefe oder mathematische Exaktheit zu verzichten. Jedes der weitgehend eigenstandig gestalteten Kapitel stellt einen Algorithmus, eine Entwurfstechnik, ein... more

See more recommendations for this book...

69
1. 1 Welcome to ggplot2 ggplot2 is an R package for producing statistical, or data, graphics, but it is unlike most other graphics packages because it has a deep underlying grammar. This grammar, based on the Grammar of Graphics (Wilkinson, 2005), is composed of a set of independent components that can be composed in many di?erent ways. This makesggplot2 very powerful, because you are not limited to a set of pre-speci?ed graphics, but you can create new graphics that are precisely tailored for your problem. This may sound overwhelming, but because there is a simple set of core principles and... more

See more recommendations for this book...

70

Numsense! Data Science for the Layman

No Math Added

---------------
Reference text for data science in top universities like Stanford and Cambridge. Sold in over 85 countries and translated into more than 5 languages.
---------------

Want to get started on data science?
Our promise: no math added.

This book has been written in layman's terms as a gentle introduction to data science and its algorithms. Each algorithm has its own dedicated chapter that explains how it works, and shows an example of a real-world application. To help you grasp key concepts, we stick to intuitive explanations and...
more

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
71
This practical guide provides more than 150 recipes to help you generate high-quality graphs quickly, without having to comb through all the details of R's graphing systems. Each recipe tackles a specific problem with a solution you can apply to your own project, and includes a discussion of how and why the recipe works.

Most of the recipes use the ggplot2 package, a powerful and flexible way to make graphs in R. If you have a basic understanding of the R language, you're ready to get started.


Use R's default graphics for quick exploration of data
Create a...
more

See more recommendations for this book...

72

Think Stats

If you know how to program, you have the skills to turn data into knowledge using the tools of probability and statistics. This concise introduction shows you how to perform statistical analysis computationally, rather than mathematically, with programs written in Python.

You'll work with a case study throughout the book to help you learn the entire data analysis process—from collecting data and generating statistics to identifying patterns and testing hypotheses. Along the way, you'll become familiar with distributions, the rules of probability, visualization, and many other tools...
more

See more recommendations for this book...

73

Bayesian Data Analysis

Incorporating new and updated information, this second edition of THE bestselling text in Bayesian data analysis continues to emphasize practice over theory, describing how to conceptualize, perform, and critique statistical analyses from a Bayesian perspective. Its world-class authors provide guidance on all aspects of Bayesian data analysis and include examples of real statistical analyses, based on their own research, that demonstrate how to solve complicated problems. Changes in the new edition include:


Stronger focus on MCMC Revision of the computational advice in Part...
more

See more recommendations for this book...

74
The Freakonomics of matha math-world superstar unveils the hidden beauty and logic of the world and puts its power in our hands

The math we learn in school can seem like a dull set of rules, laid down by the ancients and not to be questioned. In How Not to Be Wrong, Jordan Ellenberg shows us how terribly limiting this view is: Math isn’t confined to abstract incidents that never occur in real life, but rather touches everything we do—the whole world is shot through with it.

Math allows us to see the hidden structures underneath the messy and...
more

Bill GatesThe writing is funny, smooth, and accessible -- not what you might expect from a book about math. What Ellenberg has written is ultimately a love letter to math. If the stories he tells add up to a larger lesson, it’s that 'to do mathematics is to be, at once, touched by fire and bound by reason' -- and that there are ways in which we’re all doing math, all the time. (Source)

Auston BunsenI’ve got a few, one book that really impacted me early on as someone coming from a middle-class family was “Rich dad, Poor dad”. Since then I’ve read many books but one that really stands out is “How not to be wrong” by Jordan Ellenberg which really reignited my appetite & appreciation for math. (Source)

Nick GanjuWritten for an audience of people who have historically been intimidated by math [...] and introduces things in a very simple way, and then works up to more complex concepts. (Source)

See more recommendations for this book...

75

Mining of Massive Datasets

The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and which can be used on even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. The PageRank idea and related... more

See more recommendations for this book...

76
How will Artificial Intelligence affect crime, war, justice, jobs, society and our very sense of being human? The rise of AI has the potential to transform our future more than any other technology--and there's nobody better qualified or situated to explore that future than Max Tegmark, an MIT professor who's helped mainstream research on how to keep AI beneficial.

How can we grow our prosperity through automation without leaving people lacking income or purpose? What career advice should we give today's kids? How can we make future AI systems more robust, so that they do...
more

Barack ObamaAs 2018 draws to a close, I’m continuing a favorite tradition of mine and sharing my year-end lists. It gives me a moment to pause and reflect on the year through the books I found most thought-provoking, inspiring, or just plain loved. It also gives me a chance to highlight talented authors – some who are household names and others who you may not have heard of before. Here’s my best of 2018... (Source)

Bill GatesAnyone who wants to discuss how artificial intelligence is shaping the world should read this book. (Source)

Elon MuskA compelling guide to the challenges and choices in our quest for a great future of life, intelligence and consciousness—on Earth and beyond. (Source)

See more recommendations for this book...

77

Data Science

A concise introduction to the emerging field of data science, explaining its evolution, relation to machine learning, current uses, data infrastructure issues, and ethical challenges.

The goal of data science is to improve decision making through the analysis of data. Today data science determines the ads we see online, the books and movies that are recommended to us online, which emails are filtered into our spam folders, and even how much we pay for health insurance. This volume in the MIT Press Essential Knowledge series offers a concise introduction to the...
more

See more recommendations for this book...

78
Why would a casino try and stop you from losing? How can a mathematical formula find your future spouse? Would you know if a statistical analysis blackballed you from a job you wanted?Today, number crunching affects your life in ways you might never imagine. In this lively and groundbreaking new book, economist Ian Ayres shows how today's best and brightest organizations are analyzing massive databases at lightening speed to provide greater insights into human behavior. They are the Super Crunchers. From internet sites like Google and Amazon that know your tastes better than you do, to a... more

See more recommendations for this book...

79
What do flashlights, the British invasion, black cats, and seesaws have to do with computers? In CODE, they show us the ingenious ways we manipulate language and invent new means of communicating with each other. And through CODE, we see how this ingenuity and our very human compulsion to communicate have driven the technological innovations of the past two centuries.

Using everyday objects and familiar language systems such as Braille and Morse code, author Charles Petzold weaves an illuminating narrative for anyone who’s ever wondered about the secret inner life of...
more
Recommended by Ana Bell, and 1 others.

Ana BellIt gets you to use your imagination to virtually build a computer. It’s easy to read, you can lie down on the couch and enjoy it—it’s not so much of a textbook. It demystifies the magic of a computer and what it is. (Source)

See more recommendations for this book...

80
A look inside the algorithms that are shaping our lives and the dilemmas they bring with them.

If you were accused of a crime, who would you rather decide your sentence—a mathematically consistent algorithm incapable of empathy or a compassionate human judge prone to bias and error? What if you want to buy a driverless car and must choose between one programmed to save as many lives as possible and another that prioritizes the lives of its own passengers? And would you agree to share your family’s full medical history if you were told that it would help researchers find a cure for...
more
Recommended by David Smith, Jim Al-Khalili, and 2 others.

David SmithDarroch: “The best book I’ve read recently is called Hello World... It’s about the impact of algorithms across different areas... For me this was the best piece of learning I’ve done in recent months.” (Source)

Jim Al-KhaliliThe fact is, the age of AI is coming fast, and we need to be ready for it. This book will help you decide how worried you should be. (Source)

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
81

Machine Learning

Mitchell covers the field of machine learning, the study of algorithms that allow computer programs to automatically improve through experience and that automatically infer general laws from specific data. less
Recommended by Daniel H Wilson, and 1 others.

Daniel H WilsonYes, Machine Learning is a textbook and I would call it the textbook for machine learning and artificial intelligence. Machine learning is just the math of teaching a machine how to solve a problem on its own, because you’re not going to be able to be there to solve it for the machine. It can be any kind of problem: it could be a robot that needs to figure out how to get from point A to point B... (Source)

See more recommendations for this book...

82
Addressing the prevalent issue of poorly designed quantitative information presentations, this accessible, practical, and comprehensive guide teaches how to properly create tables and graphs for effective and efficient communication. The critical numbers that measure the health, identify the opportunities, and forecast the future of organizations are often misrepresented because few people are trained to design accurate, informative materials, but this manual helps put an end to misinformation. This revised edition of the highly successful book includes updated figures and 91 additional pages... more

See more recommendations for this book...

83
Written as a tutorial to explore and understand the power of R for machine learning. This practical guide that covers all of the need to know topics in a very systematic way. For each machine learning approach, each step in the process is detailed, from preparing the data for analysis to evaluating the results. These steps will build the knowledge you need to apply them to your own data science tasks.Intended for those who want to learn how to use R's machine learning capabilities and gain insight from your data. Perhaps you already know a bit about machine learning, but have never used R; or... more

See more recommendations for this book...

84
Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers will find details for analyzing large datasets, and administrators will learn how to set up and run Hadoop clusters.

Complete with case studies that illustrate how Hadoop solves specific problems, this book helps you:
more

See more recommendations for this book...

85

Reinforcement Learning

An Introduction

The significantly expanded and updated new edition of a widely used text on reinforcement learning, one of the most active research areas in artificial intelligence.

Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the field's key ideas and algorithms. This...
more
Recommended by Zachary Lipton, and 1 others.

Zachary Lipton@innerproduct 1. Tor Lattimore Great book work on bandits (https://t.co/gttspSm40W) and work on causality + bandits (https://t.co/lkwvtEiKvE) 2. Caroline Uhler — Interesting work on causal inference + discovery, causal inference under measurement error etc (https://t.co/I3IRpwmdMd) (Source)

See more recommendations for this book...

87
If you want to learn how to program, working with Python is an excellent way to start. This hands-on guide takes you through the language a step at a time, beginning with basic programming concepts before moving on to functions, recursion, data structures, and object-oriented design. This second edition and its supporting code have been updated for Python 3.

Through exercises in each chapter, you'll try out programming concepts as you learn them. Think Python is ideal for students at the high school or college level, as well as self-learners, home-schooled students, and...
more

See more recommendations for this book...

88
In 1956 two Bell Labs scientists discovered the scientific formula for getting rich. One was mathematician Claude Shannon, neurotic father of our digital age, whose genius is ranked with Einstein's. The other was John L. Kelly Jr., a Texas-born, gun-toting physicist. Together they applied the science of information theory—the basis of computers and the Internet—to the problem of making as much money as possible, as fast as possible.

Shannon and MIT mathematician Edward O. Thorp took the "Kelly formula" to Las Vegas. It worked. They realized that there was even more money to be made...
more
Recommended by P. D. Mangan, and 1 others.

P. D. Mangan@MarquisDeMarche @natstewart5 Great book. (Source)

See more recommendations for this book...

89
How anyone can become a data ninja

From the stock market to genomics laboratories, census figures to marketing email blasts, we are awash with data. But as anyone who has ever opened up a spreadsheet packed with seemingly infinite lines of data knows, numbers aren't enough: we need to know how to make those numbers talk. In The Model Thinker, social scientist Scott E. Page shows us the mathematical, statistical, and computational models--from linear regression to random walks and far beyond--that can turn anyone into a genius. At the core of the book is Page's...
more

See more recommendations for this book...

90
Esta apasionante lectura nos descubre la naturaleza de los procesos arbitrarios de la vida cotidiana y cambia para siempre la percepción que tenemos de ellos. En 1905 Albert Einstein publicó una impactante explicación sobre el movimiento browniano -el movimiento arbitrario de partículas- comparándolo con la clase de movimiento que se observaría en el caminar de un borracho. La comparación se convirtió desde entonces en una poderosa herramienta para entender el movimiento puramente arbitrario que, por definición, no tiene ningún modelo específico.
En este nuevo libro, Leonard Mlodinow...
more
Recommended by David Spiegelhalter, Gabriel Coarna, and 2 others.

David SpiegelhalterThis is a general introduction to the history of probability and the way it comes into everyday life. It intersperses the historical development with modern applications, and looks at finance, sport, gambling, lotteries and coincidences. (Source)

Gabriel CoarnaLeonard Mlodinow's "The Drunkarkd's Walk" -more precisely, the section on the "Monty Hall" problem- totally changed how I look-at/think-about probabilities and choices in general; this has impacted almost every real-life choice I've made since I read this book. (Source)

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
91

Beautiful Evidence

Science and art have in common intense seeing, the wide-eyed observing that generates visual information. Beautiful Evidence is about how seeing turns into showing, how data and evidence turn into explanation. The book identifies excellent and effective methods for showing nearly every kind of information, suggests many new designs (including sparklines), and provides analytical tools for assessing the credibility of evidence presentations (which are seen from both sides: how to produce and how to consume presentations). For alert consumers of presentations, there are chapters on... more
Recommended by Bret Victor, and 1 others.

See more recommendations for this book...

92
The definitive guide to statistical thinking
Statistics are everywhere, as integral to science as they are to business, and in the popular media hundreds of times a day. In this age of big data, a basic grasp of statistical literacy is more important than ever if we want to separate the fact from the fiction, the ostentatious embellishments from the raw evidence -- and even more so if we hope to participate in the future, rather than being simple bystanders.
In The Art of Statistics, world-renowned statistician David Spiegelhalter shows readers how to derive knowledge...
more
Recommended by Dan Davies, and 1 others.

Dan Davies@amoralelite @d_spiegel It's a great book. I thought it was like coming home because I've always tried to avoid calculation due to the dyspraxia and it was just "yes, that's how you think about it" (Source)

See more recommendations for this book...

93
The New York Times bestselling Freakonomics changed the way we see the world, exposing the hidden side of just about everything. Then came SuperFreakonomics, a documentary film, an award-winning podcast, and more.

Now, with Think Like a Freak, Steven D. Levitt and Stephen J. Dubner have written their most revolutionary book yet. With their trademark blend of captivating storytelling and unconventional analysis, they take us inside their thought process and teach us all to think a bit more productively, more creatively, more rationally—to think, that is, like a Freak.
more

See more recommendations for this book...

94
Dashboards have become popular in recent years as uniquely powerful tools for communicating important information at a glance. Although dashboards are potentially powerful, this potential is rarely realized. The greatest display technology in the world won't solve this if you fail to use effective visual design. And if a dashboard fails to tell you precisely what you need to know in an instant, you'll never use it, even if it's filled with cute gauges, meters, and traffic lights. Don't let your investment in dashboard technology go to waste.

This book will teach you the visual...
more

See more recommendations for this book...

95
Taken literally, the title "All of Statistics" is an exaggeration. But in spirit, the title is apt, as the book does cover a much broader range of topics than a typical introductory book on mathematical statistics. This book is for people who want to learn probability and statistics quickly. It is suitable for graduate or advanced undergraduate students in computer science, mathematics, statistics, and related disciplines. The book includes modern topics like nonparametric curve estimation, bootstrapping, and clas- sification, topics that are usually relegated to follow-up courses. The reader... more

See more recommendations for this book...

96

Visual Explanations

Few would disagree: Life in the information age can be overwhelming. Through computers, the Internet, the media, and even our daily newspapers, we are awash in a seemingly endless stream of charts, maps, infographics, diagrams, and data. Visual Explanations is a navigational guide through this turbulent sea of information. The book is an essential reference for anyone involved in graphic, web, or multimedia design, as well as for educators and lecturers who use graphics in presentations or classes.

Jacket design: Dmitry Krasny.
Other artwork by Bonnie Scranton, Dmitry...
more
Recommended by Jeff Atwood, Bret Victor, and 2 others.

See more recommendations for this book...

97

Information is Beautiful

Facts, statistics, issues, theories, relationships, numbers, words - there is just too much information in the world. We need a brand new way to take it all in. 'Information is Beautiful' transforms the ideas surrounding and swamping us into graphs and maps that anyone can follow at a single glance. less

See more recommendations for this book...

98

R in Action

Summary

R in Action is the first book to present both the R system and the use cases that make it such a compelling package for business developers. The book begins by introducing the R language, including the development environment. Focusing on practical solutions, the book also offers a crash course in practical statistics and covers elegant methods for dealing with messy and incomplete data using features of R.
About the Technology
R is a powerful language for statistical computing and graphics that can handle virtually any data-crunching task. It...
more

See more recommendations for this book...

99
In today’s data-driven world, professionals need to know how to express themselves in the language of graphics effectively and eloquently. Yet information graphics is rarely taught in schools or is the focus of on-the-job training. Now, for the first time, Dona M. Wong, a student of the information graphics pioneer Edward Tufte, makes this material available for all of us. In this book, you will learn:

to choose the best chart that fits your data;

the most effective way to communicate with decision makers when you have five minutes of their time;

how to chart...
more

See more recommendations for this book...

100
Second edition of the best selling Python book in the world. A fast-paced, no-nonsense guide to programming in Python. This book teaches beginners the basics of programming in Python with a focus on real projects.

This is the second edition of the best selling Python book in the world. Python Crash Course, 2nd Edition is a straightforward introduction to the core of Python programming. Author Eric Matthes dispenses with the sort of tedious, unnecessary information that can get in the way of learning how to program, choosing instead to provide a foundation in general...
more

See more recommendations for this book...

Don't have time to read the top Data Science books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.