How Is Social Science Data Collected? Big Data!

This article is an excerpt from the Shortform book guide to "Everybody Lies" by Seth Stephens-Davidowitz. Shortform has the world's best summaries and analyses of books you should be reading.

Like this article? Sign up for a free trial here .

How can we use big data to study social science? How does data give us more insight into the social sciences?

Through search data, researchers can discover psychological and sociological information that traditional surveys couldn’t provide. Seth Stephens-Davidowitz, the author of Everybody Lies, uses Freud’s theories of sexuality as an example.

Read how to receive social science data with the help of big data.

Data Gives the Social Sciences More Rigor

In addition to improving our natural intuition, big data studies can help make social science data more rigorous. Stephens-Davidowitz notes that traditionally, there’s a divide between hard sciences (such as physics and chemistry) and soft sciences (such as psychology and sociology). That divide boils down to differences in method and types of evidence, with critics accusing the social sciences of advancing theories that can’t be falsified. 

Stephens-Davidowitz gives the example of Freud’s theories of sexuality, which Freud based on his own observations and interpretations rather than on experimental evidence. Stephens-Davidowitz shows how Google and Pornhub search data let us test these previously untestable ideas (he finds no evidence for Freud’s claim that phallic symbols in dreams reveal latent desires; on the other hand, he finds a surprising number of searches for parent-child incest videos, suggesting some truth to Freud’s Oedipal theory).

Big Data and the Replication Crisis

It’s worth wondering about the role of data-driven research in light of science’s (including psychology’s) ongoing replication crisis—in which an alarming number of experimental findings can’t be reproduced in follow-up experiments. 

On one hand, some of the explanations for the replication problem have to do with experimental sample sizes. In some cases, unreliable studies might use undersized samples, which increases the chances of finding false positives. Similarly, some scientists have been accused of consciously or unconsciously manipulating their data, for example, by ending an experiment as soon as they find a statistically significant result—a practice that likewise increases the chances of faulty findings. 

How Is Social Science Data Collected? Big Data!

———End of Preview———

Like what you just read? Read the rest of the world's best book summary and analysis of Seth Stephens-Davidowitz's "Everybody Lies" at Shortform .

Here's what you'll find in our full Everybody Lies summary :

  • How people confess their darkest secrets to Google search
  • How this "big data" can be used in lieu of voluntary surveys
  • The unethical uses and limitations of big data

Katie Doll

Somehow, Katie was able to pull off her childhood dream of creating a career around books after graduating with a degree in English and a concentration in Creative Writing. Her preferred genre of books has changed drastically over the years, from fantasy/dystopian young-adult to moving novels and non-fiction books on the human experience. Katie especially enjoys reading and writing about all things television, good and bad.

Leave a Reply

Your email address will not be published. Required fields are marked *