Download - The art and science of data-driven journalism

Transcript
Page 1: The art and science of data-driven journalism

The Art and Science of Data-Driven Journalism

Alexander B. HowardTow Fellow, Columbia University

May 30, 2014

Page 2: The art and science of data-driven journalism

You know something, John Snow.

Page 3: The art and science of data-driven journalism

This John Snow knew something.

Page 4: The art and science of data-driven journalism

Newspapers have used data for centuries

Source: The Guardian

Page 5: The art and science of data-driven journalism

1960s: computer-assisted reporting (CAR)

Bob Woodward, via Cliff1066

Page 6: The art and science of data-driven journalism

Traditional tools applying tech to journalism…

• Calculators and Graphs• Mainframe and PCs• Spreadsheets• Databases• Text and code editors• Statistics • Programming

Page 7: The art and science of data-driven journalism

In the 1990s, government and civil society spread the Internet globally

Page 8: The art and science of data-driven journalism

In the 2000s, mobile phones and social networking connected us ever more

Page 9: The art and science of data-driven journalism

In the 2010s, data creation exploded.

Image Credit: Real Time Rome from Senseable.MIT.edu

Page 10: The art and science of data-driven journalism

“Data-driven journalism is the future”

Source: Tim Berners-Lee in the Guardian

Page 11: The art and science of data-driven journalism

…combined with new tools & context…

• Online spreadsheets and wikis• Data visualization tools• Open source frameworks • Code sharing• Agile development• Cloud storage and processing (EC2 & Heroku)• More data and more access• Privacy and security riskss

Page 12: The art and science of data-driven journalism

2014: data journalism is the present

Gathering, cleaning, organizing, analyzing, visualizing and publishing data to support

the creation of acts of journalism

Page 13: The art and science of data-driven journalism
Page 14: The art and science of data-driven journalism

Trendy but not new

• The collection, protection and interrogation of data as a source, complementing traditional “shoe leather” investigative reporting relying on witnesses, experts and authorities

Page 15: The art and science of data-driven journalism
Page 16: The art and science of data-driven journalism

Dollars for Docs

Page 17: The art and science of data-driven journalism

The Guardian

Page 18: The art and science of data-driven journalism

Chicago Tribune

• Flame retardants

Page 19: The art and science of data-driven journalism
Page 21: The art and science of data-driven journalism
Page 22: The art and science of data-driven journalism

Los Angeles Times

Page 23: The art and science of data-driven journalism
Page 25: The art and science of data-driven journalism

Reuters: Connected China

Page 26: The art and science of data-driven journalism
Page 27: The art and science of data-driven journalism
Page 28: The art and science of data-driven journalism
Page 29: The art and science of data-driven journalism

Best practices?

Page 30: The art and science of data-driven journalism

Report it out

Page 31: The art and science of data-driven journalism
Page 32: The art and science of data-driven journalism

Show people something new about the world

Page 33: The art and science of data-driven journalism
Page 34: The art and science of data-driven journalism

Tell a story

Page 36: The art and science of data-driven journalism

Storytelling still matters.

“We use these tools to find and tell stories. We use them like we use a telephone. The story is still the thing.”

- Anthony DeBarros USA Today

Source: Data Journalism and the Big Picture

Page 37: The art and science of data-driven journalism

Make it personal

Page 38: The art and science of data-driven journalism
Page 39: The art and science of data-driven journalism

Understand the context for the data

Page 40: The art and science of data-driven journalism
Page 41: The art and science of data-driven journalism

Show your data

Page 42: The art and science of data-driven journalism
Page 43: The art and science of data-driven journalism

Show your work

Page 44: The art and science of data-driven journalism
Page 45: The art and science of data-driven journalism

Share your code

Page 46: The art and science of data-driven journalism
Page 47: The art and science of data-driven journalism

Consider ethics

Page 48: The art and science of data-driven journalism

Questions

• Is the data clean?• Is the data representative?• What biases might be hidden in the data?• Was the data legally obtained?• Does the data contain personally identifiable

information (PII)?

Page 49: The art and science of data-driven journalism

Collection

• Who gathered the data? How?• Was it clear how data would be used?• Can people opt-out of collection or

usage?• “Notice and consent” is not enough• “Privacy by design” applies to news apps

Page 50: The art and science of data-driven journalism
Page 51: The art and science of data-driven journalism

Data Analysis & Numeracy

• N = ?• Average vs Median• Statistical significance?• Correlation != causation• Regression to the mean

Page 52: The art and science of data-driven journalism
Page 53: The art and science of data-driven journalism

Presentation

Page 54: The art and science of data-driven journalism

Bad Data Vizwtfviz.net

Page 55: The art and science of data-driven journalism

Present data with context, in context

Page 56: The art and science of data-driven journalism

Be aware of de-anonymization risks

Page 57: The art and science of data-driven journalism

Emerging trends

Page 59: The art and science of data-driven journalism

Networked reporting of corruption

ICIJ: Offshore Leaks

Page 60: The art and science of data-driven journalism

International Consortium of Investigative Journalists

Offshoring $80 journalists 40 countries 260 gigabytes2.5 million files

Page 61: The art and science of data-driven journalism

Create your data“If Stage 1 of data journalism was “find and scrape

data,” then…

Stage 2 was “ask government agencies to release data” in easy to use formats.

Stage 3 is going to be “make your own data”, and those sources of data are going to be automated and updated in real-time.”

-Javaun Moradi, Mozilla

Page 63: The art and science of data-driven journalism

Networked accountability

Page 64: The art and science of data-driven journalism

Bus route in Nairobi, Kenya

Page 65: The art and science of data-driven journalism

Sensor Journalism

Page 66: The art and science of data-driven journalism
Page 67: The art and science of data-driven journalism
Page 68: The art and science of data-driven journalism

Citizens as Sensors: Andhra Pradesh

Page 69: The art and science of data-driven journalism

Drones + data collection

Page 70: The art and science of data-driven journalism

Privacy challenges

Page 71: The art and science of data-driven journalism
Page 72: The art and science of data-driven journalism

Open Data, FOIA & Press Freedom

Page 73: The art and science of data-driven journalism

An expanding number of data sources

Page 74: The art and science of data-driven journalism
Page 75: The art and science of data-driven journalism
Page 76: The art and science of data-driven journalism

Social data and crisis data

Page 77: The art and science of data-driven journalism

Open government data platforms

Page 78: The art and science of data-driven journalism
Page 79: The art and science of data-driven journalism
Page 80: The art and science of data-driven journalism

Fauxpen DataIn an age of “openwashing”…

We need to:

Evaluate licenses.

Peruse the Terms of Service.

Review the governance.

Look at community.

Check the format.

Page 81: The art and science of data-driven journalism
Page 82: The art and science of data-driven journalism
Page 84: The art and science of data-driven journalism

Accountability for “personalized redlining”

• Gun map graphic

Page 85: The art and science of data-driven journalism

Transparency for geographic profiling

• Gun map graphic

WSJ: Websites vary prices, based upon user information

Page 87: The art and science of data-driven journalism

Investigating human tissue trafficking

• Gun map graphic

ICIJ: The data behind skin and bone

Page 88: The art and science of data-driven journalism

Data + journalism + activism + responsive institutions = social change

Page 89: The art and science of data-driven journalism

The fun part: predictions, prognostications and recommendations!

Page 90: The art and science of data-driven journalism

1) Data will become even more of a strategic resource for media.

Page 91: The art and science of data-driven journalism

2) Better tools will emerge that democratize data skills.

Page 92: The art and science of data-driven journalism

3) News apps will explode as a primary way people consume data journalism.

Page 93: The art and science of data-driven journalism

4) Being digital first means being data-centric and mobile-friendly.

Page 94: The art and science of data-driven journalism

5. Expect more robo-journalism. Human relationships and storytelling still matter.

Page 95: The art and science of data-driven journalism

6) More journalists will need to study the social sciences and statistics.

Source: Ed Yong

Page 96: The art and science of data-driven journalism

7) There will be higher standards for accuracy and corrections.

Source: Jake Harris

Page 97: The art and science of data-driven journalism

8) Competency in security and data protection will become more important.

Source: Jake Harris

Page 98: The art and science of data-driven journalism

9) Demand for more transparency on reader data collection and use.

Source: eConsultancy

Page 99: The art and science of data-driven journalism

10) More conflicts over public records, data scraping, and ethics will arise.

• Gun map graphic

Page 100: The art and science of data-driven journalism

12) Data-driven personalization and predictive news in wearables.

Page 101: The art and science of data-driven journalism

13) More diverse newsrooms will produce better (data) journalism.

SOURCE: The Atlantic

A 2013 ASNE survey of 68 online news organizations found that 63% of them had no minorities.

Page 102: The art and science of data-driven journalism

14) Be mindful of data-ism and bad data. Embrace skepticism.