Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data...

22
Jordan Raddick Johns Hopkins University Open Citizen Science: Galaxy Zoo / Zooniverse case study

Transcript of Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data...

Page 1: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Jordan Raddick

Johns Hopkins University

Open Citizen Science: Galaxy Zoo / Zooniverse case study

Page 2: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Context: “Data Avalanche”

• “Data avalanche” in all

areas of science

• Doubling time of science

data is one year

(Szalay & Gray 2006)

Szalay, A.S. & Gray, J. 2006. “2020

Computing: Science in an exponential world,”

Nature, 440, 413-414.

Page 3: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Context: Beating the Machine

• Some scientific problems can be solved only

by humans

• Lots of data, very few astronomers

spiral elliptical

Page 4: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Problem: Kevin’s Sanity (2007)

• Kevin Schawinski

(Oxford astronomy

grad student)

• His advisor asked

him to classify

50,000 galaxies… by Friday

• He complained to a friend in a pub

• But there are 850,000 more!

Page 5: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Solution: Galaxy Zoo

Page 6: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Galaxy Zoo: week one

• June 19, 2007:

spot on BBC

morning radio,

website

• Traffic melted web

server

• 50,000 galaxies =

1 “Kevin-week”

Page 7: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Result: Searchable dataset

• 900,000 galaxies

• ~40 classifications

each

• Provably reliable

• Online, free

• Other data available

http://skyserver.sdss3.org

Page 8: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

“Probably spirals”

265,000 galaxies to 80% consensus

select top 15 objid, ra, dec

from zoospec

where p_cw+p_acw+p_edge > 0.8

order by newid()

select count(*)

from zoospec

where (p_el > 0.8 or p_cw > 0.8 or

p_acw > 0.8 or p_edge > 0.8 or

p_dk > 0.8 or p_mg > 0.8)

Page 9: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

“Almost certainly spirals”

select top 15 objid, ra, dec

from zoospec

where p_cw+p_acw+p_edge > 0.8

order by newid()

select count(*)

from zoospec

where (p_el > 0.8 or p_cw > 0.8 or

p_acw > 0.8 or p_edge > 0.8 or

p_dk > 0.8 or p_mg > 0.8)

42,000 galaxies to 95% consensus

Page 10: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

“We have no idea”

select top 15 objid, ra, dec

from zoospec

where (p_el < 0.5 and p_cw< 0.5

and p_acw < 0.5 and p_edge < 0.5

and p_dk < 0.5 and p_mg < 0.5)

order by newid()

select count(*)

from zoospec

where (p_el < 0.5 and p_cw< 0.5

and p_acw < 0.5 and p_edge < 0.5

and p_dk < 0.5 and p_mg < 0.5)

165,000 galaxies to less than 50% consensus

Page 11: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Galaxy Zoo science

• We have 40 peer-reviewed publications and

counting…

• 203 papers with “Galaxy Zoo” in title or

abstract (http://bit.ly/zoopapers)

Page 12: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

“Galaxy Zoo Peas”

“Give peas a chance” (joke)

Spectral analysis using pro tools

Published paper:

Page 13: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

The Volunteers

Page 14: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Motivations to Volunteer

• Interviews & posts coded into themes (3 raters)

• Twelve categories of motivation

• Follow-up survey (n=10,532)

Page 15: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Zooniverse.org

• “Portal” for

citizen scientists

• Many projects

accessible

– Volunteers

move from one

to another

• Tech platform

(Ruby on Rails, MongoDB, Amazon cloud)

• Structure for adding new projects

Page 16: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

OldWeather.com

• Historical temperature data for climate change

Page 17: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

PlanetHunters.org

• Search for extrasolar planets in star data

Page 18: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

SnapshotSerengeti.org

• Animals in motion-camera photos from Africa

Page 19: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Proposing New “Zoos”

• Apply for

“click time”

• Peer-

reviewed

• We’ll help

you build

your “Zoo”

Page 20: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

The big dream for science

• Citizen science should

be a standard tool in

scientists’ toolkit

• Analogy: observatory

• Part of open science

Page 21: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

The big dream for volunteers

• Participating in

science should be

as common as

participation in sports

today

• Difference: citizen

science adds to goals

of professional science

• Difference: everybody

wins

Page 22: Open Citizen Science...Open Citizen Science: Galaxy Zoo / Zooniverse case study Context: “Data Avalanche” • “Data avalanche” in all areas of science • Doubling time of

Acknowledgements

400,000 Zooniverse volunteers, including…

• Mark Wiltshire

• Jerrold Grochow

• Alexandra Walker

Contact information

Jordan Raddick

[email protected]

410-516-8889 Football photos from Flickr:

(1) Hector Alejandro

(2) Maryland Government