The Dynamics of Micro-Task Crowdsourcing

Post on 05-Aug-2015


The Dynamics of Micro-Task Crowdsourcing

The Case of Amazon MTurk

Djellel Eddine Difallah, Michele Catasta, Gianluca Demartini, Panos Ipeirotis, Philippe Cudré-Mauroux

WWW’15 - 20th May 2015 - Florence

Background

Crowdsourcing is an effective solution to certain classes of problems


Background

A Crowdsourcing Platform allows requesters to publish a crowdsourcing request (batch) composed of multiple tasks (HITs)

Requesters can programmatically invoke the crowd through APIs


Background

Paid Microtask Crowdsourcing scales out but remains highly unpredictable



[Plot: batch throughput over time, measured in #HITs per minute]

SLAs are expensive


MTurk is a Marketplace for HITs

Direct: price, time of day, #workers, #HITs, etc.

Other: forums, reputation systems (TurkOpticon), recommendation systems (Openturk)

A Data-Driven Approach


...Five Years Later [2009-2014]

mturk-tracker collected 2.5 million different batches, with over 130 million HITs


mturk-tracker.com

● Collects metadata about each visible batch (title, description, reward, required qualifications, HITs available, etc.)

● Records batch progress (every ~20 minutes)

Note that the tracker only reports data periodically and does not capture fine-grained information (e.g., real-time variations)
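Snapshots of batch progress like these can be turned into a throughput estimate (#HITs completed per minute) by differencing consecutive observations. A minimal sketch; the timestamps and HIT counts below are invented for illustration:

```python
from datetime import datetime

# Hypothetical snapshots for one batch: (timestamp, HITs still available),
# taken roughly every 20 minutes as the tracker does
snapshots = [
    (datetime(2014, 6, 1, 12, 0), 1000),
    (datetime(2014, 6, 1, 12, 20), 940),
    (datetime(2014, 6, 1, 12, 41), 880),
]

def throughput(snaps):
    """HITs completed per minute between consecutive snapshots."""
    rates = []
    for (t0, h0), (t1, h1) in zip(snaps, snaps[1:]):
        minutes = (t1 - t0).total_seconds() / 60
        rates.append((h0 - h1) / minutes)
    return rates

print(throughput(snapshots))  # -> [3.0, 2.857142857142857]
```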


Menu

1. Notable Facts Extracted from the Data

2. Large-scale HIT Type Classification

3. Analyzing the Features Affecting Batch Throughput

4. Market Analysis


1) Notable Facts Extracted from the Data


Country-Specific HITs


US and India?

Country-Specific HITs

Workers from the US, India, and Canada are the most sought after.

Distribution of Batch Size


“Power-law”

Evolution of Batch Sizes

Very large batches start to appear


HIT Pricing


Is 1-cent per HIT the norm?

HIT Pricing


5 cents is the new 1 cent

Requesters and Reward Evolution


Increasing number of new and distinct requesters

2) Large-scale HIT Type Classification


Classify HITs into types (Gadiraju et al. 2014):

- Information Finding (IF)
- Verification and Validation (VV)
- Interpretation and Analysis (IA)
- Content Creation (CC)
- Surveys (SU)
- Content Access (CA)


HIT Classes

Supervised Classification with the Crowd

We trained a Support Vector Machine (SVM) model:

- Features: HIT title, description, keywords, reward, date, allocated time, and batch size
- Labeled data created on MTurk for 5,000 uniformly sampled HITs; our labeling HIT used 3 repetitions
- Consensus reached for 89% of the tasks
- 10-fold cross-validation: precision of 0.895, recall of 0.899, F-measure of 0.895
- We then performed a large-scale classification for all 2.5M HITs

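A classifier of this shape can be sketched with scikit-learn. This is a toy illustration, not the authors' pipeline: the six training examples below are invented, and the real model also used non-textual features (reward, date, allocated time, batch size).

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Invented mini-dataset: HIT text (title + description) and its type label
texts = [
    "Transcribe the audio clip into text",           # Content Creation
    "Write a short product description",             # Content Creation
    "Find the official website of this company",     # Information Finding
    "Search for the email address of this person",   # Information Finding
    "Answer a short survey about shopping habits",   # Survey
    "Complete this demographic questionnaire",       # Survey
]
labels = ["CC", "CC", "IF", "IF", "SU", "SU"]

# Bag-of-words (tf-idf) text features fed into a linear SVM
clf = make_pipeline(TfidfVectorizer(), LinearSVC())
clf.fit(texts, labels)
print(clf.predict(["Fill in this survey about your diet"]))
```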

Distribution of HIT Types

Fewer Content Access batches

Content Creation is the most popular type

3) Analyzing the Features Affecting Batch Throughput


[Plot: batch throughput over time, measured in #HITs per minute]

Batch Throughput Prediction

29 Features

HIT Features

HITs available, start time, reward, description length, title length, keywords, requester_id, time allotted, task type, age (minutes), etc.

Market Features

Total HITs available, HITs arrived, rewards arrived, % HITs completed, etc.


Batch Throughput Prediction

[Diagram: timeline with a training window of length delta ending at time T]

- Predict batch throughput at time T by training a Random Forest regression model on samples taken in the [T-delta, T) time span
- 29 features (including the type of the batch)
- Hourly data in the range June-October 2014
- We sampled 50 time points for evaluation purposes


We are interested in cases where the prediction works reasonably well.
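A Random Forest regressor of this shape can be sketched with scikit-learn on synthetic data. The three features and the target function below are invented stand-ins for the paper's 29 features, just to show the training and scoring mechanics:

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
n = 500

# Invented features standing in for batch size, reward, and batch age
hits_available = rng.integers(1, 5000, n).astype(float)
reward_cents = rng.integers(1, 50, n).astype(float)
age_minutes = rng.integers(1, 10000, n).astype(float)
X = np.column_stack([hits_available, reward_cents, age_minutes])

# Synthetic throughput: bigger, better-paid, younger batches move faster
y = (0.01 * hits_available + 0.2 * reward_cents
     - 0.001 * age_minutes + rng.normal(0, 1, n))

model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X, y)
print(model.score(X, y))  # training R^2
```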

Predicted vs. Actual Batch Throughput (delta=4 hours)

Prediction works best for larger batches with large momentum


Significant Features

- Which features contribute most when the prediction works reasonably well?
- We proceed by feature ablation: re-run the prediction, removing 1 feature at a time
- 1000 samples


The most significant features:

HITs_Available (Number of tasks in the batch)

Age_Minutes (how long ago the batch was created)

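Feature ablation of this kind can be sketched as a loop: re-fit the model with each feature removed and measure the drop in cross-validated score. The toy data below is constructed so that the first feature dominates; the feature names are illustrative only:

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)
n = 400
X = rng.normal(size=(n, 3))
# Toy target: the first feature matters most, the third not at all
y = 3 * X[:, 0] + X[:, 1] + rng.normal(0, 0.1, n)

feature_names = ["hits_available", "age_minutes", "irrelevant"]

def cv_score(X_):
    model = RandomForestRegressor(n_estimators=50, random_state=0)
    return cross_val_score(model, X_, y, cv=3).mean()

base = cv_score(X)
drops = {}
for i, name in enumerate(feature_names):
    # Ablate feature i and record how much the score drops
    drops[name] = base - cv_score(np.delete(X, i, axis=1))

# The largest score drop marks the most important feature
print(max(drops, key=drops.get))  # -> hits_available
```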

4) Market Analysis


Demand - The number of new tasks published on the platform by the requesters

Supply - The workforce that the crowd provides

Supply Elasticity

How does the market react when new tasks arrive on the platform?


Supply Elasticity

We regressed the percentage of work done (within 1 hour) against the number of new HITs


Supply Elasticity

Intercept = 2.5, Slope = 0.5%

20% of new work gets completed within an hour


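The regression behind these numbers is an ordinary least-squares fit of the percentage of new work completed within an hour against the volume of new HITs. A hand-rolled sketch on fabricated points placed exactly on the reported line (intercept 2.5, slope 0.5; the x-axis units are my assumption):

```python
# Ordinary least-squares fit of y = intercept + slope * x
def fit_line(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return my - slope * mx, slope

# Fabricated observations lying exactly on the reported regression line:
# x = new HITs arrived (hypothetical units), y = % of that work done in 1 hour
new_hits = [1, 5, 10, 20, 40]
pct_done = [2.5 + 0.5 * x for x in new_hits]

intercept, slope = fit_line(new_hits, pct_done)
print(intercept, slope)  # approximately 2.5 and 0.5
```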

Demand and Supply Periodicity

[Plots: demand and supply over time]

Strong weekly periodicity (7-10 days).
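One standard way to expose such periodicity is the autocorrelation of the daily demand (or supply) series: the lag with the highest autocorrelation is the dominant period. A self-contained sketch on a synthetic series with a built-in 7-day cycle (a stand-in for real tracker data):

```python
import math

# Synthetic daily demand with a weekly cycle
series = [10 + 5 * math.sin(2 * math.pi * day / 7) for day in range(56)]

def autocorr(xs, lag):
    """Autocorrelation of xs at the given lag, normalized by total variance."""
    n = len(xs)
    m = sum(xs) / n
    num = sum((xs[i] - m) * (xs[i + lag] - m) for i in range(n - lag))
    den = sum((x - m) ** 2 for x in xs)
    return num / den

# Search lags from 2 to 14 days for the strongest self-similarity
best_lag = max(range(2, 15), key=lambda lag: autocorr(series, lag))
print(best_lag)  # -> 7
```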

Conclusions

- Long-term data analysis uncovers hidden trends
- Large-scale HIT classification
- Important features in throughput prediction (HITs available, Age_minutes)
- Supply is elastic (more work available -> more work done)
- Supply and demand are periodic (7-10 days)

Is a Crowdsourcing Marketplace the right paradigm for efficient and predictable crowdsourcing?



Q&A

Djellel Difallah

ded@exascale.info