Webscraping for jounalists
9
Webscraping for journalists CAJ May 13, 2011 “A little Wget magic”
-
Upload
glen-mcgregor -
Category
News & Politics
-
view
1.775 -
download
1
description
From a presentation I have at the Canadian Association of Journalists on how journalists can learn to web scrape. Most of the presentation was real-time demos not included in this PPT deck.
Transcript of Webscraping for jounalists
Webscraping for journalistsCAJ May 13, 2011
“A little Wget magic”
Webscraping
Using software that simulates a web browser to download large quantities of information from a web site.
Why webscrape?
• Assemble your own copy of online data• Save time pointing-and-clicking
Why webscrape?
• Data publishers (governments) want you to access data on their terms
Tools for scraping
• DownThemAll (2)
• APIs• Wget• Custom scripts