Wikipedia Day 2011 Talk

download Wikipedia Day 2011 Talk

If you can't read please download the document

Transcript of Wikipedia Day 2011 Talk

Wikipedia Day

Free Software and Wikipedia

A bit about Wikipedia

The word Wikipedia is a portmanteau from wiki (a technology for creating collaborative websites, from the Hawaiian word wiki, meaning "quick") and encyclopedia.

8th Most popular website in the world according to Alexa.

There are 3,527,596 articles, and 22,896,500 pages in total.

There have been 437,835,862 edits.

There are 848,745 uploaded files.

There are 13,750,719 registered users, including 1,769 administrators.

This information is correct as of 07:08 (UTC) on January 13, 2011.

Wikipedia receives between 25,000 and 90,000 page requests per second, depending on time of day.

Employs 50 people to keep it running

http://en.wikipedia.org/wiki/Wikipediahttp://www.alexa.com/siteinfo/wikipedia.orghttp://en.wikipedia.org/wiki/Wikipedia:Abouthttp://toolserver.org/%7Eleon/stats/reqstats/reqstats-monthly.png

Some common misconceptions

Wikipedia != WikileaksThey just use some of the same software, they are not affiliated.

Wikimedia != Wikipedia Wikipedia is one of the projects operated by Wikimedia. Others include Wiktionary, Wikiquote, Wikibooks, Wikisource, Wikimedia Commons, Wikispecies, Wikinews, Wikiversity, Wikimedia Incubator and Meta-Wiki.

The Wikimedia Foundation's stated goal is to develop and maintain open content, wiki-based projects and to provide the full contents of those projects to the public free of charge.

Who invented the Media Wiki software?The current software was originally written for Wikipedia by Lee Daniel Crocker to replace the Perl based UseModWiki that Wikipedia originally used.

http://en.wikipedia.org/wiki/Wikimedia_Foundation

http://en.wikipedia.org/wiki/MediaWiki#History

Why should I contribute to Wikipedia?

Wikipedia is a great educational resource

Learn new things

To help address the imbalance 87% of Wikipedia editors are male.

To help fulfill the goal of making the sum of all human knowledge available to everyone on the planet for free.

How can I contribute?

Use it!

Donations

EditsImproving existing articlesStart your own

Translations especially importantOther languages are lagging way behind Englishhttp://stats.wikimedia.org/EN/PlotsPngWikipediansContributors.htm

Translate articles into English from other languages or vice versa

http://en.wikipedia.org/wiki/Wikipedia:Pages_needing_translation_into_English

Humans neededMachine translation, especially between unrelated languages (e.g. English and Japanese), produces very low quality results. Wikipedia consensus is that an unedited machine translation, left as a Wikipedia article, is worse than nothing.

http://en.wikipedia.org/wiki/Wikipedia:Translation

LanguageNumber of articles

English3,528,812

German1,175,939

French1,055,863

Polish766,348

Italian764,027

Japanese728,066

Spanish701,847

Portuguese667,002

Dutch665,634

Russian649,558

Translations are especially useful for education

Wikipedia can be downloaded for offline viewing which can be invaluable in remote areas.http://schools-wikipedia.org/

http://dumpathome.evopedia.info/

Why Take Wikipedia Offline?Because how can knowledge equal freedomif people need a modem to get it?http://thewikireader.com/why/

A quick look behind the scenes

Wikipedia relies on Free Software

All their servers use Ubuntu

PHP

My SQL

Apache

MediaWiki

More...

The site started as a Perl CGI script running on single server in 2001. Wikipedia now has 200 application servers, 20 database servers and 70 servers dedicated to Squid cache servers.Wikipedia is powered by the MediaWiki software, which was originally written to run Wikipedia and is now an open source project. MediaWiki uses PHP running on a MySQL database. Mituzas said MySQL instances range from 200 to 300 gigabytes. In addition to Squid, Wikipedia uses Memcached and the Linux Virtual Server load balancer. Wikipedia alsouses database sharding to set up master-slave relationships between Databases.

http://www.datacenterknowledge.com/archives/2008/06/24/a-look-inside-wikipedias-infrastructure/

The importance of Free Software to Wikipedia

In an interview with the Free Software Foundation Jimmy Wales said...

"People sometimes ask me why Im so adamant that Wikipedia must always use free software, even when in some cases it might be the case that proprietary software might be more convenient or better suited for some particular need that we have.

...

Free knowledge requires free software. It is a conceptual error to think about our mission as being somehow separate from that."

http://www.fsf.org/working-together/profiles/wikipedia

Serving up the content-how they use Free Software

All multimedia in the Wikipedia database is stored using the free Ogg Theora and Ogg Vorbis formats.http://blog.wikimedia.org/blog/2009/01/26/mozilla-and-wikimedia-join-forces-to-support-open-video/

Some of the Free Software Wikipedia uses:

A LAMP stack:Ubuntu GNU Linux OS

Apache - webserver

MySQL - database

PHP for coding the web front end

But that's not it...

Media Wiki for content management

Squid for caching data which allows them to significantly reduce their bandwidth use.Squid has a wide variety of uses, from speeding up a web server by caching repeated requests; to caching web, DNS and other computer network lookups for a group of people sharing network resources; to aiding security by filtering traffic.

The Squid web site claims that if working in front of the server application, it can improve performance by up to four times. Squid is especially efficient in case of (probably unexpected) high traffic to one or several particular pages.http://en.wikipedia.org/wiki/Squid_%28software%29

Memcached - is a general-purpose distributed memory caching system is often used to speed up dynamic database-driven websites by caching data and objects in RAM to reduce the number of times an external data source (such as a database or API) must be read.

How they use Free Software to help with performance

What's the Creative Commons and why is it important?

The Creative Commons organisation

seeks to support the building of a richer public domain by providing an alternative to the automatic "all rights reserved" copyright, dubbed "some rights reserved."

http://en.wikipedia.org/wiki/Creative_commons

http://www.youtube.com/watch?v=2BESbnMJg9M

So what are the licenses?There are six major licenses of the Creative Commons:

* Attribution (CC-BY) * Attribution Share Alike (CC-BY-SA) * Attribution No Derivatives (CC-BY-ND) * Attribution Non-Commercial (CC-BY-NC) * Attribution Non-Commercial Share Alike (CC-BY-NC-SA) * Attribution Non-Commercial No Derivatives (CC-BY-NC-ND)

Creative Commons licenses in a nutshell

If something is licensed under the Creative Commons by default you have a "core right" to redistribute a work for non-commercial purposes without modification, NC and ND make it non-free.

BY Attributation requires attributation to the original author.

SA Share Alike - allowing derivative works under the same or a similar license (later or jurisdiction version).

NC Non Commercial - requiring the work is not used for commercial purposes.

ND - No Derivative Works - allowing only the original work, without derivatives.

Wikipedia uses CC-BY-SA

The Wikipedia and Free Software philosophy

The Right to Readhttp://www.gnu.org/philosophy/right-to-read.html

The Creative Commons licensesThis is important in keeping the content free-concerned with free access.

Most of Wikipedia's text and many of its images are dual-licensed under the Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA) and the GNU Free Documentation License (GFDL) (unversioned, with no invariant sections, front-cover texts, or back-cover texts).http://en.wikipedia.org/wiki/Wikipedia:About#Trademarks_and_copyrights

Collaboration wisdom of the crowds everyone has something to contributehttp://en.wikipedia.org/wiki/Wikipedia:Errors_in_the_Encyclop%C3%A6dia_Britannica_that_have_been_corrected_in_Wikipedia

CensorshipJimmy Wales has made clear that censorship is never acceptablehttp://www.youtube.com/watch?v=257swlMMxmA

He has also come out against Internet censorship and the importance of keeping knowledge fee.http://www.youtube.com/watch?v=0suY1pMJN08

Contributing to Free Software

You can contribute to free software projects in many of the same ways you can to Wikipedia:

Use it

Documentation

Translations

Testing

Editing code changes instead of page changes

Promotion

http://libreofficeaustralia.org/ca/community/teams

More information & Resources

http://ten.wikipedia.org/wiki/Main_Page

http://www.datacenterknowledge.com/archives/2008/06/24/a-look-inside-wikipedias-infrastructure/

Jimmy Wales' talk at Bristol University for Wikipedia's 10th anniversaryhttp://www.ustream.tv/recorded/11988215

Wikimedia chooses Ubuntu for all of its servershttp://www.canonical.com/about-canonical/resources/case-studies/wikimedia-chooses-ubuntu-all-its-servers

Creative Commonshttp://creativecommons.org/

http://www.youtube.com/watch?v=2BESbnMJg9M

www.diffen.com/difference/WikiLeaks_vs_Wikipedia