World Seer : A Realtime Geo-Tweet Photo Mapping...

2
World Seer : A Realtime Geo-Tweet Photo Mapping System Keiji Yanai Department of Informatics, The University of Electro-Communications Chofu, Tokyo 182-8585 JAPAN [email protected] ABSTRACT Twitter is a unique microblog which is different from con- ventional social media in terms of its quickness. Many Twit- ter’s users send messages to Twitter on the spot with mo- bile phones or smart phones, and some of them send tweets with photos and geotags, which can be regarded as being geotagged photos. Geotagged tweet photos are very useful to understand what happens currently over the world. In the demo, we introduce “World Seer” which is a real-time geo-tweet photo mapping system. Users can see the latest geo-tweet photos related to given keywords and areas on the online maps. The system shows geo-tweet photos not only on the map, but also on the street-view. In addition, for some parts of the geo-tweet photos, the system can show representative photos for the given locations and the given times employing the GeoVisualRank method which takes into account both visual features of photos and proximity of geotags. Categories and Subject Descriptors H.4 [Information Systems Applications]: Miscellaneous General Terms Algorithms, Design, Experimentation Keywords geotagged photo, geotag, Twitter, geo-tweet 1. INTRODUCTION Twitter is a unique microblog, which is different from conventional social media in terms of its quickness. Many Twitter’s users send messages, which is commonly called “tweets”, to Twitter on the spot with mobile phones or smart phones, and some of them send photos and geotags as well as tweets. Then, we can regard Twitter as a data source of geotagged photos. In fact, we collected about 75 thousand geotagged photos a day on average in December 2012 via Twitter Streaming API. We think that Twitter is a promis- ing data source of geotagged of photos, while Flickr has been the most popular data source of geotagged photos in the Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. ICMR ’12, June 5-8, Hong Kong, China Copyright c 2012 ACM 978-1-4503-1329-2/12/06 ...$10.00. Figure 1: World Seer: a realtime geo-tweet photo mapping system. This figure shows the latest geo- tweet photos related to “noodle”. You can get to know what kinds of “noodle” people is eating over the world now from the geo-tweet photos. In addi- tion, from the street-view, you can get to know what the places where “noodles” are eaten are like, too. research community of multimedia so far. Since the char- acteristic of Twitter is quickness and on-the-spot-ness, the photos on Twitter are different from the photos on Flickr. Flickr has many travel photos, while Twitter has many pho- tos related to everyday life such as food, weather and street scene. Therefore, we think that geo-tweet photos are more useful to understand what happens currently over the world than Flickr geo-photos. We have been collecting geo-photo tweets continuously since February 2011. Currently, our geo-photo tweet database has more than 20 million geo-photo tweets, which includes ones related to some special events such as the huge earth- quake hit in Japan on March 11th, 2011. To make collection of geo-photo tweets successful, we im- plemented not only geo-tweet collecting system but also a system showing geo-tweet photos on the online map in the real-time way in order to monitor the status of the geo-tweet collection. Although the original objective of the geo-tweet mapping system is just monitoring, currently the system has been extended a lot so that we can get to know what happens over the world with geo-tweet photos, and then we named the system“World Seer”. In the demo, we introduce “World Seer” which is a real- time geo-tweet photo mapping system as shown in Fig.1. This figure shows the latest geo-tweet photos selected by the search keyword,“noodle”, from which you can get to know what kinds of “noodle” people are eating over the world now. Users can see the latest geo-tweet photos related to given keywords and areas on the online maps, and can search more than 20 million geo-photo tweets stored in our geo-photo tweet database.

Transcript of World Seer : A Realtime Geo-Tweet Photo Mapping...

Page 1: World Seer : A Realtime Geo-Tweet Photo Mapping Systemimg.cs.uec.ac.jp/pub/conf12/120606yanai_0.pdf · been collecting geo-photo tweets continuously since Febru-ary 2011. Currently,

World Seer : A Realtime Geo-Tweet Photo Mapping System

Keiji YanaiDepartment of Informatics, The University of Electro-Communications

Chofu, Tokyo 182-8585 [email protected]

ABSTRACTTwitter is a unique microblog which is different from con-ventional social media in terms of its quickness. Many Twit-ter’s users send messages to Twitter on the spot with mo-bile phones or smart phones, and some of them send tweetswith photos and geotags, which can be regarded as beinggeotagged photos. Geotagged tweet photos are very usefulto understand what happens currently over the world. Inthe demo, we introduce “World Seer” which is a real-timegeo-tweet photo mapping system. Users can see the latestgeo-tweet photos related to given keywords and areas on theonline maps. The system shows geo-tweet photos not onlyon the map, but also on the street-view. In addition, forsome parts of the geo-tweet photos, the system can showrepresentative photos for the given locations and the giventimes employing the GeoVisualRank method which takesinto account both visual features of photos and proximity ofgeotags.

Categories and Subject DescriptorsH.4 [Information Systems Applications]: Miscellaneous

General TermsAlgorithms, Design, Experimentation

Keywordsgeotagged photo, geotag, Twitter, geo-tweet

1. INTRODUCTIONTwitter is a unique microblog, which is different from

conventional social media in terms of its quickness. ManyTwitter’s users send messages, which is commonly called“tweets”, to Twitter on the spot with mobile phones or smartphones, and some of them send photos and geotags as wellas tweets. Then, we can regard Twitter as a data source ofgeotagged photos. In fact, we collected about 75 thousandgeotagged photos a day on average in December 2012 viaTwitter Streaming API. We think that Twitter is a promis-ing data source of geotagged of photos, while Flickr has beenthe most popular data source of geotagged photos in the

Permission to make digital or hard copies of all or part of this work forpersonal or classroom use is granted without fee provided that copies arenot made or distributed for profit or commercial advantage and that copiesbear this notice and the full citation on the first page. To copy otherwise, torepublish, to post on servers or to redistribute to lists, requires prior specificpermission and/or a fee.ICMR ’12, June 5-8, Hong Kong, ChinaCopyright c©2012 ACM 978-1-4503-1329-2/12/06 ...$10.00.

Figure 1: World Seer: a realtime geo-tweet photomapping system. This figure shows the latest geo-tweet photos related to “noodle”. You can get toknow what kinds of “noodle” people is eating overthe world now from the geo-tweet photos. In addi-tion, from the street-view, you can get to know whatthe places where “noodles” are eaten are like, too.

research community of multimedia so far. Since the char-acteristic of Twitter is quickness and on-the-spot-ness, thephotos on Twitter are different from the photos on Flickr.Flickr has many travel photos, while Twitter has many pho-tos related to everyday life such as food, weather and streetscene. Therefore, we think that geo-tweet photos are moreuseful to understand what happens currently over the worldthan Flickr geo-photos.

We have been collecting geo-photo tweets continuouslysince February 2011. Currently, our geo-photo tweet databasehas more than 20 million geo-photo tweets, which includesones related to some special events such as the huge earth-quake hit in Japan on March 11th, 2011.

To make collection of geo-photo tweets successful, we im-plemented not only geo-tweet collecting system but also asystem showing geo-tweet photos on the online map in thereal-time way in order to monitor the status of the geo-tweetcollection. Although the original objective of the geo-tweetmapping system is just monitoring, currently the system hasbeen extended a lot so that we can get to know what happensover the world with geo-tweet photos, and then we namedthe system “World Seer”.

In the demo, we introduce “World Seer” which is a real-time geo-tweet photo mapping system as shown in Fig.1.This figure shows the latest geo-tweet photos selected by thesearch keyword, “noodle”, from which you can get to knowwhat kinds of “noodle”people are eating over the world now.Users can see the latest geo-tweet photos related to givenkeywords and areas on the online maps, and can search morethan 20 million geo-photo tweets stored in our geo-phototweet database.

Page 2: World Seer : A Realtime Geo-Tweet Photo Mapping Systemimg.cs.uec.ac.jp/pub/conf12/120606yanai_0.pdf · been collecting geo-photo tweets continuously since Febru-ary 2011. Currently,

The system shows geo-tweet photos not only on the map,but also on the street-view. Seeing photos over the street-view photos corresponding to the geotag place is also quitehelpful to know what the place is like. The geo-tweet photoshown in the system window and its geo-location on the mapare switched one after another every five seconds automati-cally.

In addition, for some parts of the geo-tweet photos, thesystem can show representative photos for the given loca-tions and the give dates employing the GeoVisualRank method [1]which takes into account both visual features of photos andproximity of geotags. To do that, the backend part of thesystem downloads photo JPEG files and extracts visual fea-tures in advance. Due to computation loads, currently werestrict photos from which we extract visual features to thegeo-tweet photos tweeted in Japan. An example is shown inFig.3.

Figure 2: Overview of the system architecture.

2. OVERVIEW OF THE SYSTEMAs shown in Fig.2, the system consists of the following

five sub-systems and two databases:

Geo-tweet recording system receives tweet stream flow-ing through Twitter Streaming API, picks up geo-tweetshaving photos, and stores them on the metadata databasein a real-time way. It handles only textual data. Weare collecting geo-tweets from whole the world.

Geo-tweet photo downloading system extracts image URLsfrom each tweet stored in the metadata database, down-loads image files from Twitter photo affiliate sites suchas Twitpic.com, Instagr.am and yfrog.com., and storeimage files into the photo and feature database. Cur-rently, photos to be downloaded is limited to the pho-tos taken in Japan.

Geo-tweet photo feature extraction system extracts SIFTfeatures, convert them into bag-of-features (BoF) rep-resentation based on the pre-computed codebook, andstore BoF vectors into the photo and feature database.

Geo-tweet photo mapping system (Fig.1) shows geo-tweet photos on the Google Maps and the Google StreetViews.It fetches the latest geotag tweets from the metadata

database, or search the database for the tweets cor-responding to the given search keywords and/or thegiven term. This sub-system is available on the Web 1.

Geo-tweet photo clustering and ranking system (Fig.3)shows geo-photo tweet clusters on the map, and rep-resentative photos corresponding to each cluster. Im-age clusters are constructed by applying the mean-shiftclustering method to a set of 2D geotag vectors (lati-tude and longitude), and representative images corre-sponding to each cluster are selected by the GeoVisual-Rank method [1] which is a PageRank-based rankingmethod of geotagged images considering both visualsimilarity and geo-location proximity.

Geo-tweet metadata database stores metadata on ex-tracted geo-tweets which contain URLs of images.

Geo-tweet photo and visual feature database stores im-age JPEG files and corresponding BoF vectors.

To obtain tweets from Twitter, we can use two kinds ofTwitter Web API, Twitter API and Twitter Streaming API.Since we can obtain tweets only for recent ten days at mostvia Twitter API, we are collecting geo-tweets via TwitterStreaming API. Through Twitter Streaming API, a hugeamount of tweets are broadcasted continuously. In the sys-tem, the recording system selects only the tweets which havegeotags and photo URLs from the tweet stream. We havebeen collecting geo-photo tweets continuously since Febru-ary 2011. Currently, our geo-photo tweet database has morethan 20 million geo-photo tweets. We collected 2.3 milliongeo-tweet photos a month last December, which is equiva-lent to one geo-tweet with a photo per second. The numberof geo-tweet photos a month is increasing gradually.

Figure 3: Geo-tweet photo clustering system. Thegeo-tweet photos in this figure are recorded at theday of the huge earthquake hit in Japan on March11th, 2011.

3. REFERENCES[1] H. Kawakubo and K. Yanai. Geovisualrank: A ranking

method of geotagged images considering visualsimilarity and geo-location proximity. In Proc. of theACM International World Wide Web Conference, 2011.

1http://mm.cs.uec.ac.jp/geotwphoto/