NLPainter “Text Analysis for picture/movie generation”

25
NLPainter “Text Analysis for picture/movie generation” David Leoni Eduardo Cárdenas 12/01/2012

description

NLPainter “Text Analysis for picture/movie generation”. David Leoni Eduardo C á rdenas 12/01/2012. Motivation for choosing the project:. The purpose of our project is to transform text in images trying that both express the same mining. - PowerPoint PPT Presentation

Transcript of NLPainter “Text Analysis for picture/movie generation”

Page 1: NLPainter  “Text Analysis for picture/movie generation”

NLPainter

“Text Analysis for picture/movie generation”

David LeoniEduardo Cárdenas

12/01/2012

Page 2: NLPainter  “Text Analysis for picture/movie generation”

2

MOTIVATION FOR CHOOSING THE PROJECT:

The purpose of our project is to transform text in images trying that both express the same mining.

• Adding illustrations to text can be of great help to memorize its contents• But searching images that represent the text is a time consuming task• Drawing entirely new images from scratch takes even longer.

Page 3: NLPainter  “Text Analysis for picture/movie generation”

3

HOW THE PROBLEM CAN BE SOLVE?

In order to solve this problem we are going to use different techniques like text mining, natural language processing and semantic web:

We obtained a big Image database.

We have image with tags with the things that are inside of them.

We selected the most representative picture in our database that describes a specific object.

Page 4: NLPainter  “Text Analysis for picture/movie generation”

4

HOW THE PROBLEM CAN BE SOLVE?

We used some text mining techniques in order to obtain entities, attributes, etc.

We used the PoS of the phrase that we want to convert to image.

We associated the text with the images.

Page 5: NLPainter  “Text Analysis for picture/movie generation”

5

DATABASES

The following databases of images was used for our project:

LabelMe images are annotated with the shapes of the objects contained in the scene. labeling was done by unpaid users More than 70,000 shapes where obtained!

Animal Diversity Web •we fetched nearly 10000 pages.•1545 were information about animals.•3500 picture pages of animals (and for each picture page we extracted ~5 pics links) and 5000 were simply the pages about the hierarchy, needed to arrive to the information at the leaves•we fetched mammals,reptiles ,birds, bony fishes, insects, echinoderms, arthropods

Page 6: NLPainter  “Text Analysis for picture/movie generation”

6

LabelMe

DATABASES

Page 7: NLPainter  “Text Analysis for picture/movie generation”

7

Animal Diversity

DATABASES

Page 8: NLPainter  “Text Analysis for picture/movie generation”

8

General Diagram:

Domain(Ani

mals)

Obtain andproce

ssText

Obtain and proces

spictur

es

Create

Ontology

and data

Query according

to the text

Image representing the text

Semanti

c Web

Image

processin

g

Page 9: NLPainter  “Text Analysis for picture/movie generation”

Obtain the text

Parse the text (NLP)

EntitiesAttributes

PathActions

Text ready for be

converted to image

9

Specific Diagram (Text):

9

Page 10: NLPainter  “Text Analysis for picture/movie generation”

Pictures in

Internet

Pictures in Databases (LabelMe Matlab)

XML to RDF

Retrieve Image

Image processi

ng

10

Specific Diagram (Images):

10

Page 11: NLPainter  “Text Analysis for picture/movie generation”

Specific Diagram (Ontology):

Domain(Animals)

Create the ontology

Extract linked data into asemantic repository

Semantic Web

Application

11

11

Page 12: NLPainter  “Text Analysis for picture/movie generation”

Technologies and algorithms(Text)

12

Programming Environment:Netbeans

Packages:Stanford Parser

Additional Packages: Image Generator

12

Page 13: NLPainter  “Text Analysis for picture/movie generation”

Technologies and algorithms(Image)

13

Programmation Language:MATLAB, Java

Programming Environment:Netbeans

Packages:LabelMeXOM

13

Page 14: NLPainter  “Text Analysis for picture/movie generation”

Technologies and algorithms(Ontology)

14

Editor:Protégé 4.1

RDF engine: OWLim Lite

Upper ontology: Wordnet

14

Page 15: NLPainter  “Text Analysis for picture/movie generation”

Technologies and algorithms(General project)

15

Programming Environment:NetBeans

RDF engine:OWLIM lite

Packages:XOM

Web server:Apache Tomcat 7.0 JSP

15

Page 16: NLPainter  “Text Analysis for picture/movie generation”

Technologies and algorithms(General project)

16

Documentation: Google Wiki

Versioning:SVN

Project Web Page:http://code.google.com/p/nlpainter/

16

Page 17: NLPainter  “Text Analysis for picture/movie generation”

How to run the project?

17

17

Page 18: NLPainter  “Text Analysis for picture/movie generation”

18

The Story Picturing Engine

A Text-to-Picture Synthesis System for Augmenting Communication

WordsEye

Comparison with other results:

Page 19: NLPainter  “Text Analysis for picture/movie generation”

Our Project working!19

19

Page 20: NLPainter  “Text Analysis for picture/movie generation”

20

Some Results:

Page 21: NLPainter  “Text Analysis for picture/movie generation”

21

Some Results:

Page 22: NLPainter  “Text Analysis for picture/movie generation”

22

Some Results:

Page 23: NLPainter  “Text Analysis for picture/movie generation”

23

Some Results:•The car and the sky, and the street.•The bike is at left of the car.•A person walking.•a person in the hotel.•the tree and a person.•a person in the water.•a door is to the right of a bed.•a door is to the left of a bed.

Let see it works!!!

Page 24: NLPainter  “Text Analysis for picture/movie generation”

24

Conclusions

Page 25: NLPainter  “Text Analysis for picture/movie generation”

25

References:• [LM] Bryan C. Russell and Antonio Torralba and Kevin P. Murphy and

William T. Freeman}, Labelme: A database and web-based tool for image annotation, MIT AI Lab Memo, 2005

• [DBP] Christian Bizer, Jens Lehmann, Georgi Kobilarov, Sören Auer, Christian Becker, Richard Cyganiak, Sebastian Hellmann: DBpedia – A Crystallization Point for the We of Data. Journal of Web Semantics: Science, Services and Agents on the World Wide Web, Issue 7, Pages 154–165, 2009.

• [TRA] Mihalcea, R., and Tarau, P. 2004. TextRank: Bringing order into texts. In Proc. Conf. Empirical Methods in Natural Language Processing, 404–411

• [CAPS] Ken Xu and James Stewart and Eugene Fiume , Constraint-Based Automatic Placement for Scene Composition, Proc. Graphics Interface, 2002,May, Calgary, Alberta, pp 25--34[ADW] Myers, P., R. Espinosa, C. S. Parr, T. Jones, G. S. Hammond, and T. A. Dewey. 2006. The Animal Diversity Web (online). Accessed November 01, 2011 at http://animaldiversity.org