NLPainter “Text Analysis for picture/movie generation”

NLPAINTER

“TEXT ANALYSIS FOR PICTURE/MOVIE GENERATION”

David LeoniEduardo Cárdenas

11/01/2012

2

MOTIVATION FOR CHOOSING THE PROJECT:

The purpose of our project is to transform text in images trying that both express the same mining.

More than 50% of human brain is devoted to vision

• A fact mere text can't exploit, no matter how inspired and well written it is. • Adding illustrations to text can be of great help to memorize its contents• But searching images that represent the text is a time consuming task• Drawing entirely new images from scratch takes even longer.

3

HOW THE PROBLEM CAN BE SOLVE?

In order to solve this problem we are going to use different techniques like text mining, natural language processing and semantic web:

We obtained a big Image database.

We have image with tags with the things that are inside of them.

We selected the most representative picture in our database that describes a specific object.

4

HOW THE PROBLEM CAN BE SOLVE?

We used some text mining techniques in order to obtain entities, attributes, etc.

We used the PoS of the phrase that we want to convert to image.

We associated the text with the images.

5

WHY AN ONTOLOGY?

We need an ontology to hold implicit knowledge which might not be present in the text provided by the user

Examples:

• an animal is in the tundra

• We don’t have pictures tags that tell us animal, so we use an ontology in order to know that a lion is an animal.

• Which are the usual animals living in the tundra?

6

DATABASES

The following databases of images was used for our project:

LabelMe images are annotated with the shapes of the objects contained in the scene. labeling was done by unpaid users More than 70,000 shapes where obtained!

Animal Diversity Web •we fetched nearly 10000 pages.•1545 were information about animals.•3500 picture pages of animals (and for each picture page we extracted ~5 pics links) and 5000 were simply the pages about the hierarchy, needed to arrive to the information at the leaves•we fetched mammals,reptiles ,birds, bony fishes, insects, echinoderms, arthropods

7

LabelMe

DATABASES

8

Animal Diversity

DATABASES

9

The Story Picturing EngineA Text-to-Picture Synthesis System for Augmenting CommunicationWordsEye

State of the Art

10

PART OF SPEECH

STANFORD

This software is a Java implementation of the log-linear part-of-speech taggers. The English taggers use the Penn Treebank tag set. Have been improved its speed, performance, usability, and support for other languages.

11

STANFORD PROJECT

12

Ontology:

13

Ontology:

14

Ontology:

15

Ontology:

16

General Diagram:

Domain(Animals)

Obtain andprocess

Text

Obtain and processpictures

Create Ontology and data

Query according to

the text

Image representing

the textSemantic

WebImage

processing

17

Obtain the text Parse the text (NLP)

EntitiesAttributes

PathActions

Text ready for be converted to image

Specific Diagram (Text):

17

18

Pictures in

Internet

Pictures in Databases (LabelMe Matlab)

XML to RDF Retrieve Image

Image processing

Specific Diagram (Images):

18

19

Domain(Animals)

Create the ontology

Extract linked data into a

semantic repository

Semantic WebApplication

SPECIFIC DIAGRAM (IMAGES):

19

TECHNOLOGIES AND ALGORITHMS(TEXT)

20

Programmation Language: Java

Programming Environment:Netbeans

Packages:Stanford Parser

Additional Packages: Image Generator

20

TECHNOLOGIES AND ALGORITHMS(IMAGE)

21

Programmation Language:MATLAB, Java

Programming Environment:Netbeans

Packages:LabelMeXOM Image Processing (Crop, class, merge class, etc. )

21

TECHNOLOGIES AND ALGORITHMS(ONTOLOGY)

22

Editor:Protégé 4.1

RDF engine: OWLim Lite

Upper ontology: Wordnet

22

TECHNOLOGIES AND ALGORITHMS(GENERAL PROJECT)

23

Programmation Language: Java

Programming Environment:NetBeans

RDF engine:OWLIM lite

Packages:XOM Image Processing (awt)

Web server:Apache Tomcat 7.0 JSP

23

TECHNOLOGIES AND ALGORITHMS(GENERAL PROJECT)

24

Documentation: Google Wiki

Versioning:SVN

Project Web Page:http://code.google.com/p/nlpainter/

24

WEB PAGE:

25

/

25

http://code.google.com/p/nlpainter/

OUR PROJECT WORKING!26

/

26

http://code.google.com/p/nlpainter/

27

LET SE IT WORKS!!!

28

QUESTIONS?

29

REFERENCES: [LM] Bryan C. Russell and Antonio Torralba and Kevin P. Murphy

and William T. Freeman}, Labelme: A database and web-based tool for image annotation, MIT AI Lab Memo, 2005

[DBP] Christian Bizer, Jens Lehmann, Georgi Kobilarov, Sören Auer, Christian Becker, Richard Cyganiak, Sebastian Hellmann: DBpedia – A Crystallization Point for the We of Data. Journal of Web Semantics: Science, Services and Agents on the World Wide Web, Issue 7, Pages 154–165, 2009.

[TRA] Mihalcea, R., and Tarau, P. 2004. TextRank: Bringing order into texts. In Proc. Conf. Empirical Methods in Natural Language Processing, 404–411

[CAPS] Ken Xu and James Stewart and Eugene Fiume , Constraint-Based Automatic Placement for Scene Composition, Proc. Graphics Interface, 2002,May, Calgary, Alberta, pp 25--34[ADW] Myers, P., R. Espinosa, C. S. Parr, T. Jones, G. S. Hammond, and T. A. Dewey. 2006. The Animal Diversity Web (online). Accessed November 01, 2011 at http://animaldiversity.org

NLPainter “Text Analysis for picture/movie generation”

Documents

Transcript of NLPainter “Text Analysis for picture/movie generation”