Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by...

54
Reading Metadata Between the Lines: Searching for Stories, People, Places and More in Television News Kai Chan Social Sciences Computing University of California, Los Angeles [email protected]

Transcript of Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by...

Page 1: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Reading Metadata Between the Lines:

Searching for Stories, People, Places and More

in Television News

Kai Chan Social Sciences Computing

University of California, Los Angeles [email protected]

Page 2: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

What?

Page 3: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

What We Do with Television News

Page 4: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Make Metadata Searchable

Page 5: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Make Metadata Searchable

cap$on  THESE RECALLED CARS ARE AMONG THE MOST POPULAR FOR THE PAST 12 YEARS.

Page 6: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Make Metadata Searchable

cap$on  (searchable)  

THESE RECALLED CARS ARE AMONG THE MOST POPULAR FOR THE PAST 12 YEARS.

Page 7: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Make Metadata Searchable

metadata  

cap$on  (searchable)  

THESE RECALLED CARS ARE AMONG THE MOST POPULAR FOR THE PAST 12 YEARS.

Page 8: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Make Metadata Searchable

metadata  (not  searchable)  

cap$on  (searchable)  

THESE RECALLED CARS ARE AMONG THE MOST POPULAR FOR THE PAST 12 YEARS.

Page 9: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Story Segment

Story  1   Story  2  

Page 10: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Story Segment

Page 11: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Name Entity Name:  John  McCain  Role:    US  Senator  Party:  Republican  

Name:  Greta  Van  Susteren  Role:    Anchor  Network:  Fox  News  Channel  

Page 12: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Name Entity

NJ  Governor:  coopera$on  from  US  President  “outstanding”,  “deserves  great  credit”  

Republican   Democrat   praise  (!)  

Page 13: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Non-Verbal Communication

Page 14: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Non-Verbal Communication

Page 15: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

On-Screen Text

Page 16: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

On-Screen Text

Page 17: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

How?

Page 18: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

1. Help Users Search

Page 19: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Define Metadata Structure

Tag   AQribute  Name:  Value  

AQribute  Name:  Value  

AQribute  Name:  Value  

Start  Time   End  Time  

Page 20: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Define Metadata Structure

SEG   Type:  Headline  

Topic:  Ebola  Scare  

Country:  US  

1:00:00   1:03:00  (story  segment)  

start  $me   end  $me   tag   aQributes  

Page 21: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Search in Multiple Places

Page 22: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Offer Suggestions

Page 23: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

2. Make the Search Happen

Page 24: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

Page 25: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

NER_Name:  John  Chiang  

NER_Role:  Controller  

NER_State:  California  

SEG_Topic:  Drought  

SEG_Type:  Poli$cs  

Page 26: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Define Semantics

+TEXT_Text:“drought”    +NER_Role:“Senator”    +NER_State:“California”  

Query:  

Page 27: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Define Semantics

Interpreta$on  1:  

“drought”  

$me  

start   end  

Role:  Senator   State:  California  

start   end  

Page 28: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Define Semantics

Interpreta$on  2:  

“drought”  

$me  

start   end   start   end  

“drought”  

Role:  Senator   State:  California  

Page 29: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Define Semantics

Interpreta$on  3:  

“drought”  

$me  

start   end  

Role:  Senator  State:  California  

Page 30: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Define Semantics

Interpreta$on  4:  

“drought”  

$me  

start   end  

Role:  Senator  State:  California  

Page 31: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

NER_Name:  John  Chiang  

NER_Role:  Controller  

NER_State:  California  

SEG_Topic:  Drought  

SEG_Type:  Poli$cs  

Page 32: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

NER_Name:  John  Chiang  

NER_Role:  Controller  

NER_State:  California  

SEG_Topic:  Drought  

SEG_Type:  Poli$cs  

Page 33: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

NER_Name:  John  Chiang  

NER_Role:  Controller  

NER_State:  California  

SEG_Topic:  Drought  

SEG_Type:  Poli$cs  

Page 34: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

NER_Name:  John  Chiang  

NER_Role:  Controller  

NER_State:  California  

SEG_Topic:  Drought  

SEG_Type:  Poli$cs  

Page 35: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

NER_Name:  John  Chiang  

NER_Role:  Controller  

NER_State:  California  

SEG_Topic:  Drought  

SEG_Type:  Poli$cs  

Page 36: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

NER_Name:  John  Chiang  

NER_Role:  Controller  

NER_State:  California  

SEG_Topic:  Drought  

SEG_Type:  Poli$cs  

Page 37: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Map to Documents and Fields

SEG_Topic:  Ebola  Scare  

NER_Name:  John  McCain  

NER_Role:  Senator  

fields  

SEG_Type:  Headline  

(program  info,  cap$on)  

document  

NER_State:  Arizona  

NER_Name:  John  Chiang  

NER_Role:  Controller  

NER_State:  California  

SEG_Topic:  Drought  

SEG_Type:  Poli$cs  

Page 38: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

3. Make the Search Meaningful

Page 39: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Two Levels of Document

program  document  

tag  document  

tag  document  

tag  document  

1  document  =  1  metadata  instance  

Page 40: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Two Levels of Document

program  document  

tag  document  

tag  document  

tag  document  

1  document  =  1  news  program  

Page 41: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Two Levels of Document

program  document  

tag  document  

tag  document  

tag  document  

1.  search  metadata  content  

Page 42: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Two Levels of Document

program  document  

tag  document  

tag  document  

tag  document  

2.  lookup  program  document(s)  

Page 43: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Two Levels of Document

program  document  

tag  document  

tag  document  

tag  document  

3.  filter  by  program  informa$on  

Page 44: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Two Levels of Document

NER_Role:  Senator  

NER_State:  Arizona  

tag  document  

NER_Role:  Senator  

tag  document   tag  document  

Tag:  NER  

NER_State:  California  

Tag:  NER   Tag:  NER  

NER_Role:  Controller  

NER_State:  California  

match  NOT  match   NOT  match  

Page 45: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Two Levels of Document

Date  

Network  

Show  

program  document  

NER_Role:  Senator  

tag  document  

Tag:  NER  

NER_State:  California  

Page 46: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Filter by Metadata Boundaries

“drought”  

$me  

start   end  

“drought”  “drought”  

Role:  Governor  State:  California  

Page 47: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Filter by Metadata Boundaries

... EMERGENCY PLED TO THE STATE OF CALIFORNIA IN MAY TO CONSERVE WATER. >> THIS DROUGHT IS A BIG WAKE-UP CALL, A REMINDER. THE COUPLE SAYS THAT THEY NEED NO REMINDERS. ...

36:18  

36:22   36:18  –  36:22  Tag:  NER  Name:  Jerry  Brown  Role:  Governor  State:  California  

36:19  

Page 48: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

4. Make the Search More Powerful

Page 49: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Proximity Search – Word as Unit

... >> THIS DROUGHT IS A BIG WAKE-UP CALL, A REMINDER. THE COUPLE SAYS THAT THEY NEED NO REMINDERS THEY DO ADMIT THAT THEIR LAWN HAS BECOME A BIT UNSIGHTLY. ...

posi$on  100  

posi$on  121  

20  words  

Page 50: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Proximity Search – Time as Unit

... >> THIS DROUGHT IS A BIG WAKE-UP CALL, A REMINDER. THE COUPLE SAYS THAT THEY NEED NO REMINDERS THEY DO ADMIT THAT THEIR LAWN HAS BECOME A BIT UNSIGHTLY. ...

36:19  

36:25  

6  s  

Page 51: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Make Metadata Searchable

metadata  (not  searchable)  

cap$on  (searchable)  

THESE RECALLED CARS ARE AMONG THE MOST POPULAR FOR THE PAST 12 YEARS.

Page 52: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Make Metadata Searchable – Accomplished

metadata  (now  searchable)  

cap$on  (searchable)  

THESE RECALLED CARS ARE AMONG THE MOST POPULAR FOR THE PAST 12 YEARS.

Page 53: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA

Thank you for coming!

Ques$ons  or  comments?  My  e-­‐mail:  [email protected]  

Slides  available  at:  hQp://bit.ly/lsr2014tvnews  (or  scan  this  barcode)  

Page 54: Reading Metadata Between the Lines - Searching for Stories, People, Places and More: Presented by Kai Chan, UCLA