Semantic Search – Do you know what I mean?

Post on 01-Dec-2014

Description: Search is broken. It can be fixed.

Collaboration for Life

Semantic Search

Do you know what I mean?

Search is broken

1. Search engines rely heavily on words and too little on concepts

2. Search engines only use geography to limit linguistic scope

3. Search engines ignore the linguistic scope variables of industry, organisation, product line, scientific discipline, and project

4. Search engines have no formal notion of semantic equivalence

Words vs concepts

Words are labels: “software”, “architecture”, “architectural design”, “program code”, “program”, “structural & dynamic models”

Concepts contain semantic links: “software architecture” and “solution architecture”, linking “application”, “architecture”, “database”, “integration”, “product”, “structural & dynamic models”, “architectural design”, “persistent datastore”, “program code”, “software”, “program”
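The difference between matching on words and matching on concepts can be sketched in a few lines of Python. This is only an illustration: the `LABEL_TO_CONCEPT` table, the concept identifiers and the grouping of labels (for example, treating “database” and “persistent datastore” as labels of one concept) are assumptions made for the example, not something the slides specify, and semantic links between concepts are left out for brevity.

```python
# Hypothetical vocabulary: several labels (words) may point at one concept.
# The groupings below are assumptions for illustration only.
LABEL_TO_CONCEPT = {
    "software": "c:program-code",
    "program": "c:program-code",
    "program code": "c:program-code",
    "database": "c:persistent-datastore",
    "persistent datastore": "c:persistent-datastore",
}


def word_match(query, text):
    """Word-based search: the literal label must occur in the text."""
    return query.lower() in text.lower()


def labels_for(query):
    """Return every known label that names the same concept as the query."""
    concept = LABEL_TO_CONCEPT.get(query.lower())
    if concept is None:
        # Unknown word: fall back to the word itself.
        return {query.lower()}
    return {label for label, c in LABEL_TO_CONCEPT.items() if c == concept}


def concept_match(query, text):
    """Concept-based search: any label of the query's concept may occur."""
    lowered = text.lower()
    return any(label in lowered for label in labels_for(query))


text = "The persistent datastore is replicated nightly."
print(word_match("database", text))     # literal word is absent
print(concept_match("database", text))  # another label of the same concept is present
```

A word-based engine misses the document because the label differs, while the concept-based lookup finds it through a second label of the same concept.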

Mapping between labels and semantics depends on linguistic scope!

Organisation A, “solution architecture”: “application”, “architecture”, “database”, “integration”, “product”, “structural & dynamic models”, “architectural design”, “persistent datastore”, “program code”, “software”, “program”

≠

Organisation B, “solution architecture”: “application”, “architecture”, “database”, “structural & dynamic models”, “architectural design”, “persistent datastore”, “program code”, “software”, “program”
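One way to picture scope-dependent meaning is a vocabulary keyed by both scope and label. The sketch below is an assumption-laden illustration: `SCOPED_VOCABULARY`, `resolve` and `same_meaning` are hypothetical names, and representing a concept as a plain set of linked terms is a simplification of the diagrams on the slide.

```python
# Terms linked to "solution architecture" in both organisations (from the slide).
SHARED = frozenset({
    "application", "architecture", "database",
    "structural & dynamic models", "architectural design",
    "persistent datastore", "program code", "software", "program",
})

# Hypothetical scoped vocabulary: (scope, label) -> linked terms.
SCOPED_VOCABULARY = {
    ("Organisation A", "solution architecture"): SHARED | {"integration", "product"},
    ("Organisation B", "solution architecture"): SHARED,
}


def resolve(label, scope):
    """Look up what a label means within one linguistic scope."""
    return SCOPED_VOCABULARY[(scope, label)]


def same_meaning(label, scope_a, scope_b):
    """True if the label carries identical semantics in both scopes."""
    return resolve(label, scope_a) == resolve(label, scope_b)


label = "solution architecture"
print(same_meaning(label, "Organisation A", "Organisation B"))
print(sorted(resolve(label, "Organisation A") - resolve(label, "Organisation B")))
```

The same label resolves to different semantics in the two organisations, so a search that ignores scope cannot tell which meaning the user intends.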

Semantic equivalence

“architecture” “software”

“software architecture”

“software” “architecture”

=

“software” “architecture”

“solution”

“solution architecture”

“application”

“software” “architecture”

“database”

“integration”

“product”

= =
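A formal notion of semantic equivalence could, under simple assumptions, be modelled as equality of the concept sets that two search specifications resolve to. The sketch below is one possible reading of the slide, not the author's definition: `COMPOUND_LABELS`, `normalise` and `equivalent` are hypothetical, and the decomposition of “solution architecture” is a guess based on the terms shown.

```python
# Hypothetical decomposition of compound labels into constituent concepts.
COMPOUND_LABELS = {
    "software architecture": frozenset({"software", "architecture"}),
    # Assumption: one possible reading of the slide.
    "solution architecture": frozenset({"solution", "software", "architecture"}),
}


def normalise(terms):
    """Expand compound labels and collect the resulting concepts in a set."""
    concepts = set()
    for term in terms:
        key = term.lower()
        concepts |= COMPOUND_LABELS.get(key, frozenset({key}))
    return frozenset(concepts)


def equivalent(spec_a, spec_b):
    """Two search specifications are equivalent if they normalise identically."""
    return normalise(spec_a) == normalise(spec_b)


print(equivalent(["architecture", "software"], ["software architecture"]))
print(equivalent(["software architecture"], ["solution architecture"]))
```

With such a check, word order and the choice between a compound label and its parts no longer change what is searched for.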

Fixing search

1. Interact with the user to confirm the translation of words into concepts

2. Fully take into account the significance of linguistic scope, in particular the variables of industry, organisation, product line, scientific discipline, project

3. Introduce a formal notion of semantic equivalence to simplify search specification and execution

PS: The W3C Semantic Web standards are not the solution

Example 1: Semantic Search

1. System: Ask the user for one or two concepts
Concepts: ?, ?
Matches: ∞

2. User: Nominate two concepts to reduce the search space
Concepts: Jorn Bettin, car
Matches: < 10,000

3. System: Suggest statistically related 2nd level concepts
Concepts: Jorn Bettin, car, ?
Matches: < 10,000

4. User: Select a concept to reduce the search space
Concepts: owner, Jorn Bettin, car
Matches: < 100

5. System: Suggest statistically related concepts
Concepts: owner, Jorn Bettin, car, ?
Matches: < 100

6. User: Select a concept to reduce the search space
Concepts: Victoria, owner, Jorn Bettin, car
Matches: < 10

7. System: Suggest statistically related 2nd level concepts
Concepts: Victoria, owner, Jorn Bettin, car, ?
Matches: < 10

8. User: Select a concept to reduce the search space
Concepts: Victoria, owner, Jorn Bettin, car, Volkswagen
Matches: < 3

9. System: Suggest statistically related 3rd level concepts
Concepts: Victoria, owner, Jorn Bettin, car, Volkswagen, ?
Matches: < 3

10. User: Select a concept to reduce the search space
Concepts: Victoria, owner, Jorn Bettin, car, Volkswagen, Golf
Matches: 1
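The dialogue above alternates between the system suggesting concepts and the user selecting one. A minimal sketch of that loop follows, assuming each indexed item is simply a set of concepts and that "statistically related" means "co-occurs most often with the current selection". The toy `RECORDS` data and the `matches` and `suggest` functions are hypothetical; the slides do not say how suggestions are computed.

```python
from collections import Counter

# Toy index (made up for illustration): each item is a set of concepts.
RECORDS = [
    {"Jorn Bettin", "car", "owner", "Victoria", "Volkswagen", "Golf"},
    {"Jorn Bettin", "car", "owner", "Victoria", "Volkswagen", "Polo"},
    {"Jorn Bettin", "car", "owner", "Queensland", "Toyota"},
    {"Jorn Bettin", "car", "review"},
    {"car", "owner", "Victoria", "Toyota"},
]


def matches(selected):
    """Return the items that contain every selected concept."""
    return [record for record in RECORDS if selected <= record]


def suggest(selected, limit=3):
    """Suggest concepts that co-occur most often with the selection.

    Concepts present in every hit are skipped, since selecting them
    would not reduce the search space.
    """
    hits = matches(selected)
    counts = Counter()
    for record in hits:
        counts.update(record - selected)
    useful = Counter({c: n for c, n in counts.items() if n < len(hits)})
    return [concept for concept, _ in useful.most_common(limit)]


selected = {"Jorn Bettin", "car"}
print(len(matches(selected)), suggest(selected, limit=1))
for choice in ["owner", "Victoria", "Volkswagen", "Golf"]:
    selected = selected | {choice}
    print(choice, len(matches(selected)))
```

Each selection shrinks the set of matches, mirroring the falling "Matches" counts in the example, until a single item remains.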

Example 2: Semantic Search

What if the name of the owner is highly ambiguous?
Concepts: Victoria, owner, John Smith, car, Volkswagen, Golf
Matches: < 3

11. System: Suggest statistically related 2nd level concepts
Concepts: Victoria, owner, John Smith, car, Volkswagen, Golf, ?
Matches: < 3

12. User: Select a concept to reduce the search space
Concepts: Victoria, owner, John Smith, car, Volkswagen, Golf, Australia
Matches: 1

Example 3: Semantic Search

1. System: Ask the user for one or two concepts
Concepts: ?, ?
Matches: ∞

2. User: Nominate one concept to reduce the search space
Concepts: car
Matches: < 100,000,000,000

3. System: Suggest statistically related concepts
Concepts: car, ?
Matches: < 100,000,000,000

4. User: Select or nominate a concept to reduce the search space
Concepts: ABC 123, car
Matches: < 100

5. System: Suggest statistically related 2nd level concepts
Concepts: ABC 123, car, ?
Matches: < 100

6. User: Select a concept to reduce the search space
Concepts: ABC 123, car, number plate
Matches: < 10

7. System: Suggest statistically related concepts
Concepts: car, ?, number plate, ABC 123
Matches: < 10

8. User: Select a concept to reduce the search space
Concepts: Victoria, car, number plate, ABC 123
Matches: 1

Example 4: Semantic Search

1. System: Ask the user for one or two concepts
Concepts: ?, ?
Matches: ∞

2. User: Nominate two concepts to reduce the search space
Concepts: architecture, software
Matches: < 1,000,000

3. System: Suggest statistically related concepts
Concepts: architecture, software, ?
Matches: < 1,000,000

4. User: Select or nominate a concept to reduce the search space
Concepts: architecture, Melbourne, software
Matches: < 1,000

5. User: Select or nominate a concept to enlarge the search space
Concepts: architecture, Sydney, Melbourne, software
Matches: < 3,000

6. System: Suggest statistically related concepts
Concepts: architecture, Sydney, Melbourne, software, ?
Matches: < 3,000

7. User: Select a concept to reduce the search space
Concepts: architecture, Sydney, Melbourne, software, product development
Matches: < 1,500

8. User: Connect concepts to reduce the search space
Concepts: architecture, Sydney, Melbourne, product development, software
Matches: < 1,000

9. User: Connect concepts to reduce the search space
Concepts: architecture, Sydney, Melbourne, product development, software
Matches: < 300

10. System: Suggest statistically related 2nd level concepts
Concepts: architecture, Sydney, Melbourne, product development, software, ?
Matches: < 300

11. User: Select a 2nd level concept to reduce the search space
Concepts: architecture, Sydney, Melbourne, product development, software, Web
Matches: < 200

12. System: Suggest statistically related 2nd level concepts
Concepts: architecture, Sydney, Melbourne, product development, software, ?, Web
Matches: < 200

13. User: Select a 2nd level concept to reduce the search space
Concepts: architecture, Sydney, Melbourne, mobile, product development, software, Web
Matches: < 100

14. System: Suggest statistically related concepts
Concepts: architecture, Sydney, Melbourne, mobile, product development, Web, software, ?
Matches: < 100

15. User: Select a concept to reduce the search space
Concepts: architecture, permanent, Sydney, Melbourne, mobile, product development, Web, software
Matches: < 50

16. System: Suggest statistically related concepts
Concepts: architecture, permanent, Sydney, Melbourne, mobile, product development, Web, software, ?
Matches: < 50

17. User: Select a concept to reduce the search space
Concepts: architecture, permanent, Sydney, Melbourne, mobile, product development, Web, software, senior
Matches: < 20

18. System: Suggest statistically related 2nd level concepts
Concepts: architecture, permanent, Sydney, Melbourne, mobile, product development, Web, software, senior, ?
Matches: < 20

19. User: Select a 2nd level concept to reduce the search space
Concepts: architecture, permanent, Sydney, Melbourne, mobile, product development, Web, software, senior, semantic search
Matches: 1
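This example both reduces and enlarges the search space. One simple way to model that, offered here as an assumption rather than as the mechanism behind the slides, is a query made of groups of alternatives: an item matches when it contains at least one concept from every group. Adding a group narrows the search; adding an alternative to an existing group (Sydney next to Melbourne) widens it. The `RECORDS` data and `count_matches` are hypothetical, and the "connect concepts" steps are not modelled.

```python
# Toy index (made up for illustration): each item is a set of concepts.
RECORDS = [
    {"software", "architecture", "Melbourne"},
    {"software", "architecture", "Sydney"},
    {"software", "architecture", "Sydney", "product development"},
    {"software", "architecture", "Perth"},
]


def count_matches(query):
    """Count items containing at least one concept from every group.

    `query` is a list of sets; each set holds interchangeable alternatives.
    """
    return sum(
        1 for record in RECORDS
        if all(group & record for group in query)
    )


narrow = [{"software"}, {"architecture"}, {"Melbourne"}]
wider = [{"software"}, {"architecture"}, {"Melbourne", "Sydney"}]
narrowed_again = wider + [{"product development"}]

print(count_matches(narrow))
print(count_matches(wider))
print(count_matches(narrowed_again))
```

The match count rises when an alternative location is added and falls again when a further concept is required, which is the pattern seen in steps 4 to 7.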

Semantic search can be fixed

http://s23m.org/S23M/cell-platform.html

Cell Platform

The latest book on Domain Engineering

Domain Engineering is of considerable practical significance, as it provides methods and techniques that help reduce time-to-market, development costs, and project risks on the one hand, and help improve system quality and performance on a consistent basis on the other.

• The most comprehensive and up-to-date work on domain engineering

• Covers all important technological aspects, including software product lines, domain-specific languages, and conceptual modeling

• Introduces novel approaches and techniques, and includes a wealth of pointers for further research

• ISBN 978-3-642-36653-6, published 2013

http://www.springer.com/computer/swe/book/978-3-642-36653-6

S23M

info @ s23m.com

Nothing beats capturing the knowledge flow of leading domain experts to co-create organisations & systems that are understandable by future generations of humans & software tools.

Semantic Search

S23M knows what you mean!