Semantic Search – Do you know what I mean?
-
Upload
jorn-bettin -
Category
Education
-
view
731 -
download
2
description
Transcript of Semantic Search – Do you know what I mean?
Collaboration for Life
Semantic SearchDo you know what I mean?
Collaboration for Life
Search is broken
1. Search engines rely heavily on words and too little on concepts
2. Search engines only use geography to limit linguistic scope
3. Seach engines ignore the linguistic scope variables of industry, organisation, product line, scientific discipline, project
4. No formal notion of semantic equivalence
Collaboration for Life
Words vs concepts
“software” “architecture”“architectural design”“program code”
“program”“structural &
dynamic models”
“software architecture” “solution architecture”
“application”
“architecture”
“database”
“integration”
“product”
“structural & dynamic models”
“architectural design”
“persistent datastore”
“program code”“software”
“program”
words are labels
concepts contain semantic links
Collaboration for Life
Mapping between labels and semantics depends on linguistic scope!
“solution architecture” “solution architecture”
“application”
“architecture”
“database”
“integration”
“product”
“structural & dynamic models”
“architectural design”
“persistent datastore”
“program code”“software”
“program”
“application”
“architecture”
“database”
“structural & dynamic models”
“architectural design”
“persistent datastore”
“program code”“software”
“program”≠
Organisation A Organisation B
Collaboration for Life
Semantic equivalence“architecture”“software”
“software architecture”
“software” “architecture”
=
“software” “architecture”
“solution”
“solution architecture”
“application”
“software” “architecture”
“database”
“integration”
“product”
= =
Collaboration for Life
Fixing search
1. Interact with user to confirm translation of words into concepts
2. Fully take into account the significance of linguistic scope, in particular the variables of industry, organisation, product line, scientific discipline, project
3. Introduce a formal notion of semantic equivalence to simplify search specification and execution
PS: The W3C Semantic Web standards are not the solution
Collaboration for Life
Example 1Semantic Seach
Collaboration for Life
Matches: ∞
1. System: Ask the user for one or two concepts
?
?
Collaboration for Life
Matches: < 10,000
Jorn Bettin
car
2. User: Nominate two concepts to reduce the search space
Collaboration for Life
Matches: < 10,000
Jorn Bettin
car
?
3. System: Suggest statistically related 2nd level concepts
Collaboration for Life
owner Jorn Bettin
car Matches: < 100
4. User: Select a concept to reduce the search space
Collaboration for Life
owner Jorn Bettin
car ? Matches: < 100
5. System: Suggest statistically related concepts
Collaboration for Life
Victoria
owner Jorn Bettin
car
Matches: < 10
6. User: Select a concept to reduce the search space
Collaboration for Life
Victoria
owner Jorn Bettin
car?
Matches: < 10
7. System: Suggest statistically related 2nd level concepts
Collaboration for Life
Victoria
owner Jorn Bettin
car
Volkswagen
Matches: < 3
8. User: Select a concept to reduce the search space
Collaboration for Life
Victoria
owner Jorn Bettin
car
Volkswagen
?
Matches: < 3
9. System: Suggest statistically related 3rd level concepts
Collaboration for Life
Victoria
owner Jorn Bettin
car
Volkswagen
Golf
Matches: 1
10. User: Select a concept to reduce the search space
Collaboration for Life
Example 2Semantic Seach
Collaboration for Life
Victoria
owner John Smith
car
Volkswagen
Golf
What if the name of the owner is highly ambiguous?
Matches: < 3
Collaboration for Life
Victoria
owner John Smith
car
Volkswagen
Golf ?
11. System: Suggest statistically related 2nd level concepts
Matches: < 3
Collaboration for Life
Victoria
owner John Smith
car
Volkswagen
Golf
AustraliaMatches:
1
12. User: Select a concept to reduce the search space
Collaboration for Life
Example 3Semantic Seach
Collaboration for Life
Matches: ∞
1. System: Ask the user for one or two concepts
?
?
Collaboration for Life
Matches: < 100,000,000,000car
2. User: Nominate one concept to reduce the search space
Collaboration for Life
Matches: < 100,000,000,000car
?
3. System: Suggest statistically related concepts
Collaboration for Life
ABC 123
car Matches: < 100
4. User: Select or nominate a concept to reduce the search space
Collaboration for Life
ABC 123
car
?
Matches: < 100
5. System: Suggest statistically related 2nd level concepts
Collaboration for Life
ABC 123
car
number plate
6. User: Select a concept to reduce the search space
Matches: < 10
Collaboration for Life
car ?
number plate ABC 123
Matches: < 10
7. System: Suggest statistically related concepts
Collaboration for Life
Victoriacar
number plate ABC 123
Matches: 1
8. User: Select a concept to reduce the search space
Collaboration for Life
Example 4Semantic Seach
Collaboration for Life
Matches: ∞
??
1. System: Ask the user for one or two concepts
Collaboration for Life
Matches: < 1,000,000
architecture
software
2. User: Nominate two concepts to reduce the search space
Collaboration for Life
Matches: < 1,000,000
architecture
software
?
3. System: Suggest statistically related concepts
Collaboration for Life
4. User: Select or nominate a concept to reduce the search space
architecture
Melbourne
software
Matches: < 1,000
Collaboration for Life
Matches: < 3,000
architecture
Sydney
Melbourne
software
5. User: Select or nominate concept to enlarge the search space
Collaboration for Life
Matches: < 3,000
architecture
Sydney
Melbourne
software
?
6. System: Suggest statistically related concepts
Collaboration for Life
Matches: < 1,500
architecture
Sydney
Melbourne
software
7. User: Select a concept to reduce the search space
product development
Collaboration for Life
8. User: Connect concepts to reduce the search space
architecture
Sydney
Melbourneproduct development
software
Matches: < 1,000
Collaboration for Life
architecture
Sydney
Melbourneproduct development
software
Matches: < 300
9. User: Connect concepts to reduce the search space
Collaboration for Life
architecture
Sydney
Melbourneproduct development
software?
Matches: < 300
10. System: Suggest statistically related 2nd level concepts
Collaboration for Life
architecture
Sydney
Melbourneproduct development
software
Matches: < 200
Web
11. User: Select a 2nd level concept to reduce the search space
Collaboration for Life
architecture
Sydney
Melbourneproduct development
software
Matches: < 200
?
Web
12. System: Suggest statistically related 2nd level concepts
Collaboration for Life
architecture
Sydney
Melbourne
mobile
product development
software
Matches: < 100
Web
13. User: Select a 2nd level concept to reduce the search space
Collaboration for Life
architecture
Sydney
Melbourne
mobile
product development
Websoftware
?
Matches: < 100
14. System: Suggest statistically related concepts
Collaboration for Life
architecture
permanent
Sydney
Melbourne
mobile
product development
Websoftware
Matches: < 50
15. User: Select a concept to reduce the search space
Collaboration for Life
architecture
permanent
Sydney
Melbourne
mobile
product development
Websoftware
Matches: < 50
?
16. System: Suggest statistically related concepts
Collaboration for Life
architecture
permanent
Sydney
Melbourne
mobile
product development
Websoftware
Matches: < 20
senior
17. User: Select a concept to reduce the search space
Collaboration for Life
architecture
permanent
Sydney
Melbourne
mobile
product development
Websoftware
Matches: < 20
senior
?
18. System: Suggest statistically related 2nd level concepts
Collaboration for Life
architecture
permanent
Sydney
Melbourne
mobile
product development
Websoftware
senior
semantic search
Matches: 1
19. User: Select a 2nd level concept to reduce the search space
Collaboration for Life
Semantic search can be fixed
http://s23m.org/S23M/cell-platform.html
Cell Platform
Collaboration for Life
The latest book on Domain EngineeringDomain Engineering is of considerable practical significance, as it provides methods and techniques that help reduce time-to-market, development costs, and project risks on one hand, and helps improve system quality and performance on a consistent basis on the other.
• The most comprehensive and up-to-date work on domain engineering
• Covers all important technological aspects, including software product lines, domain-specific languages, and conceptual modeling
• Introduces novel approaches and techniques, and includes a wealth of pointers for further research
• ISBN 978-3-642-36653-6, published 2013
http://www.springer.com/computer/swe/book/978-3-642-36653-6
Collaboration for Life
S23M
info @ s23m.com
Nothing beats capturing the knowledge flow of leading domain experts to co-create
organisations & systems that are understandable by future generations of
humans & software tools.
Semantic SearchS23M knows what you mean!