New Approach to Personal Network Search Based on Information Extraction

Post on 27-Jun-2015


1

Personal Social Network: A New Approach to Personal Network Search Based on Information Extraction

Jie Tang, Mingcai Hong, Jing Zhang, Bangyong Liang, and Juanzi Li

Knowledge Engineering Group, Department of Computer Science and Technology, Tsinghua University

Sep. 5th, 2006

2

Personal Social Network

• Personal social network is an important research area.
• A person usually has different types of information:
  – Personal profile (including portrait, homepage, position, affiliation, publications, and documents)
  – Contact information (including address, email, telephone, and fax number)
  – Friends
• Unfortunately, the information is often hidden in heterogeneous and distributed web pages.

3

Our Approach

• Personal Social Network = Building + Search + Mining
  – Building: doc collection, annotation, integration
  – Search: person search, publication search, association search
  – Mining: expert finding, research interest finding

4

Processing Flow

[Flow diagram, labels in apparent order: Query → (submitted to) → Returned pages → (fed to) → Classification Model → (extracting and saving to) → Ontology base]

5

Building the Personal Network

• >400,000 persons
• >700,000 publications

6

Annotation using SVMs

• Annotation targets:
  – Personal profile: e.g. image, affiliation, etc.
  – Contact information: fax, email, phone, etc.
• Two SVM models: a start line (start position) model and an end line (end position) model
• Feature extraction, applied to both training data and test data:
  – Position feature
  – Positive word feature
  – ...
• Feature sets: start line feature set, end line feature set
• Output: identified blocks (identified info.)
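The slide names two per-line models (start line, end line) and two feature types, but not how the pieces fit together. The following is a minimal sketch under stated assumptions: `POSITIVE_WORDS`, `line_features`, and `pair_blocks` are hypothetical names, the cue-word list is invented for illustration, and the pairing rule (each predicted start line is matched to the nearest predicted end line at or after it) is an assumption rather than the authors' method. The SVM classifiers themselves are left out; any binary classifier's per-line decisions could be supplied as the two flag lists.

```python
import re

# Hypothetical cue words for contact information; not taken from the slides.
POSITIVE_WORDS = {"email", "fax", "phone", "tel", "address"}


def line_features(lines, index):
    """Return a small feature dict for one line of a page."""
    tokens = set(re.findall(r"[a-z]+", lines[index].lower()))
    return {
        # Position feature: relative position of the line within the page.
        "position": index / max(len(lines) - 1, 1),
        # Positive word feature: does the line contain any cue word?
        "has_positive_word": bool(tokens & POSITIVE_WORDS),
    }


def pair_blocks(start_flags, end_flags):
    """Pair each predicted start line with the nearest predicted end line
    at or after it (assumed pairing rule). Returns (start, end) index pairs."""
    blocks = []
    n = len(start_flags)
    i = 0
    while i < n:
        if start_flags[i]:
            j = i
            while j < n and not end_flags[j]:
                j += 1
            if j < n:
                blocks.append((i, j))
                i = j + 1  # continue scanning after the closed block
                continue
        i += 1
    return blocks


if __name__ == "__main__":
    page = ["Some Person", "Email: someone@example.org", "Fax: 000-0000", "Publications"]
    print([line_features(page, i) for i in range(len(page))])
    # Flags as a start-line and an end-line classifier might emit them.
    print(pair_blocks([False, True, False, False], [False, False, True, False]))
```

Keeping the start and end decisions separate means a block can span several lines, which suits multi-line items such as a postal address.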

7

Person Search

Search for a person using the name or other information, e.g. affiliation

8

Publication Search

Searching for a publication using an IR model
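The slide does not say which IR model is used. As an illustration only, the sketch below assumes a plain TF-IDF sum over publication titles, without length normalization; `tokenize` and `rank_publications` are hypothetical names, and the smoothed IDF formula is one common choice, not something stated in the slides.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def rank_publications(query, titles):
    """Rank titles by a TF-IDF score against the query (assumed IR model).
    Titles sharing no term with the query are dropped."""
    docs = [Counter(tokenize(t)) for t in titles]
    n = len(docs)
    # Document frequency: number of titles containing each term.
    df = Counter(term for doc in docs for term in doc)
    scored = []
    for idx, doc in enumerate(docs):
        score = 0.0
        for term in set(tokenize(query)):
            if term in doc:
                idf = math.log((n + 1) / (df[term] + 1)) + 1.0  # smoothed IDF
                score += doc[term] * idf
        scored.append((score, idx))
    # Highest score first; ties keep the original title order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [titles[idx] for score, idx in scored if score > 0]


if __name__ == "__main__":
    titles = [
        "Information extraction from web pages",
        "Support vector machines for text",
        "Expert finding in social networks",
    ]
    print(rank_publications("information extraction", titles))
```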

9

Publication Online-View

10

Association Search

• Finding associations between persons
  – High efficiency
  – Top-K associations
• Usage:
  – To find a partner
  – To find a person with the same interests
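The slides do not describe the association search algorithm. One way to read "Top-K associations" is as the K shortest chains of relations linking two persons; the sketch below makes that assumption and enumerates simple paths best-first by hop count. `top_k_associations` and the example graph are hypothetical, and this naive enumeration is not meant to reflect the efficiency the slide claims.

```python
import heapq


def top_k_associations(graph, source, target, k):
    """Return up to k shortest simple paths from source to target.

    graph maps each person to the persons directly related to them
    (for example, co-authors). Paths are ordered by number of hops.
    """
    paths = []
    # Heap entries: (hops so far, path as a tuple of person names).
    heap = [(0, (source,))]
    while heap and len(paths) < k:
        hops, path = heapq.heappop(heap)
        node = path[-1]
        if node == target:
            paths.append(list(path))
            continue
        for neighbor in graph.get(node, ()):
            if neighbor not in path:  # keep paths simple: no repeated person
                heapq.heappush(heap, (hops + 1, path + (neighbor,)))
    return paths


if __name__ == "__main__":
    # Hypothetical relation graph.
    graph = {
        "A": ["B", "C"],
        "B": ["A", "D"],
        "C": ["A", "D"],
        "D": ["B", "C", "E"],
        "E": ["D"],
    }
    print(top_k_associations(graph, "A", "D", 2))
    # -> [['A', 'B', 'D'], ['A', 'C', 'D']]
```

On a large network the number of simple paths grows quickly, so a practical system would need pruning or a dedicated K-shortest-paths algorithm.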

11

Expert Finding

Finding experts on a topic

12

Research Interest Finding

Finding research interests for a person

13

Homepage:

http://keg.cs.tsinghua.edu.cn/persons/tj