Homework 1 is out - Stanford...

Homework 1 is out: It is long! Start early. Analyze a co‐authorship network Due in 1 week (Oct 8 in class) Due in 1 week (Oct 8 in class)

Project proposals are due in 2 weeks! Groups of max 3 studentsGroups of max 3 students 1 proposal per group, max 3 pages Read at least 3 papers from the Easley‐Kleinberg book S i th d ib t / k i t d t i Summarize them, describe strong/weak points and extensions

Propose what you want to do (can be one of the following): Experimental evaluation Theoretical project model simulation Theoretical project, model, simulation In‐depth critical survey

10/1/2009 Jure Leskovec, Stanford CS322: Network Analysis

CS 322: (Social and Information) Network AnalysisJure LeskovecStanford University

[Adamic‐Adar 2005]

Adamic‐Adar 2005: HP Labs email logs (436 people) Link if u,v exchanged >5 emails each way

Map of the organization hierarchy Finding:

P(uv) ~ 1 / (size of smallest t i i d )3/4group containing u and v)3/4

Differences: Data has weighted edges Cubicle

locations

Org. chart

g g Data has people on non‐leaf nodes Data not b‐ary or uniform depth

Q H d t thi t

locations

Q: How can we adapt this to weighted graphs?

[Adamic‐Adar 2005]

Search strategies using degree, hierarchy, distance between the cubicles

Probability of link vs. distance in the hierarchy

[Liben‐Nowell et al. ‘05]

Liben Nowell et al ’05:Liben‐Nowell et al 05: Live Journal data (blogers+zip codes)(blogers+zip codes)

Link length in a network of bloggers Link length in a network of bloggers (0.5 million bloggers, 4 million links)

Solution:Rank based friendshipRank based friendship

Close to Close to theoretical optimum of ‐1optimum of ‐1

The difference between the East and West coast disappears

Decentralized search in a LiveJournal network 12% searches finish, average 4.12 hops

Why is rank exponent close to ‐1? Why in any network? Why online? How robust/reproducible?

Conjecture [Sandbeng‐Clark 2007]: Nodes in a ring with random edges Process of morphing links Process of morphing links Update step: Randomly choose s, t, run decentralized search alg. Path compression: each node on path Path compression: each node on path updates long range link to go directly to twith some small prob.

Conjecture from simulation:Conjecture from simulation: P(uv) ~ dist -1

Al ith iAlgorithmic consequence:

How to find files in Peer‐to‐Peer networks?

Napster existed Napster existed from June 1999 and July 2001y

Hybrid between P2P and a centralized network

Once layers got the t l tcentral server to

shut down the network fell apartnetwork fell apart

Networks that can’t be turned “off” Networks that can t be turned off BitTorrent ML d k ML‐donkez KazaaG ll Gnutella

C fi d fil i t kCan we find a file in a network without a central server?

Protocol Chord consistently maps key Protocol Chord consistently maps key (filename) to a node: Keys are files we are searching for Keys are files we are searching for Computer that keeps the key can then point to the true location of the filetrue location of the file

Keys and nodes have m‐bit IDs assigned to them:them: Node ID is a hash‐code of the IP address Key ID is a hash code of the file Key ID is a hash‐code of the file

Cycle with nodeN1K58

Cycle, with node ids 0 to 2m-1

Key k is assignedN8

K10 Key k is assigned to a node a(k)with ID k

N14N48

m=6with ID k

N21N42 N21

N32N38 K24K30

Assume we have N nodes and K keys (files) Assume we have N nodes and K keys (files).How many keys has each node?

When a node joins/leaves the system it only needs to talk to its immediate neighborsneeds to talk to its immediate neighbors When N+1 nodes join or leave, then only O(K/N) keys need to be rearrangedkeys need to be rearranged

Each node know the IP address of its immediate neighbor

If every nodeN1K58

If every node knows its immediate

K10immediate neighbor then use sequential search

N14N48

m=6sequential search

N21N42 N21

N32N38 K24K30

A node maintains a table of m=log(N) entries A node maintains a table of m=log(N) entries i‐th entry of a node n contains the address of

(n+2i)‐th neighbor(n+2 )‐th neighbor

Search algorithm: Search algorithm: Take the longest link that does not overshoot This way with each step we half the distance to the This way with each step we half the distance to the target

N8N56N8+1 = N14N8+2 = N14N8+4 = N14

N14N48

N51N8+4 N14N8+8 = N21N8+16 = N32N8 32 N 2

N48 N8+32 = N42

N32N38

N8N56N8+1 = N14N8+2 = N14N8+4 = N14

N14N48

N51N8+4 N14N8+8 = N21N8+16 = N32N8 32 N 2

N48 N8+32 = N42

N42+1 = N48N N 8 N21

N32N38

N42N42+2 = N48N42+4 = N48N42+8 = N51

N324 5N42+16 = N1N42+32 = N8

Search for a key in a network of N nodes Search for a key in a network of N nodesvisits O(log N) nodes

Assume that node n queries for key k Let the key k reside at node t Let the key k reside at node t

How many steps do we need to reach t? How many steps do we need to reach t?

We start the search at node n Let i be a number such that t is contained in interval [n+2i-1, n+2i][ , ]

Then the table at node n contains a pointer to node n+2i-1– the smallest node f from the finterval

Claim: f is closer to t than nf So, in one step we halved the distance to t We can do this at most log N timesg Thus, we find t in O(log N) steps

Hypercube:Hypercube: N = 2d

Edge u‐v if gbit‐string differs in 1 position

Properties: Properties: Low max‐degree: log N Low diameter: log Ng Good expansion: 1

(cut out 2d‐1 hypercube) Simplest graph with Simplest graph with above three properties

Homework 1 is out - Stanford...

Documents

Transcript of Homework 1 is out - Stanford...

Stanford Universitysnap.stanford.edu/class/cs322-2009/08-macro-evol-annot.pdf · Stanford University ... diam the same degree ... CS Math Drama Music

Decision Theory: Markov Decision Processeskevinlb/teaching/cs322 - 2009-10/Lectures/D… · RecapFinding Optimal PoliciesValue of Information, ControlMarkov Decision ProcessesRewards

Representation & Reasoning, Representational Dimensions ...mack/CS322/lectures/1-Intro2.pdf · Representation & Reasoning, Representational Dimensions, Course Overview Alan Mackworth

Stanford Universitysnap.stanford.edu/class/cs322-2009/17-ncp-annot.pdf · CS 322: (Social and Information) Network Analysis Jure Leskovec Stanford University. Define modularityto

CS322 Fall 2003: Programming Language Design Lecture Notes

CS322 Languages and Compiler Design IIapt/cs322/lecture10.pdfPSU CS322 SPR’12 LECTURE 10 c 1992–2012 ANDREW TOLMACH 13 MINIMIZING REGISTERS NEEDED TO EVALUATE EXPRESSION TREES

Logic: Intro & Propositional Definite Clause Logicmack/CS322/lectures/5-Logic1.pdf · Technique Uncertainty Decision Theory Course Module Variable Elimination Value Iteration ...

Logic: semantics, proof procedures, soundness and …mack/CS322/lectures/5-Logic2.pdfLecture Overview • Recap: Propositional Definite Clause Logic (PDCL) - Syntax - Semantics •

CS322: Database Systems Transactions

STANFORD GEOTHERMAL PROGRAM STANFORD UNIVERSITY … · STANFORD GEOTHERMAL PROGRAM STANFORD UNIVERSITY Stanford Geothermal Program Interdisciplinary Res ear ch in Engineering and

Home | Stanford HAIHome | Stanford HAI

CS322: Database Systems - Manas Khatuamanaskhatua.github.io/courses/CS322/DBMS_Lec7_Indexing...CS322: Database Systems Dr. Manas Khatua Assistant Professor Dept. of CSE IIT Jodhpur

Stanford Universitysnap.stanford.edu/na09/09-cascades-annot.pdf · Stanford University ... 10/20/2009 Jure Leskovec, Stanford CS322: Network Analysis

STANFORD UNIVERSITY STANFORD LINEAR ACCELERATOR

Stanford Universitysnap.stanford.edu/na09/12-influence-annot.pdf · Stanford University ... 10/29/2009 Jure Leskovec, Stanford CS322: Network Analysis 23

STANFORD GEOTHERMAL PROGRAM STANFORD ......STANFORD GEOTHERMAL PROGRAM STANFORD UNIVERSITY STANFORD, CALIFORNIA 94305 SGP-TR-35 SECOND ANNUAL REPORT TO U.S. DEPARTMENT …

STANFORD GEOTHERMAL PROGRAM STANFORD UNIVERSITY

CS322: Database Systemsmanaskhatua.github.io/courses/CS322/DBMS_Lec6... · • by Cost per unit of data • by Reliability ... to store the entire database • capacities of up to

STANFORD GEOTHERiMAL PROGRAiM STANFORD UNIVERSITY

CS322 Fall 2003: Programming Language Design Lecture Notesfsl.cs.illinois.edu/pubs/UIUCDCS-R-2003-2897.pdfGrigore Rosu - Fall 2003 - CS322 HW1 due, in class. HW1 Possible Solutions