Information Complexity Lower Bounds


Rotem Oshman, Princeton CCI
Based on: Bar-Yossef, Jayram, Kumar, Srinivasan '04; Barak, Braverman, Chen, Rao '10

Communication Complexity

[Figure: Alice holds input X, Bob holds input Y; they exchange messages to compute f(X, Y) = ?]

Yao ‘79, “Some complexity questions related to distributive computing”

• Applications:
– Circuit complexity
– Streaming algorithms
– Data structures
– Distributed computing
– Property testing
– …

Communication Complexity

Deterministic Protocols

• A protocol specifies, at each point:
– Which player speaks next
– What the player should say
– When to halt and what to output

• Formally, a protocol is a function that maps the transcript so far (what we've said so far) to who speaks next (Alice, Bob, or ⊥ = halt), together with, for each player, a function mapping (input, transcript so far) to what to say/output.

Randomized Protocols

• Can use randomness to decide what to say
– Private randomness: each player has a separate source of random bits
– Public randomness: both players can use the same random bits
• Goal: for any input (X, Y), compute f(X, Y) correctly with probability ≥ 2/3
• Communication complexity: worst-case length of the transcript in any execution

Randomness Can Help a Lot

• Example: EQUALITY
– Input: X, Y ∈ {0,1}ⁿ
– Output: is X = Y?
• Trivial protocol: Alice sends X to Bob (n bits)
• For deterministic protocols, this is optimal!

EQUALITY Lower Bound

[Figure: the 2ⁿ × 2ⁿ communication matrix of EQUALITY, with rows and columns indexed 0ⁿ, …, 1ⁿ and 1s exactly on the diagonal.]

• A deterministic protocol partitions the matrix into monochromatic rectangles
• No rectangle can contain two of the diagonal 1s, so #rectangles ≥ 2ⁿ, and the protocol must communicate ≥ n bits

Randomized Protocol

• Protocol with public randomness:
– Select a random r ∈ {0,1}ⁿ
– Alice sends the single bit ⟨X, r⟩ = Σᵢ Xᵢ rᵢ (mod 2)
– Bob accepts iff ⟨Y, r⟩ = ⟨X, r⟩
• If X = Y: always accept
• If X ≠ Y: X ⊕ Y is a non-zero vector, so ⟨X ⊕ Y, r⟩ = 1 with probability 1/2
• Reject with probability 1/2 (amplify by repetition)
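A minimal Python sketch of this protocol (the function name and the repetition count `trials` are illustrative choices, not from the slides):

```python
import random

def equality_protocol(X, Y, trials=20):
    # Public-coin protocol for EQUALITY: Alice sends one bit per trial
    # (<X, r> mod 2), so the total communication is `trials` bits.
    n = len(X)
    for _ in range(trials):
        r = [random.randint(0, 1) for _ in range(n)]        # public randomness
        alice_bit = sum(x * ri for x, ri in zip(X, r)) % 2  # Alice's message
        bob_bit = sum(y * ri for y, ri in zip(Y, r)) % 2
        if alice_bit != bob_bit:
            return False  # certainly X != Y
    # Each trial catches X != Y with probability 1/2,
    # so the error probability is at most 2^-trials.
    return True
```

With 20 trials the protocol errs with probability at most 2⁻²⁰ while communicating only 20 bits, versus the n bits required deterministically.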

Set Disjointness

• Input: X, Y ⊆ [n] (equivalently, X, Y ∈ {0,1}ⁿ)
• Output: is X ∩ Y = ∅?
• Theorem [Kalyanasundaram, Schnitger '92, Razborov '92]: randomized CC = Ω(n)
– Easy to see for deterministic protocols
• Today we'll see a proof by Bar-Yossef, Jayram, Kumar, Srinivasan '04

Application: Streaming Lower Bounds

• Streaming algorithm:

[Figure: a stream of data items fed one-by-one to an algorithm with bounded memory. How much space is required to approximate f(data)?]

• Example: how many distinct items are in the data?
• Reduction from Disjointness [Alon, Matias, Szegedy '99]

Reduction from Disjointness:

• Fix a streaming algorithm for Distinct Elements with space s and universe size n
• Construct a protocol for Disj. with n elements:
– Alice runs the algorithm on her set X = {x₁, …, x_k}, then sends the state of the algorithm and k (#bits = s + log n)
– Bob continues the run on his set Y = {y₁, …, y_ℓ}
– X ∩ Y = ∅ ⇔ #distinct elements in the combined stream is k + ℓ
– Hence s = Ω(n) by the Disjointness lower bound
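To make the reduction concrete, here is a hedged Python sketch. `DistinctElements` stands in for an arbitrary streaming algorithm; its exact, set-based state is a placeholder assumption (a real space-s algorithm would keep an approximate sketch), but the protocol mechanics are exactly the reduction above.

```python
class DistinctElements:
    # Placeholder streaming algorithm: its internal state is what Alice
    # ships to Bob. A real algorithm would keep a small sketch instead.
    def __init__(self):
        self.state = set()

    def process(self, item):
        self.state.add(item)

    def query(self):
        return len(self.state)

def disjointness_via_streaming(X, Y):
    alg = DistinctElements()
    for x in X:                      # Alice streams her set...
        alg.process(x)
    state, k = alg.state, len(X)     # ...and sends (state, k) to Bob

    bob = DistinctElements()
    bob.state = state                # Bob resumes from Alice's state
    for y in Y:
        bob.process(y)
    # The combined stream has |X| + |Y| distinct items iff X, Y are disjoint.
    return bob.query() == k + len(Y)
```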

Application 2: KW Games

• Circuit depth lower bounds:

• How deep does the circuit need to be?

[Figure: a Boolean circuit of ∧ and ∨ gates over inputs x₁ … xₙ, computing f(x₁, …, xₙ).]

Application 2: KW Games

• Karchmer-Wigderson’93,Karchmer-Raz-Wigderson’94:

– Alice gets X with f(X) = 0; Bob gets Y with f(Y) = 1
– Goal: find a coordinate i such that Xᵢ ≠ Yᵢ

Application 2: KW Games

• Claim: if the KW game for f has deterministic CC ≥ d, then f requires circuit depth ≥ d.
• Circuit with depth d ⇒ protocol with length d

[Figure: walking down the circuit. At an ∧ gate Alice, whose input satisfies f(X) = 0, names a child gate that evaluates to 0 on X; at an ∨ gate Bob names a child that evaluates to 1 on Y. After at most depth-many steps they reach a literal xᵢ on which X and Y differ.]

Information-Theoretic Lower Bound on Set Disjointness

Some Basic Concepts from Info Theory

• Entropy of a random variable: H(X) = −Σₓ Pr[X = x] · log Pr[X = x]
• Important properties:
– H(X) ≥ 0
– H(X) = 0 ⇔ X is deterministic
– H(X) = expected # bits needed to encode X (up to +1)
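As a sanity check, a few lines of Python (an illustrative helper, not from the slides) computing H for some simple distributions:

```python
import math

def entropy(dist):
    # Shannon entropy H(X) = -sum p*log2(p), for a distribution given
    # as a list of probabilities (terms with p in {0, 1} contribute 0).
    return -sum(p * math.log2(p) for p in dist if 0 < p < 1)

print(entropy([0.5, 0.5]))        # fair coin: 1.0 bit
print(entropy([1.0]))             # deterministic: 0 bits
print(entropy([1/3, 1/3, 1/3]))   # uniform on 3 values: log2(3) = 1.58...
```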

Some Basic Concepts from Info Theory

• Conditional entropy: H(X | Y) = Σ_y Pr[Y = y] · H(X | Y = y)
• Important properties:
– H(X | Y) ≤ H(X)
– H(X | Y) = H(X) ⇔ X, Y are independent
• Example: X uniform on {0,1}², Y = X₁ ∧ X₂
– If Y = 0 then H(X | Y = 0) = log 3; if Y = 1 then H(X | Y = 1) = 0

Some Basic Concepts from Info Theory

• Mutual information: I(X; Y) = H(X) − H(X | Y) = H(Y) − H(Y | X)

• Conditional mutual information: I(X; Y | Z) = H(X | Z) − H(X | Y, Z)

• Important properties:
– I(X; Y) ≥ 0
– I(X; Y) = I(Y; X)
– I(X; Y) = 0 ⇔ X, Y are independent

Some Basic Concepts from Info Theory

• Chain rule for mutual information: I(X₁, X₂; Y) = I(X₁; Y) + I(X₂; Y | X₁)

• More generally, I(X₁, …, Xₙ; Y) = Σᵢ I(Xᵢ; Y | X₁, …, Xᵢ₋₁)
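For completeness, the two-variable case follows mechanically from the definitions (a standard textbook derivation, not verbatim from the slides):

```latex
\begin{align*}
I(X_1, X_2; Y) &= H(X_1, X_2) - H(X_1, X_2 \mid Y)\\
&= H(X_1) + H(X_2 \mid X_1) - H(X_1 \mid Y) - H(X_2 \mid X_1, Y)\\
&= \underbrace{H(X_1) - H(X_1 \mid Y)}_{I(X_1; Y)}
 + \underbrace{H(X_2 \mid X_1) - H(X_2 \mid X_1, Y)}_{I(X_2; Y \mid X_1)}.
\end{align*}
```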

Information Cost of Protocols

• Fix an input distribution μ on (X, Y)
• Given a protocol π, let Π also denote the distribution of π's transcript
• Information cost of π: IC_μ(π) = I(Π; X | Y) + I(Π; Y | X)
• Information cost of a function f: IC_μ(f) = inf over protocols π computing f of IC_μ(π)

Information Cost of Protocols

• Important property: I(Π; X | Y) = Σᵣ I(Πᵣ; X | Y, Π₁, …, Πᵣ₋₁) (and similarly for I(Π; Y | X))
• Proof: by induction, using the chain rule. Let Πᵣ denote the message of round r.
– (Π₁, …, Πᵣ): what we know after r rounds
– (Π₁, …, Πᵣ₋₁): what we knew after r − 1 rounds
– I(Πᵣ; X | Y, Π₁, …, Πᵣ₋₁): what we learn in round r, given what we already know

Information vs. Communication

• Want: IC_μ(π) ≤ |π| (the number of bits communicated)
• Suppose bit Πᵣ is sent by Alice.
• What does Alice learn? Nothing:
– Πᵣ is a function of X and Π₁, …, Πᵣ₋₁, so I(Πᵣ; Y | X, Π₁, …, Πᵣ₋₁) = 0
• What does Bob learn? At most one bit:
– I(Πᵣ; X | Y, Π₁, …, Πᵣ₋₁) ≤ H(Πᵣ) ≤ 1

Information vs. Communication

• Important property: IC_μ(π) ≤ CC(π)
• A lower bound on information cost ⇒ a lower bound on communication complexity
• In fact, IC lower bounds are the most powerful technique we know

Information Complexity of Disj.

• Disjointness: is X ∩ Y = ∅, i.e. ⋁ᵢ (Xᵢ ∧ Yᵢ) = 0?
• Strategy: for some "hard distribution" μ,
1. Direct sum: IC_{μⁿ}(Disj) ≥ n · IC_μ(AND)
2. Prove that IC_μ(AND) = Ω(1).

Hard Distribution for Disjointness

• For each coordinate i, (Xᵢ, Yᵢ) is drawn independently:
– (Xᵢ, Yᵢ) = (0, 0) with probability 1/3
– (Xᵢ, Yᵢ) = (0, 1) with probability 1/3
– (Xᵢ, Yᵢ) = (1, 0) with probability 1/3
– (Xᵢ, Yᵢ) = (1, 1) with probability 0
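A few lines of Python make μ concrete (an illustrative sampler, not part of the proof):

```python
import random

def sample_mu(n):
    # Each coordinate is drawn independently and uniformly from
    # {(0,0), (0,1), (1,0)} -- the pair (1,1) never occurs, so under
    # mu the sets X and Y are always disjoint.
    pairs = [(0, 0), (0, 1), (1, 0)]
    coords = [random.choice(pairs) for _ in range(n)]
    X = [a for a, _ in coords]
    Y = [b for _, b in coords]
    return X, Y
```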

IC_{μⁿ}(Disj) ≥ n · IC_μ(AND)

• Let π be a protocol for Disj on inputs from μⁿ
• Construct ρ for AND as follows:
– Alice and Bob get inputs U, V ∈ {0,1}
– Choose a random coordinate i, set (Xᵢ, Yᵢ) = (U, V)
– Sample the remaining coordinates and run π(X, Y)
– For each j ≠ i, how should (Xⱼ, Yⱼ) ~ μ be sampled?

IC_{μⁿ}(Disj) ≥ n · IC_μ(AND)

• Let π be a protocol for Disj on inputs from μⁿ
• Construct ρ for AND as follows:
– Alice and Bob get inputs U, V ∈ {0,1}
– Choose a random coordinate i, set (Xᵢ, Yᵢ) = (U, V)
– Bad idea: publicly sample (X₋ᵢ, Y₋ᵢ)
• Suppose that in π, Alice sends the single bit ⊕ⱼ Xⱼ
– In π, Bob learns one bit about all of X; in ρ he should learn only ≈ 1/n bits about U
– But if X₋ᵢ is public, Bob learns a full bit about U!

IC_{μⁿ}(Disj) ≥ n · IC_μ(AND)

• Another bad idea: publicly sample X₋ᵢ, and Bob privately samples Y₋ᵢ given X₋ᵢ
– X₋ᵢ is still public, so the same problem remains
– We would rather have each player sample its own coordinates privately
– But the players can't sample Xⱼ, Yⱼ independently: under μ they are correlated…

IC_{μⁿ}(Disj) ≥ n · IC_μ(AND)

• Solution: break the correlation with an auxiliary variable
– For each j ≠ i, publicly sample an "owner" Dⱼ ∈ {Alice, Bob}
– If Dⱼ = Alice: set Yⱼ = 0, and Alice privately samples Xⱼ (Xⱼ = 1 with probability 2/3)
– If Dⱼ = Bob: set Xⱼ = 0, and Bob privately samples Yⱼ (Yⱼ = 1 with probability 2/3)
– Conditioned on Dⱼ, the players' samples are independent, and (Xⱼ, Yⱼ) ~ μ

Direct Sum Theorem

• Transcript of ρ: the transcript Π of π, together with the public randomness (i, D₋ᵢ)
• Need to show: IC_μ(ρ) ≤ (1/n) · IC_{μⁿ}(π)

Information Complexity of Disj.

• Disjointness: is X ∩ Y = ∅, i.e. ⋁ᵢ (Xᵢ ∧ Yᵢ) = 0?
• Strategy: for some "hard distribution" μ,
1. Direct sum: IC_{μⁿ}(Disj) ≥ n · IC_μ(AND) ✓
2. Prove that IC_μ(AND) = Ω(1).

Hardness of AND

[Figure: the 2×2 table of AND inputs (U, V), with μ-probabilities 1/3 on (0, 0), (0, 1), (1, 0) and 0 on (1, 1).]

• On (0, 0), (0, 1), (1, 0) the output is U ∧ V = 0; only on (1, 1) is it 1
• So the transcript on (1, 1) should be "very different" from the transcripts on the other inputs

Hellinger Distance

• Hellinger distance: h(P, Q) = (1/√2) · ‖√P − √Q‖₂, i.e. h²(P, Q) = 1 − Σₓ √(P(x) · Q(x))
• Examples:
– If P = Q: h(P, Q) = 0
– If P, Q have disjoint support: h(P, Q) = 1
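A short Python helper (illustrative, not from the slides) that matches both examples:

```python
import math

def hellinger(P, Q):
    # Hellinger distance between distributions given as dicts mapping
    # outcome -> probability: h^2(P, Q) = 1 - sum_x sqrt(P(x) * Q(x)).
    support = set(P) | set(Q)
    bc = sum(math.sqrt(P.get(x, 0) * Q.get(x, 0)) for x in support)
    return math.sqrt(max(0.0, 1.0 - bc))  # clamp for float round-off

print(hellinger({'a': 1.0}, {'a': 1.0}))  # identical: 0.0
print(hellinger({'a': 1.0}, {'b': 1.0}))  # disjoint support: 1.0
```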

Hellinger Distance

• Hellinger distance is a metric
– h(P, Q) ≥ 0, with equality iff P = Q
– h(P, Q) = h(Q, P)
– Triangle inequality: h(P, R) ≤ h(P, Q) + h(Q, R)

[Figure: three distributions P, Q, R illustrating the triangle inequality.]

Hellinger Distance

• If the protocol must answer differently on some input (u, v) than on (1, 1), then the transcript distributions must be far apart:

h(Π_{uv}, Π₁₁) ≥ 2/(3√2)

[Figure: the 2×2 AND table, marking the inputs that must be distinguished from (1, 1).]

Hellinger Distance vs. Mutual Info

• Let Π₀, Π₁ be two distributions
• Select Z by choosing B ∈ {0, 1} uniformly, then drawing Z ~ Π_B
• Then I(B; Z) ≥ h²(Π₀, Π₁)
• Under μ, conditioned on X = 0 (resp. Y = 0), the other player's input is uniform, so:

I(Π; Y | X = 0) ≥ h²(Π₀₀, Π₀₁)
I(Π; X | Y = 0) ≥ h²(Π₀₀, Π₁₀)

Hardness of AND

• Putting it together:
– IC_μ(π) ≥ I(Π; Y | X = 0) + I(Π; X | Y = 0) ≥ h²(Π₀₀, Π₀₁) + h²(Π₀₀, Π₁₀) (up to constants)
– Triangle inequality: h²(Π₀₀, Π₀₁) + h²(Π₀₀, Π₁₀) ≥ (1/2) · h²(Π₀₁, Π₁₀)
– Cut-and-paste lemma: h(Π₀₁, Π₁₀) = h(Π₀₀, Π₁₁)
• Intuition: on (0, 1) and (1, 1) the transcript is the same for Alice until Bob acts differently; on (1, 0) and (1, 1) it is the same for Bob until Alice acts differently
• Correctness on (0, 0) vs. (1, 1) forces h(Π₀₀, Π₁₁) ≥ 2/(3√2), so IC_μ(AND) = Ω(1)

“Cut-n-Paste Lemma”

• Recall: we want h(Π₀₁, Π₁₀) = h(Π₀₀, Π₁₁)
• Enough to show: we can write Pr[Π(x, y) = t] = q_A(x, t) · q_B(y, t), since then √(Π₀₁(t) · Π₁₀(t)) = √(Π₀₀(t) · Π₁₁(t)) for every transcript t

“Cut-n-Paste Lemma”

• We can write Pr[Π(x, y) = t] = q_A(x, t) · q_B(y, t)

• Proof:
– π induces a distribution on "partial transcripts" of each length ℓ: the probability that the first ℓ bits are t
– By induction on ℓ: Pr[first ℓ bits are t] = q_A(x, t) · q_B(y, t)
• Base case: ℓ = 0 (the empty transcript)
– Set q_A(x, ∅) = q_B(y, ∅) = 1

“Cut-n-Paste Lemma”

• Step: suppose the claim holds for partial transcripts t of length ℓ
• Suppose that after t, it is Alice's turn to speak
• What Alice says depends on:
– Her input x
– Her private randomness
– The transcript so far, t

• So Pr[next bit is b] = p(b | x, t) does not depend on y
• Set q_A(x, t∘b) = q_A(x, t) · p(b | x, t) and q_B(y, t∘b) = q_B(y, t)
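Spelled out in LaTeX, the induction step assembles as follows (same q_A, q_B as above; this expansion is mine, not verbatim from the slides):

```latex
\Pr\!\left[\Pi(x,y)_{\le \ell + 1} = t \circ b\right]
  = \Pr\!\left[\Pi(x,y)_{\le \ell} = t\right] \cdot p(b \mid x, t)
  = \underbrace{q_A(x, t)\, p(b \mid x, t)}_{q_A(x,\; t \circ b)}
    \cdot \underbrace{q_B(y, t)}_{q_B(y,\; t \circ b)}.
```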

Hardness of AND

• Recap: under μ (probabilities 1/3, 1/3, 1/3, 0 on the 2×2 AND table), the cut-and-paste lemma gives h(Π₀₁, Π₁₀) = h(Π₀₀, Π₁₁) ≥ 2/(3√2)
• Hence IC_μ(AND) = Ω(1), and by the direct sum, IC_{μⁿ}(Disj) = Ω(n)

Multi-Player Communication Complexity

The Coordinator Model

[Figure: k sites with inputs X₁, X₂, …, X_k, each connected to a coordinator; they exchange bits with the coordinator to compute f(X₁, …, X_k) = ?]

Multi-Party Set Disjointness

• Input: X₁, …, X_k ⊆ [n]
• Output: is X₁ ∩ ⋯ ∩ X_k = ∅?
• Braverman, Ellen, O., Pitassi, Vaikuntanathan '13: lower bound of Ω(nk) bits

Reduction from DISJ to graph connectivity

• Given X₁, …, X_k, we want to:
– Choose vertices
– Design inputs (edges for each player) such that the graph is connected iff ⋂ᵢ Xᵢ = ∅

Reduction from DISJ to graph connectivity

[Figure: a graph on players p₁, …, p_k (Players) and elements 1, …, 6 (Elements); each player pᵢ's edges encode its set Xᵢ, elements in [n] ∖ ⋃ᵢ Xᵢ are treated separately, and the construction guarantees: input graph connected ⇔ ⋂ᵢ Xᵢ = ∅.]
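One concrete way to realize such a reduction, sketched in Python below. The gadget here — a fixed, input-independent path through the players, plus an edge pᵢ–e whenever e ∉ Xᵢ, so that elements in [n] ∖ ⋃ᵢ Xᵢ touch every player — is my illustrative choice and may differ from the construction in the paper, but it satisfies the stated equivalence, and each player's edges depend only on its own input.

```python
def build_graph(sets, n):
    # Vertices: players p_0..p_{k-1} and elements 0..n-1.
    # Edges: a path through the players (independent of the input), and
    # p_i -- e whenever element e is NOT in X_i. An element is isolated
    # iff it belongs to every X_i, hence:
    #   graph connected  <=>  the sets X_i have empty intersection.
    k = len(sets)
    vertices = [('p', i) for i in range(k)] + [('e', e) for e in range(n)]
    adj = {v: set() for v in vertices}
    for i in range(k - 1):                       # the player path
        adj[('p', i)].add(('p', i + 1))
        adj[('p', i + 1)].add(('p', i))
    for i, X in enumerate(sets):                 # player i's own edges
        for e in range(n):
            if e not in X:
                adj[('p', i)].add(('e', e))
                adj[('e', e)].add(('p', i))
    return adj

def connected(adj):
    # Depth-first search from an arbitrary vertex.
    start = next(iter(adj))
    seen, stack = {start}, [start]
    while stack:
        for w in adj[stack.pop()]:
            if w not in seen:
                seen.add(w)
                stack.append(w)
    return len(seen) == len(adj)

print(connected(build_graph([{0, 1}, {1, 2}, {2, 3}], 4)))  # empty intersection: True
print(connected(build_graph([{0, 1}, {1, 2}, {1, 3}], 4)))  # 1 in every set: False
```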

Other Stuff

• Distributed computing

Other Stuff

• Compressing down to information cost
• Number-on-forehead lower bounds
• Open questions in communication complexity