The Computational Complexity of Games and Puzzles...time mathematical and algorithmic depth, they...

The Computational Complexity of Games and

Puzzles

Valia Mitsou

Abstract

The subject of my thesis is studying the algorithmic properties of oneand two-player games people enjoy playing, such as chess or Sudoku. Thisresearch falls into a wider area known as combinatorial game theory. Oneof the main questions asked about games in this context is whether they arealgorithmically tractable, that is, whether they can be solved with efficientalgorithms.

So far, more than fifty popular games have been categorized as intractable.This is probably because people enjoy games and puzzles for which designinga winning strategy or finding the solution requires some cleverness.

Contents

1 Introduction 2

2 Computational complexity background 4

2.1 Complexity classes for games and puzzles . . . . . . . . . . . . 4

2.2 The class PSPACE . . . . . . . . . . . . . . . . . . . . . . . . 6

2.2.1 Generalized Geography . . . . . . . . . . . . . . . . . . 7

2.3 Parameterized complexity primer . . . . . . . . . . . . . . . . 10

2.3.1 Structural Parameters . . . . . . . . . . . . . . . . . . 12

2.3.2 Parameterized Complexity for puzzles and 2-player games 15

3 The Game of UNO 16

3.1 Single player version of UNO . . . . . . . . . . . . . . . . . . . 17

3.2 Many-player uncooperative version . . . . . . . . . . . . . . . 20

4 The Game of SET 21

4.1 One round of SET . . . . . . . . . . . . . . . . . . . . . . . . 22

4.2 Multi-round versions . . . . . . . . . . . . . . . . . . . . . . . 25

4.3 A two-player game . . . . . . . . . . . . . . . . . . . . . . . . 27

1

1 Introduction

Combinatorial Game Theory is a very intriguing and widely developed field of

mathematics and computer science under the wide umbrella of Recreational

Mathematics. A combinatorial game is usually considered to be a 2-player

turn-based finite game with perfect information (no player hides any infor-

mation from her opponent) and no chance, like chess or go. Similarly, finite

1-player puzzles of no chance, like sudoku or mastermind also fall into this

category. Other than the practical applications that studying games and

puzzles has in the gaming market, combinatorial game theory is a field that

many people inside and outside the academic community find insteresting

and amusing.

Since many games and puzzles are entertaining while having at the same

time mathematical and algorithmic depth, they serve as a great teaching

tool. For example, Prisner in university of Maryland used a river crossing

puzzle in his Discrete Mathematics class. Many games and puzzles can be

directly reformulated as well-known important algorithmic problems. To give

some examples, solving the river crossing riddle we just mentioned is very

close to solving Vertex Cover, playing the game UNO as we present in

section 3 can be modeled as solving Hamiltonian Path, different versions

of the game SET as we present in section 4 can be reformulated as covering

or matching problems, the game “lights out” can be viewed as some version

of parity domination [22], etc. To take this to some extent, studying a game

or a puzzle with a direct algorithmic application might give results which are

important even outside of the field.

Furthermore, all people learn and play games and puzzles and this pro-

vides a connecting bond between the academic and the non-academic com-

munity: research results in combinatorial game theory are more likely to be

read, understood and appreciated by people ouside of academia. Recent the-

2

http://www.eprisner.de/WZK/WZK1.html

oretical results have appeared in blogs of popular science (see for example

Tetris is hard, even to approximate: addictive puzzle even stumps comput-

ers, Mathematicians Prove Tetris Is Tough, And Scrabble proved PSPACE-

Complete, How Complicated is Scrabble, Lemmings Forum, The Complexity

of Tip Over and other puzzles). In addition, studying games and puzzles

from an algorithmic point of view can enhance cross-disciplinary research, as

people outside of the field but interested in the said game might read research

results about it.

When we study a 2-player game, one of our main goals is to figure out

which player is going to win. It is worth mentioning here that, for the

class of games that we study (turn-based, finite, perfect information, and no

chance), this is always going to be the same player (or always a draw) if both

players play optimally. The strongest way of determining (and enforcing) the

outcome is by designing a winning strategy : an algorithm that, for any move

of the opponent, can provide us with a choice that if the player follows, she

will eventually win the game (or force a draw if the outcome is a draw). For

most interesting games though, designing an algorithm that works efficiently

is particularly difficult due to the combinatorial explosion of the number

of potential next moves. In some cases, we could prove that a player has

a winning strategy by giving a non-algorithmic proof. Even in that case it

can again be particularly complicated to describe the actual winning strategy.

For many well-known games, such as chess, it is still unknown if either player

has a winning strategy. The complexity to find a winning strategy to a game

is usually what makes the game interesting.

In the case of an 1-player puzzle, the objective is to find the solution to the

puzzle. Likewise, that can be determined by designing a solving alorithm or

giving a non-algorithmic mathematical argument. In some cases it is fruitful

to consider the question whether the puzzle is solvable or not, which can be

tackled in a similar way. Again, solving a puzzle is usually a slow procedure,

3

http://www.nature.com/news/1998/021021/full/news021021-9.html

http://www.nature.com/news/1998/021021/full/news021021-9.html

http://www.scientificamerican.com/article.cfm?id=mathematicians-prove-tetr

http://www.technologyreview.com/view/426724/and-scrabble-proved-pspace-complete/

http://www.technologyreview.com/view/426724/and-scrabble-proved-pspace-complete/

http://physicsbuzz.physicscentral.com/2012/01/how-complicated-is-scrabble.html

http://www.lemmingsforums.com/index.php?topic=432.0

https://www.sciencenews.org/article/complexity-tipover-and-other-puzzles

https://www.sciencenews.org/article/complexity-tipover-and-other-puzzles

because most of the times it requires an exaustive search in the set of potential

plays. For example, though Sudoku is easy for modern computers for 9 × 9

boards, it can quickly become impossible to decide if a larger Sudoku instance

has a solution, because we know of no (significantly) better algorithm than

trying out all solutions. Again, the complexity of a puzzle is what makes it

interesting to play, unless the amusement comes from figuring out a clever

trick to solve it (which renders the puzzle useless once you know the trick).

It thus becomes natural to ask this question: for which games or puzzles is

it always possible to decide on an optimal next move efficiently, even for large

instances? Such questions are usually answered with the tools of computa-

tional complexity theory, a theory which categorizes algorithmic problems as

tractable or intractable depending on the resources needed to solve them.

2 Computational complexity background

2.1 Complexity classes for games and puzzles

In computational complexity theory resources are measured as functions of

the size of the input. Thus any finite-sized puzzle or game is considered

trivially solvable. So, when we study a puzzle or a 2-player game from the

computational complexity point of view, we first need to create a natural un-

bounded generalization of the puzzle or game. For example, in a board game

we consider a variation where the board is of size n× n. Usually, algorithms

with a polynomial (in n) complexity are considered efficient and algorithms

with an expenential complexity are considered inefficient or impractical.

To characterize the computational complexity of a game we can do two

things. If the game is tractable we must demonstrate an efficient algorithm

that solves it. If it is not, we must show that such an algorithm cannot exist.

The second task is generally more complicated (since we must rule out all

4

conceivable algorithms). Usually, evidence of intractability takes the form of

a hardness proof that shows that if an efficient algorithm for the game exists

some widely believed complexity conjectures are proved false.

The most celebrated theory for proving hardness results is the theory of

NP-Completeness, invented by Cook in 1971 ([6]) and further developed by

Karp in 1972 ([26]). From the time that it was introduced, more than 1000

problems have been characterized as NP-complete (i.e. problems that it is

efficient to verify a suggested solution but probably inefficient to compute

it). Furthermore, one of the most important open problems in complexity

theory (and computer science in gereral) is the famous P=NP question (i.e.

whether NP-complete problems admit efficient algorithms).

Similar to the example of NP-completeness, theorists have introduced sev-

eral other hardness complexity classes to characterize possibly more difficult

problems. Some of them are for example the class of PSPACE-complete prob-

lems (problems not requiring a lot of additional memory space to be worked

out but probably requiring a lot of time) and the class of EXPTIME-complete

problems (problems known to be inefficient from a time perspective).

Intractability for games and puzzles takes the form of a hardness proof.

When we say that a game or puzzle is NP-complete, we mean that it is

hard to find its solution efficiently, but easy to verify the correctness of a

given solution. For a PSPACE-complete problem, verification is also hard

but at least polynomial termination of the game is guaranteed. Harder games

without such a termination guarantee usually turn out to be EXP-complete.

Many of the famous 1-player games and puzzles are usually shown to be

NP-Complete. Examples include (among others) sudoku [32], mastermind

[31] and tetris [11]. This is definitely not a coincidence: people enjoy spending

time on an intractable puzzle, such as Sudoku, but also appreciate the ability

to easily check the solution for correctness, once it’s done.

On the other hand, hard 2-player games are usually at least PSPACE-

5

hard. That is so because in order to be sure that some player has a winning

strategy, for each move we have to check all the possible moves that the oppo-

nent can make. Examples include (among others) chess [19], go [28] and oth-

ello [25]. For an extensive list of NP-Complete and PSPACE-Complete games

and puzzles see http://en.wikipedia.org/wiki/List_of_NP-complete_

problems#Games_and_puzzles and http://en.wikipedia.org/wiki/List_

of_PSPACE-complete_problems#Games_and_puzzles.

2.2 The class PSPACE

The class PSPACE is considered the natural home of 2 player games. The

reason is that Quantified Boolean Formula, the variation of satisfiabil-

ity which is complete for PSPACE, can be viewed as a two player game where

players set truth values of the variables used in a formula φ interchangeably.

If φ is satisfied then player A wins, else player B wins.

Definition 2.1. Problem: Quantified Boolean Formula.

Input: A first order formula ∃x1∀x2∃x3 . . . ∀xnφ(x1, x2, x3, . . . xn) with φ be-

ing in CNF.

Question: Is φ satisfiable?

Theorem 2.1. (from [29]) Quantified Boolean Formula is PSPACE-

Complete.

Quantified Boolean Formula can be seen as a two player game: If

player A can pick a truth value for x1 such that for any truth value player

B might pick for x2 then player A can pick a truth value for x3, and so on,

such that in the end the formula is satisfied, player A wins. On the other

hand, if for any x1 that player A picks there is a truth value for x2 that

player B picks, and so on, such that in the end the formula is false, then

player B wins. Quantified Boolean Formula is a very useful problem

6

http://en.wikipedia.org/wiki/List_of_NP-complete_problems#Games_and_puzzles

http://en.wikipedia.org/wiki/List_of_NP-complete_problems#Games_and_puzzles

http://en.wikipedia.org/wiki/List_of_PSPACE-complete_problems#Games_and_puzzles

http://en.wikipedia.org/wiki/List_of_PSPACE-complete_problems#Games_and_puzzles

for proving hardness of 2-player games. Below we present a famous 2-player

game called Geography.

2.2.1 Generalized Geography

The problem Generalized Geography is inspired by the school game of

Geography usually played by two players: Player A starts the game with

introducing the name of a city, for example “New York”. Then player B

should find a city the name of which starts with the last letter of the previous

city played, for example “Kyoto”. Player A continues in the same way,

playing for example “Osaka”, player B playing “Athens” and so on and so

forth. A city that was already played cannot be played again. The first

player unable to find a city to play loses.

Variations of this game can be found in many countries. For example in

the US the game is called “States” (the theme is US states), in Russia it’s

called “Goroda” (trans. “cities”), and in Japan it is called “Shiritori” (trans.

“taking the end”). In a similar game played in Greece, the theme is “songs”.

Definition 2.2. Problem: Generalized Geography.

Input: A directed graph G(V,E) and a starting vertex v ∈ V .

Description: Two players take turns moving a token (initially placed in v)

from a current vertex to a neighboring vertex following the direction of the

arcs. When a vertex is visited, it is removed from the graph. First player

unable to move loses.

Question: Is there a strategy so that player 1 wins?

Generalized Geography is one of the most common games used

in proving PSPACE-hardness for other games. Here we prove that it is

PSPACE-hard.

Theorem 2.2. (from [29]) Directed Generalized Geography is PSPACE-

hard

7

Figure 2.1: Geography game: the variable gadget.

Proof. We reduce Quantified Boolean Formula to Generalized Ge-

ography. The construction is as follows.

Suppose we are given a first order formula ∃x1∀x2 . . . ∃xnφ. We construct

a graph as follows: For each variable xi we construct a diamond-like variable

gadget as shown in figure 2.1. The player who gets to choose whether some

variable xi will be set to true or false is the one responsible to move the token

in the corresponding variable gadget to up or down. We put many variable

gadgets one after the other to represent the alternating choice in truth values

of variables by the players. In the end, player B should choose which clause

player A should satisfy. So we create one vertex cj for each clause and connect

the last gadget vertex with each clause vertex with arcs. We also connect

a clause vertex with the literals that it contains in the following manner: if

variable xi appears in the clause cj as positive, then we connect vertex cj with

the vertex that corresponds to xi = F , otherwise if it appears as negative,

we connect cj with vertex from the variable gadget xi that corresponds to

xi = T . We need to make sure that the choice of clause to be satisfied is

made by player B. So we might need to add an additional vertex after the

variable gadgets depending on the parity of the number of the alternating

quantifiers. For an example see figure 2.2.

There are many variations of Generalized Geography. One is Edge

8

Figure 2.2: A complete example transforming the first order formula∃x∀y∃z((¬x∨ y ∨ z)∧ (x∨ y ∨¬z)) into a geography game. Game starts onthe purple node with player A setting the truth value of variable x. PlayerB on the yellow node picks which clause is to be satisfied.

Generalized Geography, where instead of vertices we remove visited

edges from the graph. Generalized Geography and Edge General-

ized Geography are both proven to be PSPACE-complete even on bipar-

tite or on planar graphs with in/out degrees at most 2 and total degree at

most 3 (see [21] and [28]). Furthermore, both vertex and edge versions can

be played in an undirected graph instead of a directed. In [20], Undirected

Vertex Generalized Geography is proven to be in P, whereas Undi-

rected Edge Generalized Geography is proven PSPACE-complete

(though its restriction to bipartite graphs is shown to be in P).

Theorem 2.3. (from [20]) Undirected Generalized Geography is

solvable in polynomial time.

Proof. We reduce Undirected Generalized Geography to Maximum

Matching. We prove that player 1 has a winning strategy iff v is matched

in every maximum matching of G.

If v is matched in every maximum matching, then consider one such

9

matching M in G. Player 1 will play over edges of M . Observe that the

edges that player 2 plays over also constitute a matching M ′ in G. However,

M ′ can’t be as large as M because then it would be a maximum matching

that doesn’t cover v. So |M | ≥ |M ′| and player 1 wins.

On the other hand, if M is a maximum matching that doesn’t match v,

it should match all of its neighbors (otherwise if there was a u neighbor of

v that was also unmatched, we could include (u, v) in M). So now player 2

can play on the edges of the matching and guarantee a winning strategy.

Deciding whether all maximum matchings match v can be done by com-

puting the maximum matching on G and then on G \ v and comparing the

sizes. If they are equal, there exists a maximum matching in G that doesn’t

match v, otherwise v is always matched by a maximum matching in G.

2.3 Parameterized complexity primer

Classical complexity theory mostly focuses on the distinction between poly-

nomially solvable vs exponentially solvable problems in terms of the size of

the input (n). Parameterized Complexity refines this distinction by intro-

ducing yet another parameter of the problem (k) and classifying problems

as tractable or intractable when this parameter is small or moderately large.

For example, consider the following problems: Vertex Cover, Indepen-

dent Set, Graph Coloring. All 3 are well-known NP-Hard problems.

However, the first two become polynomially solvable if we know that the

size of the solution is a constant, whereas the third is NP-hard even for 3

colors. The class of problems that admit polynomial time algorithms when

the parameter is a constant is called XP.

Parameterized complexity theory comes with its own notions of tractabil-

ity and intractability. A problem is considered Fixed Parameter Tractable

(abbrev. FPT) when exponential growth in the complexity of the problem

10

is confined in a function of just the parameter, in other words if the problem

admits an algorithm with complexity O(f(k) · nc). FPT algorithms are con-

sidered more efficient than their XP counterpart with running time O(nf(k))

for moderately large values of k. Vertex Cover parameterized by the size

of the vertex cover admits an easy FPT algorithm: Pick an uncovered edge

(u, v) and consider covering it first by u and then by v. Remove (u, v) to-

gether with all edges having either u or v as an endpoint. Repeat for up to

k steps. If all edges are covered reply yes. If not, reply no.

On the other hand, Independent Set parameterized by the size of the

independent set is not known to admit an FPT algorithm. It is also unlikely1

to admit one, as this problem is proven to be W-hard. The W-hierarchy is

an analogue of the polynomial hierarchy of classical complexity theory to

parameterized complexity. Classes W[t] are defined as the classes for which

the Weighted t−Normalized Satisfiability problem is complete.

Definition 2.3. Problem: Weighted t−Normalized Satisfiability

Input: A t-normalized boolean expression F , a positive integer k

Parameter : k

Question: Does F have a satisfying truth assignment of weight k? That is,

a truth assignment where precisely k of the variables which appear in F are

true?

In the definition, t stands for the number of alternations between conjunc-

tions and disjunctions in the formula F . A CNF formula can be viewed as

the conjunction of bracketed disjunctions, so it has only one alternation level

and is 1-normalized. If we were to substitute some literals in a CNF formula

with bracketed conjunctions, the result would be a 2-normalized formula.

As with classical complexity theory, in parameterized complexity theory

we have developed a reduction program. If a problem A is FPT-reducible

1unless the exponential time hypothesis (abbrev. ETH) is proven false, that is thereexists a 2o(n) algorithm to solve 3−SAT.

11

to another problem B and we can solve B in FPT time, then we can also

solve A in FPT time. On the other hand, a problem B can be proven to be

W-hard (or AW-hard) when, starting by a known hard problem A we can

reduce A to B via an FPT time reduction, as long as the parameter in B is

a function of just the parameter of A (independent of the input size).

Classes W[1] and W[2] are the most commonly seen, and many param-

eterized problems have been categorized as complete for these classes. In-

dependent Set is an example of a W[1]-complete problem, together with

Clique. On the other hand, Dominating Set and Hitting Set are W[2]-

complete. Yet another complete problem for W[1] is the k-Multicolored

Clique problem, one of the most famous problems that is used to prove

parameterized hardness (see [16]).

A quantified variation of t−normalized SAT Parameterized QBFSATt,

is complete for the class AW[t] [1]. AW[t] for t = 1, 2, . . ., is also a hierarchy

of classes which has been collapsed (see [1]) and is now known as AW[*].

In this parameterized geography of classes, the class XP can be seen as

the parameterized equivalent of the class EXP of classical complexity. A

problem which is hard for XP cannot be in FPT [14].

2.3.1 Structural Parameters

In addition to natural parameters (i.e. the parameter being the size of the

solution), one can consider other parameters - quantities or measures which

when they are small constants the problem can be solved in polynomial

time. One of the most celebrated such examples is the notion of treewidth.

In simple terms, treewidth measures how much a graph resembles a tree. For

a graph, having small treewidth essentially means that it has small cuts. A

graph with small treewidth can be re-written in a tree-like structure called

a tree decomposition where every node is a small cut of the original graph.

Then, one can explore this tree-like structure to develop dynamic programs

12

to solve a number of different problems.

Treewidth is one of the most celebrated structural parameters. First of

all, the problem of finding whether a graph has treewidth at most k is FPT.

Second, by Courcelle’s theorem [8], every problem expressible in monadic

second order logic is FPT parameterized by treewidth and solvable in linear

time when treewidth is a constant.

Researcher have tried to reproduce treewidth’s success and have intro-

duced many different structural parameters. We will analyse clique-width,

since this will prove useful later.

In order to describe clique-width we need to define the following 4 oper-

ations:

1. Construct a new vertex with any color among {1, 2, . . . k};

2. Take the union of two colored graphs of up to k colors each;

3. Take the join between two colors i, j, in other words connect all vertices

of color i to vertices of color j;

4. Rename color i to color j.

A graph G(V,E) is said to have click-width k if it can be constructed

using at most k colors and the above operations.

Graphs of bounded treewidth also have bounded clique-width but not

vice versa, which makes the class of graphs of bounded clique-width more

general [7]. Clique-width was introduced in lack of a measure to describe

classes of dense graphs which might have a simple enough structure. For

example, cliques have unbounded treewidth, but have clique-width 2: First,

construct a vertex of color 1, then repeatedly:

• construct a vertex of color 2 ;

• join all vertices of color 1 to the vertex of color 2;

13

• recolor vertex of color 2 to color 1.

Unfortunately, it is shown that computing the clique-width of a graph is

NP-hard [17], though it is still unknown if it is W-hard. However, the class of

graphs of bounded clique-width is interesting, because many NP-Complete

problems become polynomially solvable for this class. For example, all graph

properties expressible in MSO1 logic, in other words if we allow quantification

over sets of vertices (but not sets of edges) are decidable in linear time in

graphs of constant clique-width [9]. Furthermore, many other problems have

been shown to be tractable for constant clique-width, see [15]. Here we

present an algorithm to solve Hamiltonian Path.

We design a dynamic program that works as follows: we store information

about all different path covers over all pairs of colors of the graph in a multiset

M , then we extend the multiset for each operation that we perform. A path

cover of the graph is a union of vertex disjoint paths that cover the graph. A

Hamilton path is a path cover that is a unique path. Thus, if after computing

M for graph G, if M contains some path cover which is a unique path we

reply yes, else we reply no.

Here we show how to update the multiset after performing one of the

four operations described above. Suppose that we have a multiset M that

contains information about all different path covers of the graph so far. The

operations 1,2, and 4 (add vertex, union, rename) are easy. For the first,

if the added vertex has color i, add the path < i, i > in all path covers of

M . For the second, combine all the path covers of the first with the path

covers of the second. For the last, if we are to rename all vertices of color

i to color j, we construct M ′ by substituting i with j everywhere in M .

The join operation is the most difficult to explain. If we are to connect all

vertices of color i to vertices of color j then potentially some new paths will

be constructed. Thus, for every path cover in M that contains < i, k > and

< l, j >, we add in the multiset this path cover and also a path cover where

14

< k, i >,< j, l > is replaced by < k, l >.

The size of the multiset in any given time is at most O(|V |k2). Further-

more, it is polynomial in the size of the multiset to compute M ′ for each

operation, and the number of operations for any reasonable expression of

clique-width is polynomial in the size of the input. Thus:

Theorem 2.4. Hamiltonian Path can be solved in polynomial time in

graphs of bounded clique-width.

2.3.2 Parameterized Complexity for puzzles and 2-player games

In puzzles and games as well as in regular problems, one can identify impor-

tant parameters that are known to be small for the particular game or puzzle

and examine its complexity from the parameterized point of view. It is quite

usual for games and puzzles to have natural parameters apart from the input

size. For example, in a card game like UNO, one can consider the number of

colors as a parameter (in the real game this number is 4: red, blue, green,

yellow).

In [1], Abrahamson, Downey, and Fellows introduced the concept of a

short or k-move game. This concept can be viewed as a generalization of the

mate-in-k-moves problem from chess, and creates a parameter which can be

applied to every combinatorial game which can be played in rounds. Short

games are also very much in line with the standard heuristic approach to

playing combinatorial games, which involves the evaluation of future moves

to some upper limit. In [1], AW[*] is conjectured to be the natural home of

short 2 player games.

15

3 The Game of UNO

In this section we will analyze known results regarding the game of UNO.

UNO is a popular game of cards. Each card consists of two attributes, number

and color. There are also some cards with special effects. In the beginning of

the game, an equal number of cards is dealt to each of the players. Players

take turns discarding one card at a time from their hand to a middle pile of

used cards. The rule is that the discarded card should match the top card

on the pile either in color or in number. If the player who is about to play

doesn’t have a matching card to play, then she is required to draw one card

from a stack of unused cards. If the new card does not match either, then

the player loses her turn. The first player to get rid of all her cards wins.

We present results from the work by E. Demaine, M. Demaine, N. Harvey,

R. Uehara, T. Uno and Y. Uno [13]. The authors study a simplified version

of the game with no special effect cards and no stack of unused cards (if

a player doesn’t have a card to play she loses the game). However, this

inconvenience is somewhat remedied by the fact that the simplified version

is a perfect information game, where the opponents’ cards are open. Below,

we present the game definitions more formally.

An UNO card is defined by a color-number pair (x, y) ∈ X × Y , where

X = {1, 2, . . . c} is a set of colors and Y = {1, 2, . . . b} is a set of numbers.

In an UNO game there are p players participating (where p ≥ 1). At the

beginning, each player i is given a set of Ci cards. We assume that the set

C =⋃

iCi is a multi-set (i.e. two or more cards can have the same number

and color).

An interesting observation is that the problem can be expressed in a

graph-theoretic form, where the dealt cards are the vertices and two vertices

are connected with an edge if the cards that they represent can be discarded

one after the other in a game play, i.e if they share either the same color

16

or the same number, and, in the many-player version, one belongs in the

hand of the i-th player and the other in the hand of the i + 1( mod p)-th

player. In that sense, the one-player version where a single player is trying

to discard all of her cards, is equivalent to finding a Hamiltonian path in

the card graph, whereas the p-player un-cooperative version is like playing

Generalized Geography on the card graph.

We present results regarding the one-player version of the game. This

problem is NP-Complete when c and b are unbounded and in XP when c

(or equivalently b) is a parameter. We also present some interesting open

problems.

3.1 Single player version of UNO

As it was mentioned before, each game instance can be expressed as an

(undirected) graph, where vertices represent the cards and two vertices are

connected if the cards that they represent share the same color or the same

number. In that sense, solving the solitaire UNO becomes equivalent to

finding a Hamiltonian Path in the graph. Let us now examine the properties

of an UNO graph.

The vertices of a single-player UNO graph represent cards that can be

determined by a pair of attributes (color,number). Thus, UNO graphs can

be redrawn in a grid-like form, where cards of the same number are on the

same line and cards of the same color are on the same column. It is easy to

see that the graph will have edges only within the same lines or the same

columns. In fact, the graph induced by a line or a column of this grid-like

UNO graph will be a clique. An UNO graph can be seen in figure 3.1 (boxes

represent cliques).

Observe that UNO graphs are claw-free graphs: there is no induced K1,3.

Furthermore, it is not difficult to see that an UNO graph is the line graph of

17

Figure 3.1: A set of UNO cardsrepresented as a grid-like graph.Vertices enclosed in boxes are fullyjoint with edges.

Figure 3.2: A Hamiltonian pathin the UNO graph, which can beviewed as a discarding sequence ofthe cards.

Figure 3.3: The bipartite graph whose line graph is shown in figure 3.1.

a bipartite graph, with one part having c vertices labeled {1,2,. . . c} and the

other b vertices labeled {1,2,. . . b}, and each card is represented as an edge

connecting a color to a number. The graph of figure 3.1 is the line graph of

the graph shown in figure 3.3.

This observation, together with the fact that Hamiltonian Path is NP-

Hard in line graphs of bipartite graphs leads to the following theorem.

Theorem 3.1 (From [2]). The single player version of UNO where the num-

ber of colors c and number of numbers b is unbounded is NP-Hard.

The parameterized version of the problem where the number of colors c

18

is the parameter, can be shown to be in XP, by showing that the underlying

card graph has small clique-width and solving Hamiltonian Path with

dynamic programming, see section 2.

Lemma 3.1. Single player UNO graphs have clique-width at most 2c.

Proof. An arbitrary UNO graph can be constructed line by line as follows:

• Construct the vertices of the first line and give them all different colors

among {1, 2, . . . ,c}.

• Assuming that l lines are already constructed using colors {1, 2, . . . , c}and that all vertices in a column share the same color, construct the

vertices in line l+ 1 and give them all different colors within {c+ 1, c+

2, . . . , 2c}.

• Connect vertex with color c+ i with vertex with color c+j for all i ≤ j.

• Connect vertex with color c+ i to vertices with color i.

• Rename color c+ i to i.

From Lemma 3.1 and using standard dynamic programming techniques,

we can solve the single player version of UNO in time polynomial in the size

of the input (assuming that the number of colors is a small constant).

Theorem 3.2. The single player parameterized version of UNO where c is

a parameter is in XP.

An interesting question which was posed in [13] is whether the above

parameterized problem is in FPT. Unfortunately, Hamilton cycle is known

to be W-hard for clique-width (see [18]), so this observation can’t be used

in order to provide an FPT algorithm. On the other hand, one can consider

19

the equivalent problem Edge Hamiltonian Path (find a Hamilton path

on the line graph) on a bipartite graph. If the number of colors (or numbers)

is small, then this graph has a small vertex cover. The reason this fact might

be helpful is the following observation: the large part can be divided into a

small (bounded by a function of c) number of groups of vertices which are

equivalent in terms of neighborhood. This fact could be helpful in order to

provide a kernel for the problem. We are not aware of Edge Hamiltonian

Path being solvable for graphs of small vertex cover, but this might be an

interesting problem to tackle. Furthermore, the UNO graph itself has a very

interesting structure that one could possibly explore in order to provide an

FPT algorithm.

3.2 Many-player uncooperative version

As it was suggested in the introduction of this section, the uncooperative

version of UNO is equivalent to playing Generalized Geography on the

card graph. The card graph is a directed p−partite graph. However, in the

2-player case the graph is undirected since two cards u ∈ C1 and v ∈ C2 can

be discarded either in the order u, v or in the order v, u.

As it was stated in section 2, Generalized Geography is in P for

undirected graphs. As a result, 2-player uncooperative UNO is also in P. An

interesting question is the case of p ≥ 3. We conjecture that this problem is

PSPACE-Complete. Namely, one can try to provide a reduction from Edge

Generalized Geography on directed bipartite graphs, which is a known

PSPACE-Complete problem (see [20]), following similar proof ideas as the

ones that appear in [13].

Last, yet another interesting variation to consider (given that the p-player

game is indeed proven to be PSPACE-hard for p ≥ 3) is the game parame-

terized by the number of rounds played. It can be shown that it this problem

20

is in XP by the trivial algorithm that creates the game DAG for all possible

paths of length up to k with k being the number of rounds (size O(nk) ) and

analyzing the outcome starting from the leaves and working all the way to

the root. It is interesting to explore whether there exists an FPT algorithm

for this problem.

4 The Game of SET

The game of SET is a card game in which players seek to form Sets of cards

from a special deck. Each card from this deck has a picture with 4 attributes

(shape, color, number, shading), and each attribute can take one of 3 values

(for example the shape can be oval, squiggle, or diamond, the color can be

blue, green, or purple, etc). To create a Set2, the player needs to identify 3

cards in which, for each attribute independently, either all cards agree on the

value, or they constitute a rainbow of all possible values. In a single round of

the normal play, 12 cards are dealt and the players seek (simultaneously) a

Set. The first player to find a Set wins the 3 cards constituting it. Then 3 new

cards are dealt in the old ones’ places and the game continues with the next

round. In the unlucky event that no Set exists among the 12 cards, players

deal 3 more from the stack. The game finishes when there are no more cards

to deal. The player acquiring the most cards wins. For more information

regarding the game and its rules as well as for other variations see the official

website of the game http://www.setgame.com/set/index.html.

The game of SET has gained remarkable attention and popularity (espe-

cially among mathematicians) as well as many awards. The game has been

the subject of both educational and technical research. A broad set of ed-

ucational activities has been suggested, a collection of which can be found

2The first letter of Set is capitalized to avoid a mix-up with the notion of mathematicalset

21

http://www.setgame.com/set/index.html

in [24]. Furthermore, the game has been studied extensively from a more

technical mathematical point of view, considering questions like “what is the

maximum number of cards with n attributes and 3 values that can be laid

such that no Sets are formed” [10], or “for fixed n, how many non-isomorphic

collections of n cards are there” [5]). In [33], many other similar questions

are posed. In addition to the game’s popularity, one motivation for this in-

tense study is that the problem has a very natural alternative mathematical

formulation: if one describes the cards as four-dimensional vectors over the

set {0, 1, 2}, then a Set is exactly a collection of three collinear points, that

is, three points whose vectors add up to 0(mod3). Nevertheless, the first

and - to the best of our knowledge - only attempt to consider the game’s

computational complexity was made by Chaudhuri et al [3] in 2003, who

showed that a generalization of the game is NP-complete.

In order to study a game from the viewpoint of computational complexity

theory, one needs to define a natural generalization of the game in question

(as the original constant size game always has constant time and space com-

plexity). In a round of SET, there are 3 parameters to consider: the number

of cards m, the number of attributes n and the number of values k (in the

original game m = 12, n = 4 and k = 3). A subset of k cards will be consid-

ered to be a Set if for all attributes, values either all agree or all differ. Of

course these three parameters are not totally independent as the number of

cards m is upper-bounded by kn. In any multi-round version of the game,

an extra parameter r being the number or rounds is added.

4.1 One round of SET

Chaudhuri et al. in [3] consider a single-round version of SET. We are dealt m

cards, each with n attributes that can take one of k values and we need to find

a set of size k. We call this problem k-Value 1-Set. Their main insight is

22

Figure 4.1: An example of an n−dimensional perfect matching and of a Setof cards with n attributes and k values each.

that this problem can be seen as a hypergraph problem. Specifically, one may

construct a hypergraph on n ·k vertices, each representing an attribute-value

pair. Now, cards can be represented as hyperedges, by including in each

hyperedge the k values that describe the corresponding card’s attributes.

See figure 4.1. It is not hard to see that a perfect matching in this n-partite

hypergraph corresponds to a Set in the original instance. On the other hand,

some Sets do not correspond to perfect matchings, because all cards may

share the same value for some attributes. Nevertheless, Chaudhuri et al.

have established that the two problems have the same complexity and finding

a Set is essentially algorithmically equivalent to find a perfect matching in

this hypergraph.

Theorem 4.1. Perfect Multi-Dimensional Matching is polynomially

reducible to k-Value 1-Set.

Proof. Given an instance of Perfect Multi-Dimensional Matching

namely an n−partite graph G(V1 ∪ V2 ∪ . . . ∪ Vn, E) with each part of equal

size k, we construct an n−partite graph G′(V ′1 ∪ V ′

2 ∪ . . .∪ V ′n, E

′) as follows:

we add a k + 1−th value (vertex) vik+1in each set Vi to create V ′

i . Also

E ′ = E ∪ e′, where e′ = {v1k+1, v2k+1

, . . . , vnk+1} is a multiedge that spans

through the newly added vertices vik+1.

Now, if G has a perfect multidimensional matching M then G′ also has

a multidimensional matching M ′ = M ∪ e. Since M ′ is a multidimensional

23

matching, it is also a Set in G′. On the other hand, if G′ has a Set M ′, then

this has to be a multidimensional matching since no other multiedge apart

from e′ goes through the vertices vik+1. So M ′ \ e′ should be a multidimen-

sional perfect matching in G.

In what follows we will exploit this connection between k-Value 1-Set

and Perfect Multi-Dimensional Matching to analyze the complexity

of finding a Set with respect to the three relevant parameters m,n, and k. If

k is unbounded, Theorem 4.1 implies that k-Value 1-Set is NP-hard even

for just 3 attributes. If the cards have only 2 attributes, the game is in P

as this problem is equivalent with finding a perfect matching or a star in a

bipartite graph. On the other hand, if n is unbounded but the number of

values k is considered as a parameter, the problem is in XP (by the trivial

algorithm that enumerates all size-k sets of cards and checking whether any

of them constitutes a Set). An interesting open question is whether the triv-

ial algorithm can be improved to an FPT algorithm. In [27], this question

was answered negatively, by proving that the problem is W[1]-hard. This

W-hardness proof applies to Perfect Multi-Dimensional Matching

as well, proving that Perfect Multi-Dimensional Matching parame-

terized by the size of the dimensions k (while the number of dimensions n is

unbounded) is W[1]-hard. This result may be of independent interest, as this

is a natural parameterization of a classic problem that has not been consid-

ered before. The only relevant parameterized result known is that Maximum

Multi-Dimensional Matching parameterized by the size of the matching

and the number of dimensions is FPT (first established in [14] and further

improved in [4].

24

4.2 Multi-round versions

In order to study the game from a more realistic point of view, it is necessary

to study multi-round variations. The game is already NP-Complete for one

round for n ≥ 3. On the other hand, if k = 2 and n is unbounded, the answer

is trivial as any two cards form a Set. So it makes sence to focus our attention

to the case where the number of values k is 3 and n is unbounded (k = 3

is also interesting, since this is the value of the parameter k in the actual

game). We remind the reader that there is a polynomial time algorithm to

find whether there exists at least one Set, in other words to play just one

round. The complexity stays the same even if we consider the question of

enumerating all Sets. This generalizes the daily puzzles found either on the

official website of SET or in the New York Times. In these puzzles we are

given m cards and need to find the maximum number of Sets assuming that

we don’t remove any cards from the table after finding a Set.

It becomes interesting to ask the same question for a multi-round game,

where cards are gradually removed. This corresponds to the CO-OP version

of the game, where players have to cooperate in order to find the maximum

number of available Sets given that cards of found Sets are removed from

the table (see rules for CO-OP SET for the rules of this variation). We

call this variation Max 3-Value r-Set. Another interesting variation is

the corresponding minimization version, where we are looking for the mini-

mum number of Sets that once picked destroy all existing Sets. We call this

variation Min 3-Value r-Set. Both problems can be seen as special cases

of more general packing and covering problems. In the maximization ver-

sion, one is looking for a maximum 3-set packing, while in the minimization

version one is looking for a minimum independent edge dominating set (or

equivalently a Minimum Maximal Matching) in a 3-uniform hypergraph.

It is interesting to examine whether these problems remain NP-Hard even

on instances that correspond to the SET game. This question is answered

25

http://www.setgame.com/set/index.html

http://www.nytimes.com/ref/crosswords/setpuzzle.html

http://www.setgame.com/content/co_op_set

positively in [27]. For the latter problem, it is easy to show a reduction from

Independent Edge Dominating Set on graphs.

Theorem 4.2. Min 3-Value r-Set is polynomially reducible to Indepen-

dent Edge Dominating Set.

Proof. Given an instance of Independent Edge Dominating Set (a

graph G(V,E) and a number r), we create an instance of Min 3-Value

r-Set of |V | + |E| cards with |V | dimensions each, such that if G has an

edge dominating set of size at most r then there exist at most r Sets which

once picked up destroy all other Sets. Cards will be represented by vectors

in F|V |3 .

The construction is as follows: For each vertex i ∈ V we create a card

where all coordinates are 0 except from the value of the ith coordinate which

is equal to 1. Furthermore, for each edge (i, j) ∈ E we create a card where

all coordinates are 0 except from the values of coordinates i and j which are

equal to 2.

Observe that the only Sets formed correspond directly to edges in G.

Picking a Set corresponding to edge (i, j) eliminates the cards corresponding

to vertices i, j (together with the card corresponding to edge (i, j)). This

move causes the elimination of any potential Set containing cards correspond-

ing to vertices i and j. Thus an edge dominating set of size at most r in G

corresponds to an equal number of Sets overlapping all other Sets. On the

other hand the smallest number of Sets that overlap all other Sets is equal

to the minimum edge dominating set.

From the parameterized point of view, if one considers as the parameter

the number of rounds played r, natural parameterizations of the two problems

are to ask whether there exist at least r mutually disjoint Sets, or whether

there exist at most r Sets to destroy all Sets. The former is shown to be

26

Fixed Parameter Tractable by Chen et al. [4]. The latter is an interesting

open question, and it is tackled in [27] through the close connection with

Independent Edge Dominating Set on graphs that was presented.

4.3 A two-player game

In this section, we analyse a potentially interesting two-player game which

we call 2P 3-Value Set. Suppose that an arbitrary set of cards is on the

table and two opposing players take turns playing. Each player may select

three cards that form a Set and remove them from play. No additional cards

are dealt. The game goes on until a player is unable to find a Set, in which

case she loses.

Unlike the solitaire games Max 3-Value r-Set and Min 3-Value r-

Set, here players must exercise some strategic thinking: each player is trying

not only to maximize the number of Sets she will collect but also to prevent

the opponent from forming a set.

This game can be seen as a restriction of the game Arc Kayles in 3-

uniform hypergraphs (where hyperedges should be valid Sets). Arc Kayles

is defined below.

Definition 4.1. Problem: Arc Kayles.

Input: A (hyper-)graph G(V,E).

Description: Players take turns picking edges from G. When an edge e is

picked, e together with all incident vertices to e are removed. First player

unable to pick an edge loses.

Question: Is there a winning strategy for player A?

The complexity of Arc Kayles is currently unknown even on graphs and

it has been a long-standing open question since the PSPACE-Completeness

of its sibling problem Node Kayles was established in [30]. The multi-

round 2-player version of SET is at least as hard as Arc Kayles, following

27

the reduction of Min 3-Value r-Set to Independent Edge Dominat-

ing Set. A slightly more general version of Arc Kayles is mentioned

to be PSPACE-complete in [30], while the natural generalization of Arc

Kayles to hypergraphs with unbounded hyperedge size is PSPACE-hard by

the complexity of poset games [23].

The 2-player SET problem on graphs is a natural restriction of Arc

Kayles, though this version of SET, unlike its hypergraph counterpart turns

out to be trivial: if the size of the Sets (i.e. the number of different values)

is 2 then any 2 cards form a Set; thus the 2-player problem is equivalent to

Arc Kayles on complete graphs and becomes a simple matter of parity of

the number of nodes.

A possible direction is to study 2P 3-Value Set from a parameterized

point of view. An interesting question is whether player 1 has a winning

strategy in r moves. In [1] it was established that the r-move parameterized

version of Node Kayles is AW[*]-hard. This question is addressed in [27],

where we prove that the problem is FPT. Settling that 2-player SET is FPT

parameterized by r implies the same result for Arc Kayles on graphs.

28

References

[1] Karl R. Abrahamson, Rodney G. Downey, and Michael R. Fellows.Fixed-parameter tractability and completeness iv: On completeness forw[p] and pspace analogues. Ann. Pure Appl. Logic, 73(3):235–276, 1995.

[2] Alan A Bertossi. The edge Hamiltonian path problem is NP-complete.Information Processing Letters, 13(4):157–159, 1981.

[3] K. Chaudhuri, B. Godfrey, D. Ratajczak, and H. Wee. On the complex-ity of the game of Set. Manuscript, 2003.

[4] Jianer Chen, Qilong Feng, Yang Liu, Songjian Lu, and Jianxin Wang.Improved deterministic algorithms for weighted matching and packingproblems. Theor. Comput. Sci., 412(23):2503–2512, 2011.

[5] Ben Coleman and Kevin Hartshorn. Game, set, math. MathematicsMagazine, 85(2):83–96, 2012.

[6] S.A. Cook. The complexity of theorem-proving procedures. In Proceed-ings of the third annual ACM symposium on Theory of computing, pages151–158. ACM, 1971.

[7] Derek G Corneil and Udi Rotics. On the relationship between clique-width and treewidth. SIAM Journal on Computing, 34(4):825–847, 2005.

[8] Bruno Courcelle. The monadic second-order logic of graphs. I. Recog-nizable sets of finite graphs. Information and computation, 85(1):12–75,1990.

[9] Bruno Courcelle, Johann A Makowsky, and Udi Rotics. Linear time solv-able optimization problems on graphs of bounded clique-width. Theoryof Computing Systems, 33(2):125–150, 2000.

[10] Benjamin Lent Davis, Davis, and Diane Maclagan. The card game set,2003.

[11] E. Demaine, S. Hohenberger, and D. Liben-Nowell. Tetris is hard, evento approximate. Computing and Combinatorics, pages 351–363, 2003.

29

[12] Erik D Demaine, Martin L Demaine, Nicholas J. A Harvey, RyuheiUehara, Takeaki Uno, and Yushi Uno. UNO is hard, even for a singleplayer. Manuscript, 2013.

[13] Erik D Demaine, Martin L Demaine, Ryuhei Uehara, Takeaki Uno, andYushi Uno. UNO is hard, even for a single player. In Fun with Algo-rithms, pages 133–144. Springer, 2010.

[14] Rodney G. Downey and Michael R. Fellows. Parameterized Complexity.Springer-Verlag, 1999. 530 pp.

[15] Wolfgang Espelage, Frank Gurski, and Egon Wanke. How to solve NP-hard graph problems on clique-width bounded graphs in polynomialtime. In Graph-theoretic concepts in computer science, pages 117–128.Springer, 2001.

[16] Michael R. Fellows, Danny Hermelin, Frances A. Rosamond, andStephane Vialette. On the parameterized complexity of multiple-intervalgraph problems. Theor. Comput. Sci., 410(1):53–61, 2009.

[17] Michael R Fellows, Frances A Rosamond, Udi Rotics, and Stefan Szeider.Clique-width minimization is NP-hard. In Proceedings of the thirty-eighth annual ACM symposium on Theory of computing, pages 354–362.ACM, 2006.

[18] Fedor V Fomin, Petr A Golovach, Daniel Lokshtanov, and SaketSaurabh. Clique-width: on the price of generality. In Proceedings ofthe twentieth Annual ACM-SIAM Symposium on Discrete Algorithms,pages 825–834. Society for Industrial and Applied Mathematics, 2009.

[19] A.S. Fraenkel and D. Lichtenstein. Computing a perfect strategy forn × n chess requires time exponential in n. Journal of CombinatorialTheory, Series A, 31(2):199–214, 1981.

[20] Aviezri S Fraenkel, Edward R Scheinerman, and Daniel Ullman. Undi-rected edge geography. Theoretical Computer Science, 112(2):371–381,1993.

[21] Michael R Garey and David S Johnson. Computers and Intractability,volume 174. freeman New York, 1979.

30

[22] John Goldwasser and William Klostermeyer. Maximization versions of”lights out” games in grids and graphs. Congressus Numerantium, pages99–112, 1997.

[23] Daniel Grier. Deciding the winner of an arbitrary finite poset gameis pspace-complete. In Fedor V. Fomin, Rusins Freivalds, Marta Z.Kwiatkowska, and David Peleg, editors, ICALP (1), volume 7965 ofLecture Notes in Computer Science, pages 497–503. Springer, 2013.

[24] Set Enterprises Inc., editor. Mathematics Workbook: How to Use theSET Game in the Classroom. Set Enterprises, Inc., 2011.

[25] S. Iwata and T. Kasai. The Othello game on an n × n board is PSPACE-complete. Theoretical Computer Science, 123(2):329–340, 1994.

[26] R.M. Karp. Reducibility among combinatorial problems. In Proceedingsof Symposium on the Complexity of Computer Computations, pages 85–103. The IBM Research Symposia Series, Plenum Press, 1972.

[27] Michael Lampis and Valia Mitsou. The computational complexityof the game of set and its theoretical applications. arXiv preprintarXiv:1309.6504, 2013.

[28] D. Lichtenstein and M. Sipser. Go is polynomial-space hard. Journal ofthe ACM (JACM), 27(2):393–401, 1980.

[29] Christos M. Papadimitriou. Computational complexity. Addison-Wesley,Reading, Massachusetts, 1994.

[30] Thomas J. Schaefer. On the complexity of some two-person perfect-information games. J. Comput. Syst. Sci., 16(2):185–225, 1978.

[31] J. Stuckman and G.Q. Zhang. Mastermind is NP-complete. Arxivpreprint cs/0512049, 2005.

[32] Y. Takayuki. Complexity and completeness of finding another solutionand its application to puzzles. Master’s thesis, Citeseer, 2003.

[33] Mike Zabrocki. The joy of set, 2001.

31

The Computational Complexity of Games and Puzzles...time mathematical and algorithmic depth, they...

Documents

Transcript of The Computational Complexity of Games and Puzzles...time mathematical and algorithmic depth, they...