Proofs

Exercise 7.1. You’re about to leave on holiday, but you forgot to pack socks! You’ve ran back to your room, but the light’s burnt out, so you can’t see the colours of your socks.

You know that in your sock drawer that there are ten pairs of green socks, ten pairs of black socks, and eleven pairs of blue socks (all mixed up.)

How many of your socks do you need to take before you can be sure you’ve grabbed at least one matching pair?

Exercise 7.2. You’re a mad scientist! You’ve conducted an experiment on yourself to get superpowers. It worked, but to keep the powers you need to take two different tablets each day; if you forget one, or take more than one of either type, you’ll, um, explode.

Unfortunately, they look completely identical, and you’ve just dropped your last two days of supply (four tablets) on the floor.

What can you do?

Proofs: Motivation and Fundamentals

Throughout this coursebook, we’ve stressed the importance of clear, logical explanations for why things are true. In our previous chapters, we motivated the need for these arguments in a number of ways:

In our answer to Exercise 1.1, we said that making a good, logical argument is often a useful skill in the workplace! In real life, you will often have to work for people who aren’t particularly “tech-y” and will expect you to be able to do literally impossible tasks. Being able to give a clear and patient explanation for why something can never happen is a good skill to have!
In our argument for Claim 1.1, we explained why an argument that just checks a few hundred or even a few thousand cases is often not enough in mathematics and computer science. As we saw then, there are tons of examples of claims that are true for all of the small values you’d like to check, but that suddenly become completely false when you get to large values (i.e. the ones that you could encounter when running code!)
Finally, in chapter 4 we studied algorithms, step-by-step processes that we could easily turn into computer programs. When doing so, we often added in discussions about why these algorithms were “guaranteed” to work or why these algorithms would never “crash;” these arguments made it so that we could trust these processes to always work.

This is something that you do in real life as a computer scientist all the time! Good code involves carefully thinking about all of the possible inputs you could be given, and for each case ensuring that your code is well-behaved. Writing such code and documenting it in a way that others can understand is a key part of being a professional programmer!

These motivations aren’t just hypotheticals! Here’s a pair of stories that illustrate why knowing how to make logical arguments is a useful skill:

Story 7.1. Janelle Shane is a researcher in optics who works with neural networks and machine learning. Roughly speaking, the way that a neural network works is the following:

Take a bunch of examples of the thing you want the neural network to recognize, as well as a bunch of nonexamples.
“Show” the neural network these examples and nonexamples.
The neural network will then come up with a set of rules that it believes describes what it means for.

“Show” is hard to define in words, but there are some great YouTube videos: see SethBling’s Mar I/O videos for a fun and accessible introduction!

People are often tempted to just use the results of a neural network directly, without checking whether its discovered rules make sense. Doing so, as Janelle notes, leads to some fascinatingly weird behaviour:

“There was an algorithm that was supposed to sort a list of numbers. Instead, it learned to delete the list, so that it was no longer technically unsorted.”
” In 1997, some programmers built algorithms that could play tic-tac-toe remotely against each other on an infinitely large board. One programmer, rather than designing their algorithm’s strategy, let it evolve its own approach. Surprisingly, the algorithm suddenly began winning all its games. It turned out that the algorithm’s strategy was to place its move very, very far away, so that when its opponent’s computer tried to simulate the new greatly-expanded board, the huge gameboard would cause it to run out of memory and crash, forfeiting the game.”
“An algorithm that was supposed to figure out how to apply a minimum force to a plane landing on an aircraft carrier. Instead, it discovered that if it applied a huge force, it would overflow the program’s memory and would register instead as a very small force. The pilot would die but, hey, perfect score.”

In short: just because something works for a bunch of examples doesn’t mean it’s good!

Story 7.2. A somewhat darker story on the importance of being able to read and understand proofs comes from the NSA, and something called a Dual Elliptic Curve Deterministic Random Bit Generator. This was an algorithm, designed by the NSA (a USA security agency,) that they claimed was a cryptographically secure way to generate random numbers.

However, this algorithm was one that the NSA had built a “backdoor” into. That is, they designed the algorithm around certain secret values so that anyone with knowledge of those values (i.e. the NSA) could predict the randomly-generated numbers with a higher-than-normal degree of accuracy and thereby defeat cryptographic systems using this algorithm.

The NSA managed to get their algorithm used as a “standard” for over seven years. However, many mathematicians and computer scientists were suspicious of the NSA’s algorithm from the very start, in large part because it was not something that was proven to work! Their research led to the eventual revocation of the NSA’s algorithm as a standard.

See the New York Times for an article summarizing the scandal.

Also, check out Kleptography: Using Cryptography Against Cryptography and Cryptanalysis of the Dual Elliptic Curve Pseudorandom Generator, if you’d like to read through some research papers describing the NSA’s algorithm/its weaknesses.

So: we have some motivation for why we would want to write clear, logical arguments. The next question for us, then, is what counts as a valid argument?

Every major field of study in academia, roughly speaking, has a way of “showing” that something is true. In English, if you wanted to argue that the whale in Melville’s Moby Dick was intrinsically tied up with mortality, you would write an essay that quoted Melville’s story alongside some of of his other writings and perhaps some contemporary literature, and logically argue (using these quotations as “evidence”) that your claim holds. Similarly, if you were a physicist and you wanted to show that the speed of light is roughly $3.0 \cdot 10^8$ meters per second, you’d set up a series of experiments, collect data, and see if it supports your claim.

In mathematics, a proof is an argument that mathematicians use to show that something is true. However, the concepts of “argument” and “truth” aren’t quite as precise as you might like; certainly, you’ve had lots of “arguments” with siblings or classmates that haven’t proven something is true!

In mathematics, the same sort of thing happens: there are many arguments that (to an outsider) look like a convincing reason for why something is true, but fail to live up to the standards of a mathematician. In Chapter 1, we already studied a pair of “failed” proofs: namely, our first attempts at proving Claim 1.1 and Exercise 1.1. We said that these arguments failed because they did not work in general: that is, they only considered a few cases, and did not consider all of the possible ways to put dominoes on a chessboard, or to pick a pair of integers.

This, however, is not the only way in which a proof might fail us! Here’s another dodgy proof:

Claim 7.1. Given any two nonnegative real numbers $x, y$ , we have $\frac{x+y}{2} \geq \sqrt{xy}$ .

“Bad” proof:

\begin{array}{rl} \sqrt{xy} &\leq \frac{x+y}{2} \\ xy &\leq \frac{(x+y)^2}{4} \\ 4xy &\leq (x+y)^2 \\ 4xy &\leq x^2 + 2xy + y^2\\ 0 &\leq x^2 - 2xy + y^2 \\ 0 &\leq (x-y)^2.\\ \end{array}

\square

A defense of the “bad” proof: We’re not using examples; we’re working in general! Also, we totally showed that this claim is true: after all, we started with our claim and turned it into a true thing!

Why this proof is not acceptable in mathematics:

We have no idea what $x$ and $y$ are! In particular, by plugging in some sample values of $x$ and $y$ , we can see that this is sometimes true and sometimes false: for $x=1, y=4$ we do indeed have $\sqrt{xy} = \sqrt{4} = 2 \leq \frac{1+4}{2} = 2.5$ , but for $x=-1, y=-1$ the claim $\sqrt{(-1)\cdot(-1)} \leq \frac{-1-1}{2}$ is very false, as $-1 \not\leq 1$ ! So, to do anything here, we first need to know what $x$ and $y$ are. That is: we need to define what set $x,y$ come from!
This proof is “backwards:” that is, it starts by assuming our claim is true, and from there gets to a true statement. This is not a logically sound way to make an argument! For example, if we assume that 1=2, we can easily deduce a true statement by multiplying both sides by 0:

\begin{array}{ll} & 1 = 2 \\ \Rightarrow & 0 \cdot 1 = 0 \cdot 2 \\ \Rightarrow & 0 = 0. \\ \end{array}

This doesn’t prove that 1=2, though! As we said above, proofs need to start with true things, and then through argument get to what you’re trying to show.

Finally, this proof has no words! This flaw in some sense is why the other two flaws could exist: if you had to write out in words what $x$ and $y$ were, and how you went from one line to the next, it would probably become clear that this proof was written backwards and also that we have to be careful with what $x, y$ are allowed to be.

This sort of thing is often easy to fix, though! If your proof is “backwards,” simply try starting from the end and reasoning your way backwards to the start. If your logic was flawed, somewhere along the way you’ll encounter a nonreversible step.

For example, if we tried to reverse our proof that $1=2$ , we could go from $0=0$ to $0 \cdot 1 = 0 \cdot 2$ , but would see that we can’t “divide by 0” to get to the desired conclusion (and thus that this doesn’t work.)

With this in mind, let’s try a “fixed” version of this proof:

Theorem 7.1. (The arithmetic mean-geometric mean inequality.) For any two nonnegative real numbers $x,y$ , we have that the geometric mean of $x$ and $y$ is less than or equal to the arithmetic mean of $x$ and $y$ : in other words, we have that

\sqrt{xy} \leq \frac{x+y}{2}

Proof. Take any pair of nonnegative real numbers $x,y$ . We know that any squared real number is nonnegative: so, in specific, we have that the square of $x-y$ , $(x-y)^2$ is nonnegative. If we take the equation $0 \leq (x-y)^2$ and perform some algebraic manipulations, we can deduce that

\begin{array}{rl} 0 &\leq (x-y)^2\\ \Rightarrow 0 &\leq x^2 - 2xy + y^2 \\ \Rightarrow 4xy &\leq x^2 + 2xy + y^2\\ \Rightarrow4xy &\leq (x+y)^2 \\ \Rightarrow xy &\leq \frac{(x+y)^2}{4}. \\ \end{array}

Because $x$ and $y$ are both nonnegative, we can take square roots of both sides to get

\sqrt{xy} \leq \frac{|x+y|}{2}.

Again, because both $x$ and $y$ are nonnegative, we can also remove the absolute-value signs on the sum $x+y$ , which gives us

\sqrt{xy} \leq \frac{x+y}{2},

which is what we wanted to prove.

\square

Much better! This proof doesn’t have logical flaws, it’s easier to read, and we’ve justified all of our steps so that even a skeptical reader would believe us.

Direct Proofs

In our past chapters, we’ve seen a number of useful techniques for writing such arguments while studying new mathematical concepts. In this chapter, we’re reversing this structure a bit: instead of focusing on new mathematics, we’re going to focus our sections on new argument techniques. By doing this, we’re hoping that the arguments you’ve been reading in the past few chapters will become ones that you’re comfortable with writing and reading on your own!

The proof we just wrote serves as a nice example of the first proof technique we’ll study in this class: the idea of a direct proof. To prove that a given claim is true, the most straightforward path we’ve used in this class has been the following:

Write down things that you know are true that relate to your claim. This typically includes the definitions of any terms referred to in the definition, any results from class or the tutorials/assignments that look related, and maybe some fundamental facts you know entering this class about numbers.
Combine those things by using logic or algebra to create more things you know are true.
Keep doing this until you get to the claim!

A particularly common form of direct proof comes up when people want to prove a statement of the form “if A holds, then B must follow” for two propositions A and B (or equivalently, “A implies B,” which we write in symbols as $A \Rightarrow B$ .)

To write a direct proof of such a statement, we proceed as before, but also throw in the assumption that A holds! That is, to prove “A implies B,” we assume thaat A is true, and try to combine this assumption with other known true things to deduce that B is true. (Logically speaking, this is because $A \Rightarrow B$ holds as long as we’re never in the situation where $A$ is true and $B$ is false. Therefore, if we can show that $A$ being true forces $B$ to also be true, then we know that our claim must hold!)

We illustrate this with a pair of examples here:

Claim 7.2. If $n$ is an odd integer, then $n^2$ can be written as a multiple of 4 plus one.

Proof. We start by “assuming” the part by the “if:” that is, we assume that $n$ is an odd integer. By definition, this means that we can write $n = 2k+1$ for some other integer $k$ .

We seek to study $n^2$ . By our observation above, this is just $(2k+1)^2 = 4k^2 + 4k+1 = 4(k^2+k)+1$ . This is a multiple of 4 plus 1, as claimed! Therefore we have completed our proof.

\square

Claim 7.3. If $G$ is a graph, then $G$ must have an even number of vertices with odd degrees; that is, it is impossible to have a graph $G$ with an odd number of vertices with odd degrees.

Proof. We start this proof by thinking about all of the facts that we know about graphs and degrees. There’s one result that should immediately jump to mind, namely the degree-sum formula: for any graph $G$ ,

\boxed{\begin{array}{c} \textrm{The sum of the degrees} \\ \textrm{of the vertices in G} \\ \end{array}}

\boxed{\begin{array}{c} \textrm{Twice the number} \\ \textrm{of the edges in G} \\ \end{array}}

Let’s use this result! Specifically: in this problem, we’re studying vertices with odd degree. How can we turn this result into something that talks about odd-degree vertices? Well: from our work in our first chapter, we know that every integer is either even or odd. If we apply this idea to our degree-sum formula, we get the following:

\boxed{\begin{array}{c} \textrm{The sum of the} \\ \textbf{odd}\textrm{ degrees of} \\ \textrm{vertices in G} \\ \end{array}}

\boxed{\begin{array}{c} \textrm{The sum of the} \\ \textbf{even}\textrm{ degrees of} \\ \textrm{vertices in G} \\ \end{array}}

\boxed{\begin{array}{c} \textrm{Two times the} \\ \textrm{number of the} \\ \textrm{edges in G} \\ \end{array}}

We wanted to study the odd-degree vertices, so let’s get them isolated on one side of our equation:

\boxed{\begin{array}{c} \textrm{The sum of the} \\ \textbf{odd}\textrm{ degrees of} \\ \textrm{vertices in G} \\ \end{array}}

\boxed{\begin{array}{c} \textrm{Two times the} \\ \textrm{number of the} \\ \textrm{edges in G} \\ \end{array}}

\boxed{\begin{array}{c} \textrm{The sum of the} \\ \textbf{even}\textrm{ degrees of} \\ \textrm{vertices in G} \\ \end{array}}

On the right-hand side, notice that we have an even number (twice the number of edges) minus a bunch of even numbers (the degrees of all even-degree vertices in $G$ ); therefore, the right-hand-side is even!

As a result, the left-hand-side is also even. But this means that the sum of all odd-degree vertices is an even number.

We know that summing an odd number of odd numbers is always odd, and that summing an even number of odd numbers is always even. Because the left-hand side is even, we know we must be in the second case; that is, that we have an even number of vertices of odd degree, as claimed!

\square

Proof by Cases

Our second proof technique is best illustrated by an example:

Theorem 7.2. For every natural number $n$ , if $n$ is a square number, then $n \not\equiv 2 \mod 3$ .

An integer $n$ is said to be a square number if we can write $n = k^2$ for some other integer $k$ . For example, $0,1,4,9,16,25\ldots$ are all square numbers!

Proof. As always, we start by expanding our definitions. If $n$ is a square number, then by definition we know that $n = k^2$ for some integer $k$ .

From here, we use the particularly clever trick that this section is devoted to: we consider cases. That is: we want to look at what $n$ is congruent to modulo 3.

We don’t have any information about what $n$ or $k$ theirselves are modulo 3, so it would seem hard to introduce this information into our proof! However, by the definition of the modulus operator $\%$ , we know that every number is congruent to one of 0, 1 or 2 modulo 3. By definition, then, this means that we most always be in one of the following three cases: $k \equiv 0 \mod 3$ , $k \equiv 1 \mod 3$ or $k \equiv 2 \mod 3$ .

In each of these cases, we can now expand our definitions and use our knowledge of modular arithmetic to proceed further:

Assume that we’re in the $k \equiv 0 \mod 3$ case. In this situation, we have that $k = 3m$ for some $m$ , which means that $k^2 = 9m^2 = 3(3m^2)$ is also a multiple of 3. Thus, $k^2 \equiv 0 \mod 3$ .
Now, assume instead that we’re in the $k \equiv 1 \mod 3$ case. In this situation, we have that $k \equiv 3m + 1$ for some $m$ , which means that $k^2 = 9m^2 + 6m + 1= 3(3m^2 + 2m) + 1$ . Thus, $k^2 \equiv 1 \mod 3$ .
Finally, consider the last remaining case, where $k \equiv 2 \mod 3$ . In this situation, we have that $k \equiv 3m + 2$ for some $m$ , which means that $k^2 = 9m^2 + 12m + 4= 3(3m^2 + 4m + 1) + 1$ . Thus, $k^2 \equiv 1 \mod 3$ .

In all three of these cases, we’ve seen that $n = k^2$ is not congruent to 2 modulo 3. These cases cover all of the possibilities! Therefore, we know that $n$ is simply never congruent to 2 modulo 3 in any situation, and have therefore proven our claim.

\square

The trick to the proof above was that we were able to introduce additional information about $k$ (namely, its remainder on division by 3) by simply considering all possible remainders as separate cases! This technique, called proof by cases, is a powerful technique in several situations:

Whenever you’re dealing with integers and answering a question about whether some expression $f(n)$ is even or odd, try considering the two cases ” $n$ is even” and ” $n$ is odd.”
If you’re dealing with a claim about modular arithmetic, or with claims like “is a multiple of”, considering the different possible remainders that a number could have (i.e. the three cases where $k \% 3 = 0, 1$ or $2$ we considered above) is often a great approach.
If you’re dealing with claims about rational and irrational numbers, separating the cases ” $x$ is rational” and ” $x$ is irrational” can be handy.

We practice this in the examples below:

Claim 7.4. For any real number $x$ , we claim that $|x+7| - x \geq 7$ .

Proof by cases is an excellent technique whenever you see an absolute value, as it lets you get rid of the absolute value in each case.

Proof. We proceed by considering cases:

We first consider the case where $x \geq -7$ . In this case, $x+7 \geq 0$ , and so $|x+7| - x$ is just $x+7 - x = 7$ . In this case, our inequality holds!
Now, we consider the case where $x < -7$ . In this case, $x+7 < 0$ , and so we have $|x+7| = -(x+7)$ (as the negative of a negative is a positive!)

Therefore, we have $|x+7| - x = -(x+7) - x = -7-2x$ . If $x < -7$ < then $-2x > 14$ , and so $-2x - 7 > 14-7 = 7$ . Therefore our claim holds in this case as well!

Because every number is either greater than or equal to 7 or less than 7, we’ve considered all possible cases. As our claim was true in each possible case, this completes our proof!

\square

Claim 7.5. For every two numbers $x,y$ , we always have that $\max(x,y) + \min(x,y) = x+y$ .

_Proof._We consider two possible cases:

$x > y$ . In this case, we have $\max(x,y) = x$ and $\min(x,y) = y$ ; therefore, $\max(x,y) + \min(x,y) = x+y$ as claimed.
$x \leq y$ . In this case, we have $\max(x,y) = y$ and $\min(x,y) = x$ ; therefore, $\max(x,y) + \min(x,y) = y+x = x+y$ , also as claimed.

This covers all possible cases, as for any two numbers $x,y$ either $x > y$ or $x \leq y$ ! Therefore, we’ve proven our claim.

\square

In the next example, we return to the tricks we used to calculate the last digit of a number in Claim 1.10:

Claim 7.6. For any integer $k$ , we have that $(k^4) \% 10$ is always either 0, 1, 5, or 6.

Proof by cases is also usually a good idea if you see the modulus operator!

Proof. We saw before in our chapter on integers that if $d_0$ is the last digit of an integer $n$ , then $n^m \% 10$ is equal to $d_0^m \% 10$ for any positive integer power $m$ .

Therefore, in our claim, we don’t have to actually consider every possible integer $k$ ; we can just consider the ten different possible last digits $k$ could have, and calculate the cubes of each of those! We do so here:

$0^4 \% 10 = 0 \% 10 = 0$ .
$1^4 \% 10 = 1 \% 10 = 1$ .
$2^4 \% 10 = 16 \% 10 = 6$ .
$3^4 \% 10 = 81 \% 10 = 1$ .
$4^4 \% 10 = 256 \% 10 = 6$ .
$5^4 \% 10 = 625 \% 10 = 5$ .
$9^4 \% 10 = (81)^2 \% 10 = 1^2 \% 10 = 1$ .
$6^4 \% 10 = (36)^2 \% 10 = 6^2 \% 10 = 6$ .
$7^4 \% 10 = (49)^2 \% 10 = 9^2 \% 10 = 6$ .
$8^4 \% 10 = (64)^2 \% 10 = 4^2 \% 10 = 6$ .

In all ten cases, our remainders are always 0, 1, 5, or 6, as claimed! Therefore, we’ve proven our claim.

\square

Finally, we can use a proof by cases to prove one of our exercises:

Answer to Exercise 7.1. Even though there are lots of socks in the drawer, there are only 3 colours. Therefore, we can just take 4 socks to make sure that at least 2 of them are the same colour. To understand why, let’s look at the colours of three first socks.

There are two possible cases here:

If we were lucky enough to pick two matched socks from those first three, then we’ve succeeded!
However, in the worst-case scenario the first three socks we took were all different colours, and we do not yet have a pair. In this situation, we have one sock of each colour.

In this case, however, our fourth sock is guaranteed to match at one of our three previously chosen socks!

In any case, we’ve grabbed a pair of socks, as desired.

Proof by Contradiction

Contradiction - i.e. the “if we’re stuck on a problem, suppose we’re wrong and see what happens” proof technique --- is a method we’ve already used to considerable success throughout this coursebook! In this section, we study several more examples of proof by contradiction, and talk a bit about the trickier aspects of this proof method.

To start, let’s examine one of the most famous proofs by contradiction! In this proof, we’re going to really pick apart the structure of a proof by contradiction, so that we can see why this method works:

Claim 7.7. The number $\sqrt{2}$ is not rational.

Proof. As always, let’s start by unpacking our definitions:

$\sqrt{2}$ is the unique positive real number such that when we square it, we get 2.
A number $x$ is rational if we can write $x = \frac{m}{n}$ , where $m$ and $n$ are integers and $n$ is nonzero.

With this done, our claim can be unpacked to the following:

“For a real number $x$ , if $x = \sqrt{2}$ , then there are no values of $m, n \in \mathbb{Z}$ with $n \neq 0$ such that $x = \frac{m}{n}$ .”

So: how do we do this? Because the problem wants us to show that we cannot write $\sqrt{2} = \frac{m}{n}$ for any integers $m, n$ with $n \neq 0$ , we can’t just check a few examples: we’d have to look at all of them, and this could be quite difficult! We’d have to find some useful property that makes all examples of this form fail, and this could be quite hard to find.

Instead, consider the following way to “side-step” these difficulties. Instead of looking at all pairs $m, n$ and trying to show that each one fails, let’s assume that we have one such pair $m, n$ such that $\sqrt{2} = \frac{m}{n}$ !

With this assumption in hand, let’s now show that this assumption “breaks mathematics” in some way: that starting from this assumption, we can get to something we know is impossible, like $1+1=0$ . If we can do this, then we know that our original assumption that there was such a fraction $\frac{m}{n}$ must have been nonsense (i.e. false), and therefore that our claim that no such fraction exists is true!

We do this here. Suppose that we can find two integers $m, n$ with $n \neq 0$ such that $\sqrt{2} = \frac{m}{n}$ . If $m$ and $n$ have common factors, divide through by those factors to write $\frac{m}{n}$ in its simplest possible form: that is, don’t write something like $\frac{3}{6}$ or $\frac{12}{24}$ , write $\frac{1}{2}$

Then if we square both sides, we get $2 = \frac{m^2}{n^2}$ . Multiplying both sides by $n^2$ gives us $2n^2 = m^2$ , which means that $m^2$ is even (because it is a multiple of 2)!

This means that $m$ is even (see the tutorials from earlier in this course!), and therefore that we can write $m = 2k$ for some integer $k$ . If we plug this into our equation $2n^2 = m^2$ , we get $2n^2 = (2k)^2 = 4k^2$ , and by dividing by $2$ we have $n^2 = 2k^2$ .

This means that $n^2$ is even, and therefore that $n$ is even as well (same logic as before.)

But this means that both $n$ and $m$ are multiples of 2; that is, that they have a common factor! We said earlier that we’d divided through by any common factors to get rid of them, so this is a contradiction: from our initial assumption we got to something that is both true and false. As a result, our original assumption (that we could write $\sqrt{2} = \frac{m}{n}$ ) must be false; that is, we have shown that $\sqrt{2} \neq \frac{m}{n}$ for any integers $m, n$ with $n \neq 0$ , as desired. Yay!

\square

We can generalize the form of the argument we just made above as follows:

We have a claim we’re trying to prove; let’s denote it $P$ , for shorthand.
Instead of proving $P$ is true directly, we want to prove that “not $P$ ” is impossible.
To do this, we can simply do the following:
1. Assume, for the moment, that “not- $P$ ” is actually true!
2. Working from this assumption, find a pair of contradictory statements that are implied by “not $P$ .” That is, find a pair of statements $Q$ and “not- $Q$ ” such that if $P$ was false, both $Q$ and “not- $Q$ ” would both hold. Common examples are “1=1” and “1=0”, or ” $n$ is even” and ” $n$ is false”, or ” $x$ is positive” and ” $x$ is negative”: stuff like that.
3. This proof demonstrates that “not- $P$ ” must be impossible, because it implies two contradictory things (like the two simultaneous claims ” $n$ is even”and ” $n$ is odd.“) Mathematics is free from false statements and contradictions; therefore, we know that this must be impossible. In other words, “not- $P$ ” must be false and $P$ must be true!

This is how a proof by contradiction works. You take your claim $P$ , assume it’s false, and use “not $P$ ” to deduce contradictory statements, which you know mathematics cannot contain.

A beautiful quote about proofs by contradiction, by the mathematician G. H. Hardy: “Proof by contradiction”, which Euclid loved so much, is one of a mathematician’s finest weapons. It is a far finer gambit than any chess gambit: a chess player may offer the sacrifice of a pawn or even a piece, but a mathematician offers the game.

We consider another example here:

Claim 7.8. There are two irrational numbers $a$ and $b$ such that $a^b$ is rational.

Proof. In the example we’re studying here, we want to show that it’s impossible for $a^b$ to be irrational for every pair of irrational numbers $a,b$ . To do this via a proof by contradiction, we do the following: first, assume that $a^b$ is irrational for every pair of irrational numbers $a,b$ ! If we apply this knowledge to one of the few numbers ( $\sqrt{2}$ ) we know is irrational, our assumption tells us that in specific

$\sqrt{2}^{\sqrt{2}}$ is irrational.

What do we do from here? Well: pretty much the only thing we have is our assumption, our knowledge that $\sqrt{2}$ is irrational, and our new belief that $\sqrt{2}^{\sqrt{2}}$ is also irrational. The only thing really left to do, then, is to let $a = \sqrt{2}^{\sqrt{2}}$ , $b = \sqrt{2}$ , and apply our hypothesis again. But this is excellent! On one hand, our we have that $a^b$ is irrational by our hypothesis. On the other hand, we have that $a^b$ is

(\sqrt{2}^{\sqrt{2}})^{\sqrt{2}} = \sqrt{2}^{\sqrt{2} \cdot \sqrt{2}} = \sqrt{2}^2 = 2,

which is clearly rational. This is a contradiction! Therefore, we know that our hypothesis must be false: there must be a pair of irrational numbers $a,b$ such that $a^b$ is rational.

\square

An interesting quirk of the above proof is that it didn’t actually give us a pair of irrational numbers $a,b$ such that $a^b$ is rational! It simply told us that either

$\sqrt{2}^{\sqrt{2}}$ is rational, in which case $a=b=\sqrt{2}$ is an example, or
$\sqrt{2}^{\sqrt{2}}$ irrational, in which case $a = \sqrt{2}^{\sqrt{2}}, b = \sqrt{2}$ is an example,

but it never actually tells us which pair satisfies our claim! This is a weird property of proofs by contradiction: they are often nonconstructive proofs, in that they will tell you that a statement is true or false without necessarily giving you an example that demonstrates the truth of that statement.

To stick with the classical route, let’s study another one of the first proofs by induction, that we considered all the way back in our first chapter:

Claim 7.9. (Euclid) There are infinitely many prime numbers.

Proof. As we did with our argument that no number can be both even and odd at the same time, let’s approach this with a bit of a thought experiment: what would happen if there were not infinitely many prime numbers?

Well: if this were to happen, then there would be some fixed number of primes in existence. Let’s give that number a name, and say that there were $n$ primes in existence. Then, if we had a piece of paper with $n$ lines on it, we could in theory write down all of the prime numbers that existed!

If we labeled those lines $1,2,\ldots n$ , we could then refer to those prime numbers by their labels: that is, we could refer to our prime numbers by calling them $p_1, p_2, p_3, \ldots p_n$ . (Giving things names: a very useful technique!)

In this world where we have all of these prime numbers, what can we do with them? Well: as we saw before, a particularly useful property about prime numbers is that they form the building blocks out of which we can make all integers. Therefore, we’re motivated to take our primes and stick them together, and see what happens!

After a lot of effort, you might eventually hit on the clever combination of our prime numbers that Euclid discovered: think about what happens if we multiply all of our prime numbers together, and then add 1 to that entire sum. That is: look at the number

M = 1 + (p_1 \cdot p_2 \cdot p_3 \cdot \ldots \cdot p_n)

On one hand: take any of the prime numbers on our list. To indicate that we’re taking a general prime number from our list, let’s refer to that prime number as $p_i$ , where $i$ could be any index. Look at $\frac{M}{p_i}$ . By definition, this is equal to

\frac{1}{p_i} + \left(\overbrace{p_1 \cdot p_2 \cdot \ldots \cdot p_n}^{\textrm{all of the primes except for }p_i}\right).

In particular, notice that this is not an integer! $\frac{1}{p_i}$ is some fraction strictly between 0 and 1, because $p_i$ is a prime and therefore at least 2, while the right-hand-bit is a product of integers and therefore is an integer itself.

Therefore, we’ve shown that if we multiply $p_i$ by a number to get $M$ , that number cannot be an integer; in other words, we have shown that $p_i$ is not a factor of $M$ . This holds for any of our primes, because $p_i$ was an arbitrary prime; therefore $M$ is not a multiple of any of our prime numbers!

On the other hand, though, we know that $M$ is an integer. Therefore, we know that we can factor $M$ into prime numbers! Do so, and write $M$ as a product of prime numbers.

Our argument above tells us that that none of those prime numbers can be from our list $p_1, \ldots p_n$ . But this list was supposed to contain all of the prime numbers! Therefore, we know that our original assumption that we could write down all of the prime numbers must have been false: that is, there must have been infinitely many prime numbers.

\square

Common Contradiction Mistakes: Not Understanding Negation

In a proof by contradiction, we’re trying to prove that a claim $P$ is true by showing that “not- $P$ ” cannot be false. Perhaps surprisingly, the most common mistake people make when using a proof by contradiction is in their very first step: specifically, in writing just what “not- $P$ ” is for a given claim!

For example, consider the following claim:

Claim 7.10. For every pair of integers $x,y$ such that $x$ and $y$ are both odd, we have that $x\cdot y$ is also odd.

Here are a number of incorrect ways that people will try to negate this claim:

“For every pair of integers $x,y$ such that $x,y$ are both even, we have that $x \cdot y$ is also even.”

The first mistake made here is in the first two words, where we wrote “for every!” That is: Claim 7.10 is a claim about all pairs of integers. As such, if someone were to say that $P$ was false, they’d just have to have one counterexample to prove us wrong!

That is: if someone made a claim that every UoA student was enrolled in Compsci 120, you wouldn’t prove them wrong by trying to show that every UoA student is not enrolled in Compsci 120; you’d just have to find at least one student not in Compsci 120.

This tells us the first part of how we should write the negation of this claim: it should go “There is a pair of integers $x,y$ …”
This, however, is not enough! That is: “There is a pair of integers $x,y$ such that $x,y$ are both even and $x \cdot y$ is also even” is also not the negation of our claim.

To see why this fails, note that Claim 7.10 is a claim about all pairs of odd integers. As such, if we’re trying to say that this claim fails, we still need to work in the same universe as our normal claim, and we need to find a counterexample that consists of a pair of odd integers.

That is: if someone told you “every Compsci 120 student who’s never been to Antartica currently has an A,” you don’t disprove them by trying to find a student who’s been to Antarctica! Their claim was about people who haven’t been to Antarctica; to disprove it, you need to work within the same bounds!

As such, the correct negation of Claim 7.10, is the following:

“There is a pair of integers $x,y$ such that both $x,y$ are odd, and yet $x \cdot y$ is even.”

Much more reasonable!

Using similar logic, the clam

Claim 7.11. “There is an even prime number.”

should negate to the following:

“Every prime number is odd.”

This is because the opposite of a claim about something existing is that there are no counterexamples (i.e. a claim about everything). As well, the universe of numbers we’re studying (primes) should remain the same, leaving only the conclusion (even) to flip to “odd.”

Let’s consider another claim:

Claim 7.12. If $G$ is a graph containing $\geq 2$ vertices, then $G$ contains two vertices whose degrees are equal.

There are many tempting and incorrect ways to negate this claim:

If $G$ is a claim containing $<2$ vertices, then $G$ does not contain 2 vertices whose degrees are different.” This is the same “negating the universe” error from before!

That is: our claim is about graphs on 2 or more vertices. Its negation should still talk about graphs on 2 or more vertices! As well, our claim was about vertices whose degrees agreed. Its negation should still talk about vertices with equal degree!
This suggests the following as a fix: “If $G$ is a graph containing $\geq 2$ vertices, then $G$ does not contains two vertices whose degrees are equal.”

This has the right universe, but it fails for a second, more subtle reason: the opposite of “if $A$ then $B$ ” is not “if $A$ then not- $B$ .”

That is: suppose that someone claimed to you “if you attend tutorials in Compsci 120, you’ll pass the class.” The above strategy would say that you could disprove their claim by saying “if you attend tutorials in Compsci 120, then you won’t pass the class.”

This doesn’t really make sense, though! Their claim here is a really strong guarantee: it says that everyone who attends tutorials in Compsci 120 will pass. To disprove this, you don’t need to show that everyone who attends tutorials won’t pass; that’s way too hard! Instead, you’d just need to find at least one person who (1) attended all of the tutorials but (2) didn’t pass the class.

That is: the opposite of “if $A$ then $B$ ” is “there is a situation where $A$ holds and $B$ fails.”

By using this, the correct negation of Claim 7.12,. is the following:

“There is a graph $G$ on $n\geq 2$ vertices, such that $G$ does not contain two vertices with the same degree.”

We can summarise the observations we made above as follows:

Observation 7.16. The phrases “For every” and “There exist” get switched around when writing a proof by negation. This is because we disprove a claim about everything by finding a single counterexample, and we prove that no example of a thing can exist by showing that everything is not a counterexample!

Observation 7.17. The “universe” of a claim remains the same: i.e. we don’t disprove a claim about all even numbers by studying odd numbers.

Observation 7.18. The opposite of an “if A then B” statement is “there is a situation where $A$ holds and $B$ fails.” That is: if someone tells you that when it rains outside the sidewalk gets wet, you just need to find a situation where (1) it’s raining and (2) some bit of sidewalk is still dry to disprove their claim!

To finish this section and put this to use, let’s prove Claim 7.12!

Proof of Claim 7.12,. As noted above, the contradictive assumption here would be that $G$ is a graph on $n\geq 2$ vertices in which all of the vertices have different degrees.

We know that the maximum degree of any vertex in $G$ is $n-1$ , because any vertex is at most adjacent to every other vertex. As well, the minimum degree of any vertex is trivially 0. Therefore, there are in theory $n$ different possible degrees for the $n$ vertices in $G$ , namely the values $0,1,\ldots n-1$ .

If no degrees are repeated in $G$ , then (because there are $n$ vertices and $n$ different possible degrees) there is exactly one vertex with degree $i$ , for every $i \in \{0,1,\ldots n-1\}$ . If $n \geq 2$ , note that in particular $n-1 \geq 1$ , and so the degree-0 and degree- $(n-1)$ vertices are different.

Now, notice that if there is a vertex with degree $n-1$ , it is connected to every other vertex in our graph. In particular it must be connected to the vertex that has degree 0, which contradicts the property that this vertex is supposed to have degree 0.

Therefore we have a contradiction, and can conclude that our original claim must hold.

\square

Proof by Construction

In many of the proofs above, we’ve been focused on proving claims about “all” numbers $x,y$ , or “all” odd integers $n$ , or other sorts of “universal” claims about things. When we’re proving claims of these forms, we need to use techniques and arguments like the ones above where we work in general / don’t get to use examples to prove our claim!

Sometimes, however, we’ll find ourselves with claims of the form “There exists a number $n$ such that…” or “There is a value $x$ with the property…” In this sort of situation, we’re not being asked to show that something is true for all values: instead, we’re just asked to find a single example!

In situations like this, a common technique is proof by construction, where we simply create an object with the desired properties. We illustrate this with an example:

Claim 7.13. _ There is an odd integer that is a power of two._

Proof. Notice that $2^0 = 1$ . Therefore, $1$ is a power of 2. 1 is also odd, as we can write it in the form $1+2k$ for some integer $k$ (specifically, $k=0$ .) Therefore we’ve constructed the claimed integer, as desired.

\square

Notice the following two aspects of this proof:

We didn’t have to work with a general integer $n$ ; instead, we got to give a specific example! This is because our claim was of the form “There is …”, which means that we’re just asked for a single example. If our proof had started “For all …”, this would be different, and this proof would be invalid (just like how examples weren’t enough for a proof in our earlier “the sum of any two odd numbers is even” claim.)
Also notice that we didn’t just say “1 is the answer” and ended our proof; we actually took the time to explain why 1 has the desired properties. You should expect to always do this!

We give a second example, to illustrate how these sorts of things come up in combinatorics and/or “puzzle” mathematics:

Claim 7.14. Take the aces and face cards from a standard 52-card deck. Can you arrange them in a $4 \times 4$ grid so that no suits or symbols are repeated in any row or column?

Proof. Behold!

\square

In this proof, we don’t have much to really explain: the solution presented self-evidently has the desired property (just check every row and column.) If it was unclear, though, we’d have to have some explanation along with our answer!

We close by giving a pair of slightly trickier examples for how construction can work, by using processes and algorithms:

Definition 7.1. Given a graph $G$ , a vertex coloring of G with $k$ colors is any way to assign each vertex of $G$ one of $k$ different colors, so that no two adjacent vertices get the same color.

Claim 7.15. We can vertex-color any tree $T$ with at most 2 colors.

Proof. Consider the following algorithm to paint any connected graph $G$ ’s vertices with the colors red and blue:

Algorithm 7.13. Init: Choose any vertex $v$ in $G$ , and paint it red.

Take all currently uncolored vertices that are connected to any red vertices by an edge, and color them blue.
Take all currently uncolored vertices that are connected to any blue vertices by an edge, and color them red.
If there are any uncolored vertices left, go back to (i) and repeat.

We claim that this algorithm will always succeed at coming up with a valid vertex coloring of any tree $G$ ; indeed, more generally, we claim that this algorithm will always succeed at making a valid 2-coloring of any graph $G$ that doesn’t contain an odd-length circuit as a subgraph! Because any tree does not have an odd-length circuit as a subgraph (indeed, it doesn’t contain a cycle subgraph of any length), this would prove our claim.

To see why, we use a second proof technique: contradiction! Think about what would happen if this algorithm would fail, given a connected graph $G$ with no odd-length circuit subgraphs.

Because $G$ is connected, the above process will eventually color every vertex of $G$ ; we first color $v$ , then its neighbors, then its neighbor’s neighbors, and so on/so forth, coloring every vertex within a walk of $k$ edges by the $k$ -th pass. So if the algorithm fails, it does so because in its coloring there must be an edge $\{x,y\}$ in which $x, y$ both get the same color.

Notice that if a vertex $w$ is colored blue, it is because we can walk to to $w$ from our starting $v$ in either one step, or three steps, or five steps … or in general an odd number of steps. This is because we alternated between red and blue in our algorithm. Similarly, if a vertex $w$ is colored red, it is because we can walk to $w$ in either 0 or 2 or 4 or 6 or … or an even number of edges.

Take any walk $P_x$ from $v$ to $x$ , and any other walk $P_y$ from $v$ to $y$ . We have proven that either $P_x, P_y$ both have an even number of edges, or that they both have an odd number of edges. Therefore, the circuit formed by starting at $v$ , walking along $P_x$ to $x$ , using the $\{x,y\}$ edge to go to $y$ , and then reversing $P_y$ to return to $v$ has either $(\textrm{even} + 1 + \textrm{even})$ or $(\textrm{odd} + 1 + \textrm{odd})$ length. Both of the quantities, in particular, are odd! This contradicts our assumption that $G$ had no odd-length circuits.

As a result, our original claim (that $G$ is bipartite) must be true!

\square

Claim 7.16. A Hamiltonian circuit in a graph $G$ is a walk that starts and ends at the same vertex, and along the way visits every other vertex exactly once. For example, the cube graph $Q_3$ below has a Hamiltonian circuit (highlighted.) The $n \times n$ grid graphs are defined by drawing a $n \times n$ grid of vertices and connecting adjacent vertices, as drawn below: Prove for all $n \in \mathbb{N}$ that $G_{2n, 2n}$ has a Hamiltonian circuit.

Proof. Consider the following constructive process for generating such a circuit:

Label the $(2n)^2$ vertices in the grid graph $G_{2n, 2n}$ with coordinates $(i,j)$ , where vertex $(1,1)$ is the vertex in the bottom-left-hand corner and $(2n,2n)$ is the vertex in the upper-right-hand corner.
Start at $(1,1)$ ,
From this vertex, walk to the right until you’re at the bottom-right corner $(1,2n)$ .
Go up one step to $(2,2n)$ , and then walk back to the left until you’re at $(2,2)$ .
Go up one step to $(3,2)$ , then walk back to the right until you’re at $(3,2n)$ .
Go up one step to $(4,2n)$ , and then walk back to the left until you’re at $(4,2)$ .
Go up one step to $(5,2)$ , then walk back to the right until you’re at $(5,2n)$ .
… Keep doing this! Eventually, you will find yourself at $(2n,2)$ , having walked on all of the vertices whose second coordinates are not equal to 1, and not having visited any vertices whose second coordinate is 1 other than (1,1). (This is where the ” $2n$ ” part comes in: because we go right on odd rows and left on even rows, if our grid has even height then we’ll be going left on our top row and thus wind up at $(2n,2)$ as claimed.)
Walk from $(2n,2)$ to $(2n,1)$ , and then go down to $(1,1)$ .

By construction we have visited all vertices in our graph exactly once, and thus created a Hamiltonian circuit, as desired.

\square

Finally, we can answer one our earlier exercises by using construction:

Answer to Exercise 7.2. This has a nice constructive answer: take half of each of the four tablets today, and the other half tomorrow.

Because this is constructive, we don’t have to explain how we came up with this clever idea: we can just present it as an answer! (Though if you did something clever to come up with an idea, it is good to mention how you did this.) We just have to explain why this works, which is pretty simple: half of each tablet gives us half of the tablets of each type, and thus exactly one dose for each kind.

Proof by Induction: First Examples

Sometimes, in mathematics, we will want to study a statement $P(n)$ that depends on some variable $n$ . For example:

$P(n) =$ “The sum of the first $n$ natural numbers is $\frac{n(n+1)}{2}$ .”
$P(n) =$ “If $q \geq 2$ , we have $n \leq q^n$ .
$P(n) =$ “Every polynomial of degree $n$ has at most $n$ roots.”
$P(n) =$ Take a $2^n \times 2^n$ grid of unit squares, and remove one square from the top-right-hand corner of your grid. The resulting shape can be tiled by - shapes.

For any fixed $n$ , we can usually use our earlier proof methods to prove that the claim holds! For instance, let $P(n)$ be the fourth example above, and consider $P(3)$ , which is the claim that if we take a $8 \times 8$ grid of squares and delete the top-right-hand corner square, we can tile the rest of the shape with tiles. We can prove this by construction by just giving an explicit way to do it: see the drawing below!

However, sometimes we will want to prove that one of these statements holds for every value $n \in \mathbb{N}$ . How can we do this?

The answer here is mathematical induction! Mathematical induction is just a formal way of writing up our “building-block plus preserved property” process, in a way that will hopefully let us avoid everyone having the same shoe size. We describe it here:

To start, take a claim $P(n)$ that we want to prove holds for every $n \in \mathbb{N}$ .
The first step in an inductive proof is the base step: in this step, we explicitly prove that the statement $P$ holds for a few small cases using normal proof methods (typically construction or just calculation.)

Usually you just prove that your claim holds when $n=0$ , but sometimes you start with $n=1$ if your claim is one where 0 is a “dumb” case, or prove a handful of cases like $n=0,1,2,3$ to get the hang of things before moving on. You can think of this as the “building block” step from before!
With this done, we move to the induction step! Here, we prove the following statement:

If our claim $P$ is true for all values up to some $n$ , \then it will continue to be true at the next value $n+1$ .

Because this is an implication, i.e. an if-then proof, we usually prove it directly by assuming that our claim holds for all values up to some $n$ , and then use this assumption to prove that our claim holds when we have $n+1$ in our claim.

Just doing these two steps shows that your claim $P$ is true for every natural number $n$ ! To see why, just examine what these two steps tell you:

By our “base case” reasoning, we know that our claim is true at n=0.
By our “inductive step” reasoning, we know that if our claim is true at 0, it is true at the “next” value $n+1 = 1$ .
By our “inductive step” reasoning, we know that if our claim is true up to 1, it is true at the “next” value $2$ .
By our “inductive step” reasoning, we know that if our claim is true up to 2, it is true at the “next” value $3$ .
…
By continuing this process, we eventually get to any $n$ ! Therefore our claim is true for every $n \in \mathbb{N}$ , as desired.

The way we usually think of inductive proofs is to think of topping dominoes. Specifically, think of each of your $P(n)$ propositions as individual dominoes - one labeled $P(0)$ , one labeled $P(1)$ , one labeled $P(2)$ , and so on/so forth. With our inductive step, we are insuring that all of our dominoes are lined up - in other words, that if we’ve knocked over some of them, the “next one” will also be knocked over. Then, we can think of the base step as “knocking over” the first domino. Once we do that, the inductive step makes it so that all of the later dominoes also have to fall, and therefore that our proposition must be true for all $n$ (because all the dominoes fell!)

To illustrate how these kinds of proofs go, let’s go back to our tiling problem, and prove that we can tile this grids for every $n \in \mathbb{N}$ ! (As an added bonus, let’s prove it for grids where we remove one square from anywhere, not just the top-right-hand corner!)

Claim 7.17. For any $n \in \mathbb{N}$ , take a $2^n \times 2^n$ grid of unit squares, and remove one square from somewhere in your grid. The resulting grid can be tiled by - shapes.

Proof. As suggested by the section title, we proceed by induction, where our proposition $P(n)$ is “we can tile a $2^n \times 2^n$ grid of $1 \times 1$ squares with one square deleted by using - shapes.”

Base case: we want to prove $P(0)$ . So: what is $P(0)$ ?

Well: for $n=0$ , we have a $2^0 \times 2^0 = 1 \times 1$ grid, which we’ve removed a $1 \times 1$ square from. In other words, we have nothing. If you want, you can think of “nothing” as being something we can trivially cover by placing no three-square shapes!

Alternately, you can decide that 0 is a stupid case and look at $n=1$ instead. For $n = 1$ , we simply have a $2 \times 2$ grid with one square punched out. As this is one of our three-square shapes, we are done here; just place a tile on top of our grid!

Either starting place is fine. In general, we recommend doing as many base cases as you need to do in order to feel comfortable with the pattern and believe that you’ve done something concrete! Most of the time, though, the base case will feel kinda silly; don’t worry about this! The inductive step will do all of the heavy lifting for us.

Inductive step: We want to prove that if we know that our claim holds up to $n$ , then it holds for $n+1$ as well; formally, this means that we want to show that if $P(0)$ and $P(1)$ and … and $P(n)$ all hold, then $P(n+1)$ must follow.

In this problem in particular, this means that we’re assuming that we can tile a $2^{k} \times 2^{k}$ -grid with a square deleted for any $k \leq n$ , and want to use this assumption to tile a $2^{n+1} \times 2^{n+1}$ grid with a square deleted.

To do this, take any $2^{n+1} \times 2^{n+1}$ grid with a square deleted. Divide it into four $2^n \times 2^n$ squares by cutting it in half horizontally and vertically. Finally, by rotating our grid if needed, make it so that the one missing square is in the upper-right hand corner.

Take this grid, and carefully cut out one three-square shape in the center as drawn below.

Now, look at each of the four $2^n \times 2^n$ squares in this picture. They all are missing exactly one square: the upper-right hand one because of our original setup, and the other three because of our placed three-square-shape.

By our inductive hypothesis $P(n)$ we know that all of these smaller squares can be tiled! Doing so then gives us a tiling of the whole shape; in other words, we’ve shown how to use our $P(n)$ results to get a tiling of the $2^{n+1} \times 2^{n+1}$ grid.

As this completes our inductive step, we are thus done with our proof by induction.

\square

The claim we proved above - one where we were some sense “growing” or “extending” a result on small values of $n$ to get to larger values of $n$ - is precisely the kind of question that induction is set up to solve! The Fibonacci numbers, which we introduce in the next question, is another object where this sort of “extension” approach is useful to consider.

Definition 7.2. The Fibonacci numbers $f_n$ are defined by a recurrence relation as follows:- $f_0 = 0$ , $f_1 = 1$ .- For any $n \geq 2$ , $f_n = f_{n-2} + f_{n-1}$ .

To illustrate how it works, let’s use it to calculate the first few values of the Fibonacci sequence! We know that $f_0 = 0, f_1 = 1$ by definition.

To find $f_2$ , we can use the fact that for any $n \geq 2$ , $f_n = f_{n-2} + f_{n-1}$ to calculate that.

f_2= f_0 + f_1 = 0+1 = 1.

We can calculate further values of $f_n$ similarly:

$f_3 = f_1 + f_2 = 1+1 = 2,$
$f_4 = f_2 + f_3 = 1+2 = 3,$
$f_5 = f_3 + f_4 = 2+3 = 5,$
$f_6 = f_4 + f_5 = 3+5 = 8,$
$f_7 = f_5 + f_6 = 5+8 = 13,$
$f_8 = f_6 + f_7 = 8+13 = 21,$
$f_9 = f_7 + f_8 = 13+21 = 34,$
$f_{10} = f_8 + f_9 = 21+34 = 55,$
$f_{11} = f_9 + f_{10} = 34+55 = 89,$
$f_{12} = f_{10} + f_{11} = 55+89 = 144,$

When doing this, you’ll likely notice a number of interesting properties about the Fibonacci sequence: see for a ton of weird/beautiful properties these numbers have! We prove one of these properties here:

Claim . For any $n \in \mathbb{N}$ , the $n$ -th Fibonacci number is even if and only if $n$ is a multiple of 3.

Proof. Let $P(n)$ denote the claim “(the $n$ -th Fibonacci number is even) $\Leftrightarrow$ ( $n$ is a multiple of 3).” We want to prove that $P(n)$ holds for all $n \in \mathbb{N}$ , and proceed to prove this claim by induction.

Our base cases are pretty easy to check! We calculated the Fibonacci numbers from $f_0$ to $f_{12}$ above, and we can see that the only ones that are even are $f_0, f_3, f_6, f_9$ and $f_{12}$ ; so we know that $P(0), P(3), P(6), P(9),$ and $P(12)$ all hold.

We now move to the inductive step: here, we want to prove $P(0)$ and $P(1)$ and $P(2)$ and … and $P(n)$ , when all combined together, imply $P(n+1)$ . We start with what we’re assuming, namely that all of $P(0), P(1),\ldots P(n)$ are all true: that is, we’re assuming that the $k$ -th Fibonacci number is even if and only if it is a multiple of 3, for every $k \in \{0,1,\ldots n\}$ .

We want to prove $P(n+1)$ , i.e. that the $n+1$ -th Fibonacci number is even if and only if it is a multiple of 3.

So: let’s consider cases! There are two possible cases for the value $n+1$ : either it is a multiple of 3, or it’s not.

If $n+1$ is a multiple of 3, we can write $n+1=3k$ for some $k \in \mathbb{Z}$ . Notice that this means that $n = 3k-1$ and $n-1 = 3k-2$ , and in particular that both of the values $n, n-1$ are not multiples of 3! As a result, our inductive assumption tells us that $f_n, f_{n-1}$ are both not even, because they’re not multiples of 3! But being not-even just means that these numbers are both odd. As a result, because $f_{n+1} = f_n + f_{n-1} =$ odd+ odd= even, we have shown that $f_{n+1}$ is even in this case.
If $n+1$ is not a multiple of 3, then $n+1$ either has remainder 1 or 2 when we divide it by 3; this is because any number has remainder 0, 1 or 2 when divided by 3. This means we can write $n+1 = 3k + 1$ or $3k+2$ , for some $k \in \mathbb{Z}$ . As a result, we can see that of the two numbers $n, n-1$ , exactly one of them is a multiple of 3; if $n+1 = 3k+1$ then $n, n-1 = 3k, 3k-1$ , and if $n+1 = 3k+2$ then $n,n-1 = 3k+1, 3k$ . As a result, our inductive hypothesis tells us that exactly one of $f_n, f_{n-1}$ are odd, and the other is even. Therefore, because $f_{n+1} = f_n + f_{n-1} =$ (one odd number plus one even number)= odd, we have shown that $f_{n+1}$ is odd in this case.

So, by using strong induction, we have proven that $f_n$ is even if and only if it is a multiple of 3!

\square

Induction: Two Recurrence Relations

As we just saw in the section above, induction is a useful tool to study recurrence relations! In this section, we continue with this theme, and use induction to prove Claim 4.2 and Claim 4.6 from our algorithms chapter.

Claim 7.18. (Claim 4.2) For every positive integer $n$ , we have $\texttt{SelectionSortSteps}(n) = \dfrac{3n^2+9n-10}{2}$ .

Proof. We proceed by induction. First, we can notice that the table of values we calculated earlier validates our claim for the first few values of $n$ :

\begin{array}{c||c|c|c|c|c|c|c} n & 1 & 2 & 3 & 4 & 5 & 6 & 7\\\hline \texttt{SelectionSortSteps}(n) & 1 & 10 & 22 & 37 & 55 & 76 & 100\\\ \dfrac{3n^2+9n^{2^2}-10}{2} & 1 & 10 & 22 & 37 & 55 & 76 & 100\\ \end{array}

This gives us our base case.

For the inductive step, we proceed as always: we assume that our claim holds up to some value $n$ , and seek to prove it for $n+1$ .

In particular, if our claim holds up to some value $n$ , we have $\texttt{SelectionSortSteps}(n) = \dfrac{3n^2+9n-10}{2}.$

As well, by Claim 4.1, we know that

\texttt{SelectionSortSteps}(n+1) = 3(n+2) + \texttt{SelectionSortSteps}(n).

By combining these together, we get

\texttt{SelectionSortSteps}(n+1) = 3(n+2) + \dfrac{3n^2+9n-10}{2} = \dfrac{6(n+2)}{2} + \dfrac{3n^2+9n-10}{2} = \dfrac{3n^2 + 15n +2}{2}.

But we know that

\dfrac{3(n+1)^2+9(n+1)-10}{2} = \dfrac{3n^2 + 6n + 3 + 9n + 9-10}{2} = \dfrac{3n^2 + 15n +2}{2}

In other words, we’ve shown that our claim holds at $n+1$ , and have thus proven our claim by induction!

\square

We can study $\texttt{MergeSort}$ in the same way:

Claim 7.19. (Claim 4.6,) $\texttt{MergeSortSteps}(2^k) = k \cdot 2^{k+2} + 2^{k+1} - 1$ , for every natural number $k$ .

Proof. We again proceed by induction. Again, as before, our previously-calculated table of values suffices for a base case:

\begin{array}{c||c|c|c|c|c|c|c} k & 1 & 2 & 3 & 4 & 5 & 6 & 7 \\\hline \texttt{MergeSortSteps}(2^k) & 11 & 39 & 111 & 287 & 703 & 1663 & 3839\\ k \cdot 2^{k+2} + 2^{k+1} - 1 & 11 & 39 & 111 & 287 & 703 & 1663 & 3839\\ \end{array}

With this established, we turn to the inductive step. Here, we again assume that our claim holds up to some value $k$ , and seek to prove it for $k+1$ .

In particular, if our claim holds up to some value $k$ , we have $\texttt{MergeSortSteps}(2^k) =k \cdot 2^{k+2} + 2^{k+1} - 1$

As well, by Claim 4.5, we know that

\texttt{MergeSortSteps}(2^{k+1}) = 1 + 4\cdot 2^{k+1} + 2\cdot\texttt{MergeSortSteps}( 2^k).

Again, by combining these together, we get

\texttt{MergeSortSteps}(2^{k+1})

=1 + 4\cdot 2^{k+1} + 2(k \cdot 2^{k+2} + 2^{k+1} - 1)

= 1 + 2^{k+3} + 2k2^{k+2} + 2 \cdot 2^{k+1} - 2

= k\cdot 2^{k+3} + 2^{k+3} + 2^{k+2} -1,

But we know that

(k+1) \cdot 2^{(k+1)+2} + 2^{(k+1)+1} - 1 = k\cdot 2^{k+3} + 2^{k+3} + 2^{k+2} - 1

In other words, we’ve shown that our claim holds at $k+1$ , and have thus proven our claim by induction!

\square

Induction, Graphs, and Trees

Induction is a particularly useful technique to use when studying graphs and trees! We prove three claims here, two of which you may recall from our section on trees:

Claim 7.20. If $G$ is a connected multigraph with loops (i.e. we allow multiple edges, and also allow an edge to have both of its endpoints be equal) on $n$ vertices, then $G$ contains at least $n-1$ edges.

Proof. We proceed by induction on $n$ . For $n=0,1$ , this claim is trivially true, as we always have that $E$ is a nonnegative number.

This establishes our base cases, so we now turn to the inductive step: here, we assume that our claim holds for all connected graphs on at most $n$ vertices, and seek to use that assumption to prove that our claim holds for connected graphs on $n+1$ vertices.

To do this, consider the following operation, called edge contraction. We define this as follows: take any graph $G$ and any edge $e$ in $G$ with two distinct endpoints. We define $G_e$ , the graph that this edge, as follows: take $G$ , delete $e$ , and then combine $e$ ’s two endpoints together into a single vertex, preserving all of the other edges that the graph has along the way.

We draw examples of this process below: here, we have started with a graph on six vertices, and then contracted one by one the edges highlighted in red at each step.

Notice that contracting an edge decreases the number of vertices by 1 at each step, as it “squishes together” two adjacent vertices into one vertex. It also decreases the number of edges by 1 at each step, as we are contracting an edge to a point!

Finally, notice that contracting an edge preserves the property that our graph is connected. To see why, take any walk

\{v_0, v_1\}, \{v_1, v_2\}, \ldots \{v_{i-1}, v_i\}, \{v_i, v_{i+1}\}, \{v_{i+1}, v_{i+2}\}, \ldots \{v_{n-1}, v_n\}

in our graph. Notice that if we contracted an edge $\{v_i, v_{i+1}\}$ in this walk, this would collapse the vertices $v_{i}, v_{i+1}$ into some new vertex $v_{i\oplus i+1}$ and preserve all of the edges other than $\{v_i, v_{i+1}\}$ . As a result, our walk would just become

\{v_0, v_1\}, \{v_1, v_2\}, \ldots \{v_{i-1}, v_{i\oplus i+1}\}, \{v_{i\oplus i+1}, v_{i+2}\}, \ldots \{v_{n-1}, v_n\},

and thus still connects the vertices $v_0, v_n$ . Therefore, edge contraction cannot “break” any pre-existing walks, and so preserves the property that our graph is connected.

We can use this process to prove our claim via induction:

Take any connected multigraph graph $G$ on $n+1$ vertices.
Take any edge $e$ in $G$ with two distinct endpoints (such an edge exists, because $G$ contains at least two different vertices and $G$ is connected) and contract that edge. This gives us a new graph $G_e$ , which is connected and contains $n$ vertices.
Therefore, by induction, we know that in $G_e$ , the number of edges is at least $n-1$ .
We also know that $G$ has exactly one more edge than $G_e$ .
Therefore, in $G$ , we know that we have at least $n-1+1 = (n+1)-1$ edges. In other words, we’ve proven that our claim holds for graphs on $n+1$ vertices, as desired!

\square

Notice that this result applies to simple graphs as well, as any simple graph is certainly a multigraph!

We can also use induction to prove Theorem 6.2! We split this result into two parts, as it’s a longish equivalence proof:

Theorem 7.3(Half of Theorem 6.2) If $T$ is a tree on $n$ vertices, then $T$ contains exactly $n-1$ edges.

Proof. We proceed by induction. Our base case is straightforward: any tree on 1 vertex clearly has no edges (as it’s a simple graph.) If you want, you can also consider 2-vertex graphs as well; the only connected two-vertex graph is , which has one edge as desired.

For the inductive step, let’s assume that our property holds for all trees on up to $n$ vertices. Let $T$ be any tree on $n+1$ vertices; we want to use our assumption to prove that $T$ contains exactly $(n+1) - 1 = n$ edges.

To do this, let $l$ be a leaf vertex in $T$ (we know that $l$ exists by our earlier theorem.) Delete $l$ and the edge connecting $l$ to the rest of $T$ from $T$ ; call the resulting graph $T-l$ .

$T-l$ contains $n$ vertices, because we started with $n+1$ vertices and deleted one vertex. It is also still connected (because $l$ was degree 1, the only walk that would need to use the edge to $l$ is a walk going directly to $l$ , and we deleted $l$ .) Finally $T-l$ contains no cycle subgraphs, because $T$ contained no cycle subgraphs and deleting things from $T$ cannot have somehow caused a cycle to exist.

Therefore $T-l$ is a tree! By induction, $T-l$ contains $n-1$ edges.

Therefore $T$ itself contains $(n-1)+1 = n$ edges, because $T$ is just $T-l$ plus the vertex $l$ and the single edge connecting $l$ to the rest of $T$ . In other words, we’ve proven our inductive claim!

\square

Theorem 7.4(The other half of Theorem 6.2).If $G$ is a connected graph on $n$ vertices containing exactly $n-1$ edges, then $G$ is a tree.

Proof. We proceed by contradiction; suppose that $G$ is a connected graph on $n$ vertices containing $n-1$ edges that is somehow not a tree.

Because $G$ is connected, the only way that $G$ can fail to be a tree is if it contains a cycle subgraph. Let $\{v_1, v_2\}, \{v_2, v_3\}, \ldots \{v_k, v_1\}$ be such a cycle subgraph.

Take $G$ and delete the edge $\{v_1, v_2\}$ from $G$ . We claim that $G$ is still connected.

To see why, take any walk in $G$ that uses the edge $\{v_1, v_2\}$ , and replace each use of $\{v_1, v_2\}$ with the sequence of edges $\{v_1, v_k\}, \{v_k, v_{k-1}\}, \ldots \{v_3, v_2\}$ . In other words, every time you’d go directly from $v_1$ to $v_2$ along that edge, instead use the cycle to go the “other” way around!

As a result, if two vertices $x,y$ used to be connected by a walk in $G$ , they are still connected after deleting $\{v_1, v_2\}$ ; in other words, $G - \{v_1, v_2\}$ is still connected.

But $G - \{v_1, v_2\}$ is a graph on $n$ vertices containing $n-2$ edges, as we had $n-1$ edges and deleted one. But in Claim 7.20, we proved that a connected graph on $n$ vertices must contain at least $n-1$ edges! In other words, we have a contradiction, and so our claim that $G$ was a tree must have been correct.

\square

Proof Methods: How to Choose

With all of these proof methods at our fingertips, a natural question is this: how do you choose a method? One answer is the following:

Just try methods one-by-one until something works!

Paper is cheap, and it’s usually just a lot faster to try stuff and see which things break than to predict ahead of time which method is “best.” Also, most problems in maths can be solved by a number of different methods: there’s rarely a single “correct” approach to a problem! Instead, many problems can be solved with many different techniques, and each different proof can help illustrate a new way of thinking about the task at hand.

With that said, though, there are clues or signs in a problem statement that can indicate that certain techniques might be useful. There are no hard-and-fast rules here, but the following observations often come in handy:

Are you proving a claim of the form “if (some claim $A$ is true), then (some other claim $B$ is true)?” If so, a direct proof is maybe a good idea! Write down what it would mean for $A$ to be true, and try to use that assumption to prove that $B$ is also true.
Are you dealing with modular arithmetic, even versus odd numbers, claims about “is a multiple of,” or absolute values? Cases are often useful here. (More generally: if you have any problem where the inputs or outputs can be split into cases, do so! Proofs by cases often combine with other proof methods.)
Are you being asked a claim of the form “Show that $blah$ exists?” Construction’s a good way to go here! (This is opposed to claims of the form “Show that every $x$ has property $foo$ ,” which you usually do not do by construction, as it’s hard to construct every $x$ !)
Are you proving a claim where it seems like your previous results stick together to give you a later result? (Tiling problems that involve a general integer $n$ , anything defined recursively like the Fibonacci sequence, processes that have recursion in them, …) When you’re writing your proof, do you find yourself wanting to use ”…” to show how a pattern you’ve found continues? Then this is probably a good candidate for induction!

Induction is often especially useful for studying the runtime of an algorithm, or for proving that a given algorithm is “guaranteed” to give a specific output.
Are you totally stuck? Try contradiction! Contradiction often gives you something to start from: i.e. it turns claims like “show that every object $X$ has property $blah$ ” into “what would happen if an object $X$ failed to have property $blah$ ?” This is often an easier place to start from! It can be a lot easier to think about how to “break” things and find contradictions of any kind, than to try to proceed directly and argue why some very specific property must hold.

Contradiction is a particularly nice technique if you’re trying to show that some task is impossible: the opening line of “suppose that this is possible” often makes proofs a lot easier to start.
Are you still totally stuck? Maybe it’s false: try a disproof! Best-case scenario: you disprove it and can move on. Worst-case scenario: even if you fail at disproving it, if you think about why you weren’t able to disprove your claim, you might be able to turn that back into a good proof.

To get some practice with this, we solve a few problems below, and in each proof explain why we picked the methods that we did!

Claim 7.21. For every positive integer $n$ , $16^n - 1$ is a multiple of 15.

Proof. Let’s think about which of our proof methods we want to try:

Direct proof: we could try this. This would involve expanding out what it means to be a multiple of 15, and trying to use logic/known results to get to the conclusion.
Cases: even though cases is often a good technique when working with mods / multiple problems, this is likely not a great idea here. This is because there isn’t really a clear set of cases you’d want to divide $n$ into: even versus odd doesn’t seem relevant, and considering all fifteen possible remainders of $n \% 15$ seems painful enough to not do unless absolutely necessary.
Contradiction: could do, if we’re stuck!
Construction: not relevant. We’re proving something for every integer, not building examples for some values.
Induction: This doesn’t obviously look like induction, in that it’s not clear how you’d relate $16^n - 1$ to the “next” value $16^{n+1} -1$ .

With some algebraic trickery, though, this is possible! Notice that $16(16^n - 1) = 16^{n+1} - 16 = (16^{n+1} - 1) - 15$ , and thus that we’ve related one step to the next (in a way that involves a 15, which seems promising.) So if you saw this trick, then this is promising!
Disproof: If you were suspicious of this claim, you could start by calculating a handful of values of $16^n - 1$ , and see if any failed to be a multiple of 15.

No obvious counterexamples immediately showed up in our table below, so let’s not try to disprove this just yet.
$\begin{array}{c|c} n & 16^n - 1 & \\\hline 0 & 1-1=0 & \\ 1 & 16-1=15 & \\ 2 & 16^2 - 1 = 255 = 15 \cdot 17 & \\ 3 & 16^3 - 1 = 4095 = 15 \cdot 273 & \\ 4 & 16^4 - 1 = 65535 = 15 \cdot 4369 & \\ \end{array}$

So, amongst our proof methods, a direct proof and induction look promising. Let’s try induction first!

Base case: we saw in our table in the margins that our case holds for $n=0,1,2,3$ and $4$ . So we’ve established our claim for a number of base cases.

Inductive step: For the inductive step, we assume that we’ve proven our claim for $n$ : i.e. that $16^n - 1$ is a multiple of 15. We seek to use this claim to prove that our claim holds for the “next” value $n+1$ : i.e. that $16^{n+1} - 1$ is also a multiple of 15.

This is not too hard to do! Notice that as we observed above,

16^{n+1} - 1 = 16^{n+1} - 16 + 15 = 16(16^n - 1) + 15.

If $16^n - 1$ is a multiple of 15, then by definition we can write $16^n - 1 = 15k$ for some integer $k$ . Doing so tells us that the right-hand-side is

16(15k) + 15 = 15(16k) + 15 = 15(16k+1) = \textrm{a multiple of 15}.

Therefore, $16^{n+1}-1$ is also a multiple of 15! As a result, we’ve proven our claim by induction: we showed that it holds for the first few values of $n$ , and then showed that it will stay true, as if it is true for some value of $n$ it must stay true for the “next” value $n+1$ .

\square

This is not the only way you could prove this result! We could also use a direct proof:

Proof. We want to show that $16^n - 1$ is always a multiple of 15, for any positive integer $n$ .

By definition, this holds true if and only if $16^n \equiv 1 \mod 15$ .

So: we know that $16 \equiv 1 \mod 15$ , because $16-1$ is itself a multiple of 15. We also know from Claim that for any positive integers $a,b,c,n$ that if $a \equiv b \mod c$ , then $a^n \equiv b^n \mod c$ as well.

Combining these facts tells us that for any positive integer $n$ , we have $16^n \equiv 1^n \mod 15$ . Because $1^n = 1$ for all $n$ , this gives us $16^n \equiv 1 \mod 15$ . By definition, this means that for every positive integer $n$ we’ve shown that $16^n - 1$ is a multiple of 15, as desired!

\square

Claim 7.22. Consider the following program $\texttt{puzzle(n)}$ , which is a slightly modified version of the algorithm you studied in Practice Problem 5 on page 74. It takes in as input a nonnegative integer $n$ , and does the following:

If $n$ is either 0, 1, or 6, output $n$ and stop. Otherwise, go to (ii).
If $n$ has two or more digits, replace $n$ with its last digit and go to (i). Otherwise, go to (iii).
Replace $n$ with $n^2$ and go to (i).

Prove that for every nonnegative odd number $n$ , if $\texttt{puzzle(n)}$ stops, it outputs 1.

Proof. We consider proof methods:

Direct proof: we could try this, in that really any proof can be written in a direct method. (In this sense, “direct” often just means “not worrying about a specific technique.“)
Cases: This seems promising! Our program does different things based on different inputs. As such, cases is a natural technique to use!
Contradiction: could do, if we’re stuck!
Construction: not relevant. We’re proving something for every integer, not building examples for some values.
Induction: Not a great technique here. It doesn’t look like knowing what our program does on input $n$ would tell us much about what it does on input $n+1$ .
Disproof: this seems true (run the program on a bunch of odd values if you’re skeptical!), so disproving it doesn’t seem like a good idea.

Cases looked like the strongest approach: so let’s try that!

Take any nonnegative odd number $n$ . Because $n$ is a nonnegative integer, this means that $n = 1,3,5,7,9$ or $n$ is a two-digit number (whose last digit is 1,3,5,7 or 9.)

After one iteration of our program, if $n$ was a two-digit number it will be replaced with one of 1,3,5,7,9; so it suffices to just understand those five cases:

If $n=1$ , then the program immediately stops and outputs 1, as desired.
If $n=3$ , then on our first iteration we square $n$ to get 9; on our second iteration we square again to get 81; on our third iteration we replace this two-digit number with 1; and on our fourth iteration we stop and output 1.
If $n=5$ , we saw earlier that this case enters an infinite loop.
If $n=7$ , then on our first iteration we square $n$ and get 49; on our second iteration we replace this two-digit number with 9; on our third iteration we square to get 81; on our fourth iteration we replace this two-digit number with 1, which we then output and halt on our fifth iteration.
If $n=9$ , we go to 81 and then 1 and then halt (as described above.)

In all of these cases, we either run forever or output 1, as claimed!

\square

Again, we can use a more direct approach if we see it:

Proof. Take any nonnegative odd number $n$ . Notice that if $n$ is odd, then no matter what our program does $n$ will stay odd! This is because the square of an odd number is odd, and the last digit of an odd number is odd.

Therefore, $n$ will never be reduced to either of 0 or 6. As a result, the only possibilities that remain are “the program halts when $n=1$ ” or “the program runs forever,” as there are no other conditions that cause our program to halt.

\square

We close this section with a third problem about graphs:

Claim 7.23. Given a graph G, an edge coloring of G with k colors is any way to assign each edge of G one of k different colors, so that no two edges of the same color share an endpoint in common. An example of an edge coloring is given below. Show that there is a graph G in which all vertices have degree 3, and yet at least four colors are needed to create an edge-coloring of G.

Proof. While we could go through all of the proof methods again, we’ll shortcut the process and explain why we know this is a constructive proof: it’s asking us to show that there is a graph with some property! This isn’t a “show all graphs have property $foo$ problem or a “take any graph $G$ , show that it cannot be $blah$ ” task: this is just asking us to find some single graph with a given property.

So, uh: behold the graph below!

This is the Petersen graph $P$ , a particularly useful counterexample to many claims in graph theory. We claim that $P$ is a graph that needs four colors to properly color its edges; i.e. you cannot edge-color $P$ with three colors. Note that to complete our proof, we need to explain why using just 3 colors to edge-color this grap is impossible: that is, it’s not enough to just give our object, we also need to show that it has the desired property!

To do this, we now need a new proof technique. We claim that if you went through your list of proof techniques, none would stand out and you’d get stuck for a while. In this situation, contradiction is what we’d go to!

Here, a proof by contradiction would start as follows: suppose that we could use only three colors to color the edges of $P$ . Call them red, green and blue.

Make the following observations:

Notice that because every vertex of $P$ is degree 3, every vertex of $P$ has one red, one green, and one blue edge leaving it.
Notice that on the outer pentagon of $P$ , we need to use all three colors: if we tried to use just two colors to edge-color the pentagon, then we would have two edges with the same color touching each other.
Take a red edge on the outer pentagon of $P$ . Call its two endpoints $x,y$ , and let the inner vertices adjacent to $x,y$ be called $a,b$ .

Because $\{x,y\}$ is red, the edge $\{a,x\}$ is not red. Therefore, the red edge that (1) told us must be connected to $a$ is on this inner star.

Similarly, because $\{x,y\}$ is red, $\{b,y\}$ is not red. Therefore, the red edge that (1) told us must be connected to $b$ is also on this inner star.

Finally, because $a$ and $b$ are not adjacent, these red edges are different: that is, there are two red edges in this inner star.

Take a blue edge on the outer pentagon of $P$ . The same logic as in (3) guarantees two blue edges in the inner star.
Take a green edge on the outer pentagon of $P$ . The same logic as in (3) guarantees two green edges in the inner star.
Conclusion: the inner start has two blue edges, two red edges, and two green edges.

… but it only has five edges! Therefore this is impossible; i.e. we’ve reached a contradiction, and our original claim (that at least four colors are required) has been proven.

\square

Practice Problems

You’re a programmer! You’ve found yourself dealing with a program $\texttt{mystery(n)}$ that has no comments in its code, and you want to know what it does. After some experimentation, you’ve found that $\texttt{mystery(n)}$ takes in as input a natural number $n$ , and does the following:

If $n$ is either 0, 1, 2, or 3, output $n$ and stop. Otherwise, go to (ii).
If $n$ is even, replace $n$ with $n/2$ and go back to (i). Otherwise, go to (iii).
Replace $n$ with $n+5$ and go to (i).

Come up with the following proofs about $\texttt{mystery(n)}$ :

(a) Use contradiction to prove the claim “if this program outputs 3 on input $n$ , then $n$ is not a power of 2.”

(b) Disprove the claim “Given any natural number $n$ as input, this program will eventually stop” by finding a counterexample.

(d) Write a direct proof that if “the output of this program is 1” then “the input to this program was 1.”

(-) Let $a,b,c$ be three integers, such that $a$ divides $b$ and $a$ divides $c$ . Write a direct proof that $a$ also divides $b-c$ .
(-) Write a proof by cases that for any integer $n$ , the number $3n^2+n-16$ is even.
(-) Prove by contradiction that the number $\sqrt{19}$ is irrational.
Suppose that $a$ , $b$ are a pair of real numbers with the following property: if $x$ is any number greater than $b$ , then $x$ must also be greater than $a$ . Prove by contradiction that $a \leq b$ .
(+) The game of generalized $n$ -tic-tac-toe is played as follows: on a $n \times n$ grid, two players $X$ and $O$ take turns placing their respective symbols $x, o$ into cells of the grid. No cell can be repeated. The game ends whenever any player gets $n$ consecutive copies of their symbol on the same row /column / diagonal, or when the grid is completely filled in without any player having any such $n$ consecutive symbols. (Normal tic-tac-toe is where $n = 3$ .)

Prove that there is no strategy in generalized tic-tac-toe where the second player to move is guaranteed to win.
A queen in the game of chess is a piece, shaped like . In the game of chess, when moved, a queen (when placed in a given cell in a chessboard) can go to any cell within the same row, any cell within the same column, or any cell along the two diagonals through the cell that it starts from. We illustrate this below.

The $n$ -queens problem is the following task: Take a $n \times n$ chessboard. Can you place $n$ distinct queens on this chessboard, so that no queen can capture any other (i.e.\ so that there is no way to move any one queen into a cell currently occupied by another queen?): (a) Prove by cases that there are no solutions to the $3$ -queens problem. (b) Prove by construction that there is a solution to the $4$ -queens problem. (c) Prove by construction that there is a solution to the $8$ -queens problem.
Prove or disprove the following claim: if $G$ is a graph in which the degree of every vertex is 3, then $G$ cannot be bipartite.
Prove or disprove the following claim: if $G$ is a graph in which the degree of every vertex is at least 2, then $G$ is connected.
Consider the following two-player game: starting with the single number 123, two players alternately subtract numbers from the set $\{1,2,3\}$ from this value. The player who first gets this sum to 0 wins.

If you want to win this game, should you go first or second? Prove that your chosen player has a winning strategy. (Hint: try induction!)

Take an equilateral triangle with side length $2^n$ . Divide it up into side-length 1 equilateral triangles, and delete the top triangle. Call this shape $T_n$ :

Take three side-length 1 equilateral triangles. Join them together to form the following tile: Prove that you can tile $T_n$ with tiles, for every $n \in \mathbb{N}$ .

Consider the following inductive “proof:”

Claim: If $G$ is a graph containing at least 3 vertices, and every vertex in $G$ has degree at least 2, then $G$ contains a $C_3$ subgraph

Proof. We proceed by induction on the number of vertices in $G$ . To start: we assume that our claim holds for all graphs on up to $n$ vertices, and seek to prove that it holds for all graphs on $n+1$ vertices as well. To do this: for any $n > 3$ , take any graph $G$ on $n$ vertices in which every vertex has degree at least 2. Add a new vertex $v$ to this graph, and connect $v$ to at least two other vertices in $G$ ; this gives us a new graph $G'$ on $n+1$ vertices. We know by our inductive assumption that $G$ itself contains a $C_3$ subgraph. Therefore this new graph $G'$ on $n+1$ vertices also contains a $C_3$ subgraph! This is what we wanted to prove, and thus finishes our inductive proof.

\square

Find every logical flaw in this proof. Explain why the flaws you have found are indeed mistakes. (Hint: there are at least two flaws here!)

Consider the following solitaire game:

The picture above contains three circles drawn in the plane. In each of the bounded regions formed by the intersections of these circles, we’ve placed a coin, which is white on one side and black on the other. All of the coins start with their black side up.

The moves you’re allowed to perform in this game are the following:

You can at any time flip all of the coins within any circle.
Alternately, you can at any time take any circle and flip all of its white coins over to black. Can you ever reach the following configuration? Prove your claim.