During the Enlightenment, the world was viewed as an orderly place in which everything operated by precise mathematical principles. Beginning with the Romantic backlash, though, and unfolding into the early 20th century, rationalism in mathematics, as well as the concept of absolute truth, was called into question.

This change came about through startling discoveries in the field of mathematics, revealing that what we thought were fundamental principles in both geometry and arithmetic were not always true in every situation.

This is a transcript from the video series *Redefining Reality: The Intellectual Implications of Modern Science*. Watch it now, on The Great Courses Plus.

Interestingly, this collapse of certainty—and the resulting wide-scale wreckage to the foundations of our reason—was reflected in a pair of notable and related works of fiction coming out of Britain in the late 1800s: Lewis Carroll’s *Alice’s Adventures in Wonderland*, and Edwin Abbott Abbott’s *Flatland: A Romance of Many Dimensions*.

*Alice in Wonderland*—as the tale has come to be known—and its sequel, *Through the Looking-Glass*, were written by Lewis Carroll, the pen name of Charles Lutwidge Dodgson, a mathematical logician at Oxford.

Because of the ascent of non-Euclidean geometry, and the attempts to find a firm foundation for arithmetic in set theory, the world of mathematics had turned its attention largely to logic in hopes that an analysis of the nature of mathematical reasoning would yield the needed justification to keep mathematics as the hardcore basis of all that was certain.

But mathematicians are a strange lot. While they understood the gravity of the circumstances, they also found themselves drawn to the paradoxes that could be created when these foundations were examined creatively.

The heart of traditional logic is a pair of laws usually stated together: the law of the excluded middle, which holds that either a sentence or its negation must be true, and the law of non-contradiction, which holds that the two cannot both be true. Either I have a brother or I don’t; I can’t both have a brother and not have a brother.

If we know that one claim is true, we know the other is false. And if we know one is false, then we know the other is true. A paradox is a sentence or set of sentences that contradicts itself. That is, it must be true, but then its truth implies its falsity.

Since the law of non-contradiction holds that a sentence can’t be both true and false, we have an affront to the basis of logic itself. Logicians like Dodgson were examining purported paradoxes generated by the logical system itself. If authentic, such paradoxes would undermine the underpinnings of our most rigorous form of thought.

Testing paradoxes was not limited to Dodgson’s professional published work; it’s also what he was playing with in his famous work for children. Think of the opening scene in which Alice spies the White Rabbit: he takes a pocket watch out of his waistcoat, declares that he’ll be late, and then dashes down a hole into which Alice follows.

Carroll famously wrote that “Either the well was very deep, or she fell very slowly, for she had plenty of time as she went down to look about her and wonder what was going to happen next.”

Now, remember that Dodgson was British, and the most celebrated and influential figure in Britain was Isaac Newton. Newton’s laws of motion explained Galileo’s finding that all objects close to the surface of the Earth fall at exactly the same rate.

If we’re taking seriously the possibility that Alice is falling slowly, then we’re taking seriously the possibility that the immutable laws we take as part and parcel of the working of the universe no longer apply.

We have entered a realm where the Enlightenment presupposition of a well-behaved universe whose rules are accessible to our rational faculties can be reasonably denied. Reason implies nonsense.

Reason is not ultimately self-justifying, but ultimately self-defeating. *Wonderland* represents the death of the rationalist project.

Think of Alice’s encounter with Humpty Dumpty in *Through the Looking-Glass*. The two are discussing birthdays and birthday presents when Humpty Dumpty exclaims, “There’s glory for you.” Alice replies, “I don’t know what you mean by glory.” To quote the passage further:

Humpty Dumpty smiled contemptuously. “Of course you don’t—till I tell you. I meant ‘there’s a nice knock-down argument for you!’”

“But ‘glory’ doesn’t mean ‘a nice knock-down argument’,” Alice objected.

“When I use a word,” Humpty Dumpty said, in rather a scornful tone, “it means just what I choose it to mean—neither more nor less.”

“The question is,” said Alice, “whether you can make words mean so many different things.”

“The question is,” said Humpty Dumpty, “which is to be master—that’s all.”

In Greek, a language that Dodgson spoke, the word for “word” is *logos*, which also means, in some contexts, logic. When he has Humpty Dumpty redefine the word “glory,” it’s not a random sense but rather a nice knock-down argument, that is, the goal of logic.

When Alice protests, Humpty Dumpty replies that the real question is who is to be master: humans or words? That is, which ought to be the measure of reality: our experience, our freedom, our lives, or logic? Should we be subservient to our reasoning as the Enlightenment-influenced rationalists would have?

Or should we take command over logic, words, and *logos*? If we follow logic, do we disappear down a rabbit-hole—something that seemed possible given the paradoxes that mathematical logic was generating?

A similar challenge is found in Abbott’s *Flatland*, which takes place on a two-dimensional plane. It’s a flat world populated by shapes, the narrator being a lowly square. In Flatland, the more sides you have, the higher your social position.

The square is visited by a sphere, a three-dimensional figure that appears to the square at first as a point, then a circle of increasing diameter, then a circle of decreasing diameter, and then a point—finally, a disembodied voice. The sphere tries without any initial success to convince the square of the existence of the third dimension until he finally flings the square out of his planar world into the space above it.

Upon returning to Flatland, the square becomes evangelical about convincing his fellow flatlanders about the existence of this third dimension that is upward not northward—a dimension they haven’t seen.

He’s arrested and charged with heresy by the high priest, and at his trial, he is asked to provide any evidence for the existence of this third dimension of which he so passionately speaks.

The square’s argument is mathematical. If we can take a point and move it, we get a line. If we take a line and move it parallel to itself, we get a plane. If we take that plane and move it parallel to itself, we get space.

The priest asks for physical, rather than mathematical, reasoning. The square can provide none, so the priest offers his argument: He asserts that there is no reason to think this mathematical talk is anything but trickery, with no relation to anything real.

The argument is compelling; all the while the reader knows it’s wrong. But it’s reflective of the strange results coming out of mathematics at the time. They threatened to undermine our comfortable certain basis for rationality.

But should they be accepted? Has reason led to nonsense, or is there a foundation for rational thought to be found in rational thought?

For more than two thousand years, we accepted Euclid’s axioms and postulates as true because they were self-evident, seemingly self-justifying. But at the dawn of the 20th century, mathematics, our most secure and definitive science, was in turmoil.

We were forced to re-examine the basis of what we thought reality would be like. Lewis Carroll’s Wonderland was about to be discovered here in our world, when the science of the 20th century forced us to reconsider reality itself.

Lewis Carroll, whose real name was Charles Dodgson, was indeed a mathematician. At the time, groundbreaking new mathematical concepts were coming out such as imaginary numbers. Dodgson, whose views on math were rather traditional, considered these new concepts to be absurd, and the world of *Alice’s Adventures in Wonderland* mirrors this absurdity.

The scientific name for Alice in Wonderland syndrome is dysmetropsia, which is a neuropsychological condition in which one’s perception is distorted. In Lewis Carroll’s novel, Alice experiences a distorted sense of her body and physical surroundings, which reflects the anxiety many people were experiencing at that time as new mathematical concepts were introduced which changed how they viewed the world.

In the book *Flatland*, Flatland is a world occupied by flat, two-dimensional objects (circles, squares, etc.). Space, on the other hand, contains three-dimensional shapes such as spheres and cubes. The objects in Flatland can’t comprehend that this three-dimensional world exists because all they know is their own world.

Flatland and Lineland are similar in that each land consists of lines and points. Unlike the world of Spaceland, Lineland is familiar to Flatland because they share common elements.

The early 20th century sat at an interesting intellectual crossroads. The Enlightenment of the 17th and 18th centuries upheld reason as the defining characteristic of humanity and saw the advance of knowledge as the hallmark of human progress.

The Romantic movement of the 19th century was a backlash against what it saw as the arrogance and naiveté of the reduction of the human to its brain. As such, there was a divide in the intellectual world between the sciences and the arts.

The sciences largely bought into the Enlightenment presuppositions of a well-behaved world, regulated by absolute laws that were accessible to human reason through rigorous processes of observation and logic. Acquiring an understanding of these laws was paramount in striving to move forward as a species.

The practitioners of the arts and letters, on the other hand, saw themselves as the loyal opposition, obligated to correct what they saw as the overreach of the sciences, which seemed to miss the beauty, the joy, and the experience of being human. The heart was as important as the brain and the fetish that scientists held for knowledge limited their true understanding.

But this divide was not impermeable; there were important influences in both directions.

The advances achieved and the difficulties experienced by the sciences changed the way people saw the universe, the world, and human nature. This change in perspective affected what was painted, built, written, and composed. Art reflects the world, but the world is never given to us directly.

As the late-18th-century philosopher Immanuel Kant pointed out, “Our understanding of reality is always mediated through concepts we use to create the ideas in our mind that only seem to come fully formed from our senses.”

But contrary to Kant, who held that these basic categories were necessary and unrevisable, these intellectual building blocks do change over time. Advances in the sciences force us to radically revise how we make sense of ourselves and our environment.

This revision provides fertile ground for the creative arts. The freedom creatives enjoy to reflect the world in novel and sometimes strange ways can inform and influence the scientists, who are often in need of new and exciting ways to organize the seemingly strange results they receive from the universe.

Sometimes art and science influence each other, sometimes not. A tension existed between the Enlightenment-influenced rationalists and the romantically-inclined thinkers.

Reason-based rationalists supported their foundational views by citing the progress science and technology had made as evidence. The use of reason gave us justified beliefs that could come from nowhere else.

Foremost amongst these were the propositions of mathematics, which provided humanity with absolute truths.

René Descartes, a 17th-century founder of this rationalistic movement and a major contributor to physics, mathematics, and philosophy, thought that the methods of the mathematician were so impressive that they ought to form the backbone of all further investigations. In all other areas of conversation, the intellectuals disagreed about everything.

But mathematics demanded universal assent by way of facts that could not be challenged by anyone who understood them, and complex results were derived with absolute rigor.

Mathematics was a thing of beauty, an absolute bedrock on which man could construct a completely firm structure of understanding. A generation after Descartes, when Isaac Newton mathematized physics with his invention of calculus, it seemed like the rational worldview based on mathematics was well on its way to giving us an unassailable sense of reality itself.

Mathematical propositions were self-evident and true beyond question. Those who doubted these propositions revealed themselves either to be lacking in understanding, mentally deficient, or just trying to be irascible.

It was worrisome when, in the 19th century, the very foundations of mathematics came into serious doubt.

Traditionally, the mathematical realm has been thought of as having two parts: Geometry, which deals with shapes in space, and arithmetic, which deals with matters of number.

Both had been rigorously grounded. While interesting works were showing some interconnections, they were thought to be two different but equally justified areas of knowledge. Then things fell apart in both.

Since the 3rd century B.C., geometry had been synonymous with the name Euclid. Centuries of work had been done in geometry before Euclid, but his contribution created order from the results.

He created a structure based on a few simple and obvious propositions using a strict means of reasoning to derive hundreds of complex and intricate theorems. These theorems, because of the rigor of his logic, must share in the certainty attributed to the first, most basic truths.

These basic truths come in three groups. First are the definitions, which simply describe what is meant by basic geometric terms.

A circle, for example, is the set of points in a plane at some fixed distance from a center point.

Definitions are true. They’re true because they simply tell us what we mean by words. We’re free to define any word in any way we want.

But there are two other categories of basic truths Euclid used. One category includes his collection of axioms. The axioms were basic obvious truths that were not explicitly geometric.

For example, equals added to equals yield equals. If John and Suzy have the same number of apples and each is given the same additional number of apples, then each still has the same number of apples as the other. It’s difficult to argue against that truth.

The postulates are similar except they are about purely geometric matters. Give us any two points and we can draw a line between them. Give us any line segment and we can continue that line as far as anyone wants in either direction.

Give us a point and we can draw a circle around it of any size we want. All right angles are equal to each other. No one could doubt these.

These are the first four of the postulates. Now, if the first four are fingers on the Euclidean hand, the fifth is the sore thumb—it sticks out.

This is the postulate: if two lines are approaching each other, they’ll eventually intersect. We usually think of it in terms of an equivalent formulation: take a line and a point not on that line.

How many lines can be drawn through the point that will be parallel with the line? One and only one.

This postulate seemed less like the others and more similar to Euclid’s theorems, the statements he proved from the other postulates.

Maybe it would be possible to derive it from the other four. This would be a big deal because mathematicians prize elegance. A system is elegant if it makes the fewest possible assumptions.

Showing how to derive the fifth postulate from the other four would make it unnecessary as an assumption and so shrink the set of presuppositions. This would improve Euclid’s system, seemingly the greatest, most elegant, and most powerful system of all time.

The significance of improving upon Euclid’s ideas would be as great as improving upon the works of Shakespeare; it would assure one’s place in the annals of mathematical history. Much time was spent by brilliant people for centuries seeking the elusive proof of the fifth postulate—the so-called parallel postulate.

Such a proof was never found, presumably because it doesn’t exist. Euclid cannot be improved on, as mathematicians had hoped.

The fifth postulate is entirely independent of the other four, but mathematicians discovered this the hard way. After they had failed in all their attempts to create a direct proof of the parallel postulate from the first four, the idea occurred to several different mathematicians to try an indirect proof.

We can show something is true by demonstrating that it can’t be false. If you know that I have a sibling and you want to prove that I have a brother, it suffices to prove that I can’t have a sister. If I have a sibling and it’s false that I have a sister, then it must be true that I have a brother.

What the mathematicians wanted to prove is that Euclid’s fifth postulate can be derived from the other four; that means that the truth of the other four postulates guarantees the truth of the fifth. We start by assuming the opposite: The other four postulates are true and the fifth is false.

Then we try to derive a contradiction, that is, any sentence of the form “A and not A.” Since either A or not A has to be true, but both can’t be, the contradiction A and not A has to be false.

The existence of this contradiction shows that if the postulates one, two, three, and four are held to be true, then the denial of the fifth can’t be true. But if the denial of the fifth is false, then the fifth has to be true. This would show that Euclid could be simplified.
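The shape of this indirect argument can be laid out schematically. The symbols are shorthand introduced here for illustration, not notation from the lecture: $P_1, \dots, P_5$ stand for the five postulates and $A$ for any statement at all.

```latex
\begin{align*}
\text{Goal:}     \quad & (P_1 \land P_2 \land P_3 \land P_4) \Rightarrow P_5 \\
\text{Assume:}   \quad & P_1 \land P_2 \land P_3 \land P_4 \land \lnot P_5 \\
\text{Derive:}   \quad & A \land \lnot A \qquad \text{(a contradiction, which must be false)} \\
\text{Conclude:} \quad & \lnot\,(P_1 \land P_2 \land P_3 \land P_4 \land \lnot P_5),
                         \ \text{so if } P_1, \dots, P_4 \text{ hold, then } P_5 \text{ holds.}
\end{align*}
```

The whole strategy depends on the third step: if no contradiction can ever be derived, the argument never gets off the ground.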

However, when mathematicians assumed one, two, three, four, and the negation of five, they worked and worked but never found a contradiction. They found strange results, such as the discovery that triangles cannot have the same angles but different sizes; the internal angles of triangles add to less than 180 degrees.

Bizarre stuff—statements that seemed like they were false, but never a contradiction that had to be false.

The world was entering a new mathematical realm.

Rationalism is the theory that there are absolute truths which, using reason, the intellect can discover.

Within the tenets of rationalism, there is no proof or evidence of a supernatural creature that created and rules the universe.

It is generally accepted that René Descartes, the French philosopher, is the creator of rationalism.

Empiricism and rationalism are not the same. They are very nearly opposite. Empiricism denies innate truths and is the belief in the strict use of our five senses and induction to discover any truth, whereas rationalism believes there are innate truths that can be deduced with reason and intellect.

For centuries, mathematicians had tried to simplify Euclid’s system—one of the most elegant systems in mathematics, and the basis for the geometry taught in schools today. To accomplish this feat, they would have to show how they could derive the fifth postulate from the other four.

Not only was a solution never found, but they made some bizarre discoveries along the way.

In the first half of the 19th century, mathematicians like Nikolay Lobachevsky of Russia realized that they had found something incredibly deep and troubling. They had in their hands a new geometry, a different geometry—a non-Euclidean geometry.

This strange world was a new mathematical realm—a parallel mathematical universe. If we have two geometries, which one is true? When we had only Euclid’s, we assumed that it gave us the absolute truth about the nature of shape and space.

If there’s a possible alternative, we can’t hold its truth to be absolute. We need a new sort of evidence to justify our belief in what seemed indubitable.

But, what kind of evidence could this be? We can’t simply say that the alternatives are too weird—being weird doesn’t make it false.

To make matters worse, more systems were created by denying other postulates and combinations of postulates. Possible geometric systems were popping up right and left.

Which one was true? Which one was the real geometry? How do we know? What had been the most secure place on the entire intellectual landscape for more than two thousand years was now suddenly without a foundation.

Mathematicians were not happy, but at least we had the other side of the mathematical house. Arithmetic was still safe and secure; one plus one is two. There can’t be any reason to doubt that.

We had thought the numbers were well behaved, that they obeyed certain undeniable first truths. Think back to Euclid’s first axiom: Equals added to equals yields equals.

But let’s now think of the fifth axiom: the whole is greater than the part. If I have an amount of money and, in my will, I leave some of it to a relative and the rest to a charity, neither heir gets as much as I had in sum; by getting a part, each gets less than I had in aggregate.

This seems trivial and obvious. Of course, it’s always true.

In the second half of the 19th century, the German mathematician Georg Cantor showed this is not always the case. Suppose we have a mutual friend named George and next week is George’s birthday.

We want to get him something we know he’ll love as a gift, but what to get him? You remind me that he’s an avid collector of numbers. We talk to his wife and she tells us that his collection now includes all of the positive integers.

He has 1, 2, 3, 816, 9,674,217—he’s got them all. If he has all the numbers you could count from 1 forward, we’ll also get him the one before 1—we’ll give him 0.

The big day comes and, after blowing out his candle, he opens his gift and sees his new 0, a number he didn’t have before. Overjoyed, he looks at us and says, “Thanks for nothing.”

George’s number collection now has one more than it had before his birthday. The pre-birthday collection is only a part of the whole, so the whole is larger than the part, right?

Wrong; he still has the same number of numbers.

Suppose we go to a movie theater and we want to see if the show is sold out, undersold, or oversold. Since all people have but one backside and we use that backside to sit in but one seat, we could count the number of seats in the theater, count the number of backsides, and see which number is bigger or if they’re equal.

But Cantor realized we could do it in a way without counting at all. Ask everyone to sit down—that is, match every available seat to an available person, one to one. Then see if there is a remaining seat, a remaining individual, or neither.

This is a way to compare the size of sets without counting. Two sets are of equal size if there exists a way to map the members of one set onto the members of the other so that each element in the first set corresponds to one, and only one, member of the second with none left over.
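A minimal sketch of the seating test in Python, assuming the seats and people are given as ordinary finite lists. The function name and the result labels are made up for illustration. The point is that the code never computes a length; it only pairs items off until one side runs out.

```python
def compare_by_pairing(seats, people):
    """Pair each seat with a person, one to one, and report what is left over."""
    seat_iter, person_iter = iter(seats), iter(people)
    nothing = object()  # marker meaning "this side has run out"
    while True:
        seat = next(seat_iter, nothing)
        person = next(person_iter, nothing)
        if seat is nothing and person is nothing:
            return "sold out"      # every seat matched, nobody standing
        if seat is nothing:
            return "oversold"      # a person remains with no seat
        if person is nothing:
            return "undersold"     # a seat remains with no person


print(compare_by_pairing(["A1", "A2", "A3"], ["Ann", "Bob"]))  # undersold
```

The same idea, pairing instead of counting, is what makes it possible to compare collections that are too large to count at all.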

Let’s do this with George’s numbers: if we take each of the numbers in his post-birthday collection, can we map it to one, and only one, number from his pre-birthday collection? Take each number in the collection after his birthday and map it onto that number plus 1 in his old collection.

That is, 0 goes onto 1, 1 goes onto 2, 2 goes onto 3, and so on. In this way, there will be no number in one collection that lacks a partner in the other.

This maps one set perfectly onto the other set. The two collections are the same size.
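A small sketch of that pairing, checked only on the first few numbers, since no program can list an infinite collection. The function names are illustrative. What matters is that the rule has an exact inverse, so no number on either side is skipped or used twice.

```python
from itertools import count, islice


def to_old_collection(n):
    """Partner of a post-birthday number (0, 1, 2, ...) in the old collection."""
    return n + 1


def to_new_collection(m):
    """Inverse rule: partner of an old number (1, 2, 3, ...) in the new collection."""
    return m - 1


# Pair off the first five post-birthday numbers with their old-collection partners.
first_pairs = [(n, to_old_collection(n)) for n in islice(count(0), 5)]
print(first_pairs)  # [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]
```

Because `to_new_collection` undoes `to_old_collection` for every input, the pairing is one to one with nothing left over on either side.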

George’s extra number has made his collection no larger even though it now includes a new element above and beyond what it had. For an infinite number—the number of counting numbers—that infinite number plus one yields the same number.

All right, that’s weird but we might think, “It’s all because it’s an infinite number of numbers. You can’t make infinity bigger. It is already infinite. How can you make it bigger?”

So we might conclude that infinite amounts are as big as one can get: you can’t have a smaller or larger infinity. This is where Cantor’s work gets fun.

Consider the numbers between zero and one. Some of these are what we call rational numbers, that is, they can be written as ratios: one half is ½, three-quarters is ¾. These can be written equivalently as decimals: one half is 0.5, three-quarters is 0.75.

Interestingly, all of these numbers will have one of two properties. Either they will terminate, as ½ does (0.5, done), or they will repeat: one-third is 0.3333333..., and as far as you go, there will always be more threes.

One-seventh, when written as a decimal, is 0.142857142857142857..., repeating infinitely. There will always be another 142857. Such is the case with all ratios: they terminate or they repeat.
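Why must every ratio behave this way? In long division, each step leaves a remainder smaller than the denominator, so only finitely many remainders are possible. Either a remainder of zero appears and the decimal stops, or an earlier remainder comes back and the digits cycle from that point on. The sketch below follows that reasoning for fractions between 0 and 1; the function name is made up for illustration.

```python
def decimal_expansion(numerator, denominator):
    """Return (prefix, cycle): the digits after the decimal point of a fraction
    between 0 and 1, split into a non-repeating part and a repeating part."""
    digits = []
    seen = {}  # remainder -> index of the digit it produced
    remainder = numerator % denominator
    while remainder != 0 and remainder not in seen:
        seen[remainder] = len(digits)
        remainder *= 10
        digits.append(str(remainder // denominator))
        remainder %= denominator
    if remainder == 0:
        return "".join(digits), ""          # the decimal terminates
    start = seen[remainder]                 # the cycle begins where this remainder first arose
    return "".join(digits[:start]), "".join(digits[start:])


print(decimal_expansion(1, 2))  # ('5', '')
print(decimal_expansion(1, 7))  # ('', '142857')
```

An empty second value means the decimal terminates; otherwise the second value is the block of digits that repeats forever.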

Then there are the numbers that do not repeat or terminate when we write them out as decimals. These are what we call the irrational numbers—not because they’re crazy, but because they can’t be written as a ratio of two counting numbers.

The most famous irrational number, of course, is pi—3.14159 and off it goes forever, always another digit, never repeating endlessly like the 3’s of 1/3 or the 142857’s of 1/7. If you take the rational numbers and combine them with the irrational numbers, you get what we call the real numbers.

Suppose George has an older brother, Frank, who has been collecting numbers even longer. He’s amassed all the rational numbers between 0 and 1: ½, ¼, 9/16, all of them.

Frank then orders from an online retailer the set of irrational numbers between 0 and 1. When it arrives, he now has both the infinite set of rational numbers between 0 and 1 and the infinite set of irrational numbers between 0 and 1. That is, he has all of the real numbers between 0 and 1.

We might think that, like George on his birthday, Frank’s new set with more numbers is the same size as it was before. Infinity is infinity; you can’t have more than infinity.

But you would be wrong. Cantor proved with absolute certainty that Frank’s new set is a bigger infinity. There are sizes of infinity.

Cantor showed that there’s a simple way to demonstrate that, however we try to map Frank’s old set onto his new set, the new set always has at least one number left without a partner in the old set, which means the new set is bigger.
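The text does not spell out Cantor's demonstration. The sketch below shows the standard diagonal construction usually credited to him, under the assumption that the old collection has been written out as a numbered list of decimal expansions. The function name and the four-item sample list are hypothetical, and a real list would go on forever.

```python
def diagonal_escape(listed_digits):
    """Build the digits of a number that differs from the k-th listed number
    in its k-th decimal place, so it cannot equal any number on the list."""
    new_digits = []
    for k, digits in enumerate(listed_digits):
        # Use 5 unless the diagonal digit is already 5; then use 6.
        # Sticking to 5 and 6 sidesteps ambiguities such as 0.4999... = 0.5.
        new_digits.append("6" if digits[k] == "5" else "5")
    return "".join(new_digits)


# Digits after the decimal point of 1/2, 1/3, 1/7, and 3/4, cut to four places.
listing = ["5000", "3333", "1428", "7500"]
escaped = diagonal_escape(listing)
print("0." + escaped)  # 0.6555
```

Whatever list is supplied, the number built this way disagrees with every entry somewhere, so no list of this kind can contain all the real numbers between 0 and 1.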

Some infinite subsets are the same size as the sets that include them, while some infinite sets are bigger than others. There is an infinite hierarchy of infinite numbers, and these obey different rules than the finite numbers.

Which number rules are true? The easy answer would be that there’s one set of rules for finite numbers and another set for infinite ones.

That conclusion was the line mathematicians pursued until 1931 when the Austrian mathematician Kurt Gödel proved that we could not set out a complete set of rules for arithmetic.

Gödel showed that we could use any set of possible rules to create sentences similar to the sentence, “This sentence is false.” If it is true, then it is false, but if it is false, then it is true.

Any attempt to create rules might allow sentences like “This sentence is unprovable” to be proven, and so we would have sentences that can be proved but are false: we would have just proven the sentence that says it cannot be proven.

Alternatively, we could strengthen our rules to exclude these sentences, but then because we can no longer prove the sentence, the sentence “This sentence is unprovable,” would be true.

We would have true sentences that we can’t prove in our system, making our system incomplete. Any set of rules would be either unsound—that is, include false sentences—or incomplete—not allow all true sentences to be proved.

The days of mathematics as the epitome of human rational understanding seemed to close at the end of the 19th and beginning of the 20th century. It was the canary in the intellectual coal mine.

An absolute truth is a concept or idea that is true no matter what, such as the rule that a circle can never be square.

There are absolute truths in mathematics so long as the axioms they are based on remain true. Euclidean mathematics falls apart in non-Euclidean space, and different dimensions result in changes. One could say that within certain jurisdictions of mathematics there are absolute truths.

Mathematics was not invented. The Kemetic priests of Egypt taught a holistic concept of number and sound, which became a cult led by Pythagoras and taught by him to the Greeks. Around 300 B.C.E., the axiomatic system we still use to discover mathematical insights was developed by Euclid.

Mathematics appears to exist as a part of this universe. There is a mathematical universe hypothesis by Max Tegmark that posits that the universe itself is a mathematical structure. It is possible in other universes that we could not understand them and they would not be mathematical; however, our perception within this universe is mathematical and so even considering these other possibilities is difficult.

There is an inspiring article that highlights thinking outside of the box when it comes to solving problems. The article compares a gifted class of middle school students with a remedial sort of vocational class in order to measure the students’ creativity. The experiment asked the students in each class one question: *How do you weigh a giraffe?*

The students in the gifted class were not so much gifted as successful, and they were used to succeeding and used to pleasing their teacher. They panicked because they didn’t know how to answer this question. This was way before the Internet, and the students couldn’t go online and look it up.

Meanwhile, in the vocational class, almost immediately some kid just blurted out and said, “Hey, I know what to do. Just take a chainsaw, and chainsaw that giraffe into chunks. Then weigh the chunks.”

Chainsawing the giraffe is an attitude that a good problem solver should have because you want to be fun, and you also want to be a little bit bad. Breaking rules is a good thing; we’re not talking about actual cruelty to animals here. We’re just talking about thinking outside the box or breaking mathematical rules.

To break the mathematical rules, we need a healthy dose of the 3 C’s.

This is a transcript from the video series Art and Craft of Mathematical Problem Solving. Watch it now, on The Great Courses Plus.

Concentration, creativity, and confidence are psychological attributes that are important for just about everything, but they’re vital for solving problems. How do we enhance them? All three of them are linked, but confidence is the least important of the three because it’s truly derived from the other two.

As your concentration ability increases, and if your creativity gets stronger, then you’ll naturally become more confident.

To master concentration, you must set aside a quiet time and place for your work. You need to relax. You need to develop good work habits, and you need to find problems to concentrate on that are interesting to you, approachable by you, and addictive. Pretty much, that will do the trick. In order to build up your concentration, you want to build up from level 1, which is a minute or so of concentration, at least to level 3, getting up to an hour.

Collect a stock of back-burner problems. Start cultivating problems that you cannot solve. Make sure they’re interesting and then you’ll think about them. If you can find a problem that’s exciting to you, that annoys you, that sort of gnaws at you, then you’ll think about it. Interesting problems will force you to become a better concentrator.

Concentration leads to confidence, which frees you to explore, which facilitates investigation and creativity.

To build both your confidence and creativity, you need to be disciplined about using those interesting problems. You need to set up a problem-solving routine, some workplace, a lucky pen, and then you should keep to your routine to get your mind in a relaxed state.

Then, occasionally, deliberately break your routine. If you like to work in the morning, work late at night. If you like quiet, go to a noisy café. If you like to work in a restaurant, go sit in a library, etc.

You should also, as a strategy, specifically think about peripheral vision. The peripheral vision strategy is to realize that many problems cannot be solved with direct focus. It’s just like your eyes. Your fovea has very good focus, but it has less sensitivity than the perimeter of your eyes, the periphery of your vision.

Many problems need to percolate in your unconscious in this way. You need to cultivate a good supply of back burners, and just get in the habit of not solving problems. The more you do this, the more you’ll get into a state of investigative, purposeful contemplation, and the more powerful your mind will get.

Let’s look at a tool made famous by Carl Gauss. He was a prodigy, and as a teenager, he solved a problem that had been unsolved since Hellenistic times. He found a way to construct a regular heptadecagon, 17-gon, using compass and straightedge. The rest of his career was not much different. What Gauss could do in an afternoon was equivalent to what an ordinary mathematician could do in a lifetime.

When he was 10, he was faced with the problem: How do you find the sum of the numbers 1 + 2 + 3 up to 100? How do you compute this in 1787 when there are no calculators? Well, what little Gauss did was to pair the terms, the beginning term and the end term, (1 + 100); and then the second term and the next to last term, (2 + 99); and then (3 + 98); (4 + 97); and so on down to (50 + 51). Each of those pairs adds up to 101, and there are 50 such pairs. Thus, the sum is 5050, and that’s pretty clever. This is called Gaussian pairing and is an example of a powerful and useful tool.
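The pairing argument can be checked in a few lines. This is only a sketch of the idea described above: the function name `gaussian_pair_sum` is made up for illustration, and the closed form n(n + 1)/2 is the general version of "50 pairs of 101."

```python
def gaussian_pair_sum(n: int) -> int:
    """Sum 1 + 2 + ... + n by pairing the first and last terms."""
    # Each pair (k, n + 1 - k) adds up to n + 1, and the n terms make n / 2 pairs
    # (for odd n the middle term counts as half a pair), so the total is n(n + 1)/2.
    return n * (n + 1) // 2

# The pairs from the story: (1, 100), (2, 99), ..., (50, 51).
pairs = [(k, 101 - k) for k in range(1, 51)]
pair_sums = {a + b for a, b in pairs}

print(len(pairs), pair_sums, gaussian_pair_sum(100))
```

Every pair sums to 101, there are 50 of them, and the function agrees with adding the numbers one by one.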

Wishful thinking is one of your first strategies for this because pretending to solve a problem, even an easier one, keeps you happy. It allows you to keep thinking about solving problems. Even delusion helps – deluding yourself into thinking that you’ve solved a problem actually allows you to solve it later because you can relax and be happy. Making yourself happy and confident, even if it’s through such a transparent thing as delusion, is fine.

A corollary of wishful thinking is a very sensible idea, which I just call the “make it easier” strategy. The idea is completely common sense. If your problem is too hard, just make it easier by removing the hard part. Either make the size smaller or remove an element that makes it hard. For example, if it involves square roots, remove them temporarily.

Now, if you’ve never seen the Gaussian pairing tool, which Gauss used to sum the numbers from 1 to 100, you are undoubtedly impressed. Gaussian pairing is quite clever, but it is a tool, and tools are just tricks. These are things that can be acquired. What you should keep in mind is that strategy and tactics are what make someone a good problem solver, not the tools.

And you should use these new ideas. Any time you see a new, interesting idea, learn it, use it, and make it yours. Ideas are collective human property. They are not private property. Don’t forget that what you’re doing is chainsawing the giraffe. It’s okay to mess around and break some rules.

Let’s look at a quickie: a problem that requires “think outside the box” thinking.

Consider the problem of nine dots in a grid: how do you join them all by drawing no more than four absolutely straight lines? If you think outside the box, as demonstrated in the diagram, it’s pretty obvious what to do.

As long as you go outside the box, you’re able to get all 9 dots. It’s a fun and challenging problem if you’ve never seen it before.
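For readers without the diagram, here is a small sketch that checks one well-known solution. The coordinate layout (dots at 0, 1, 2 on each axis) and the particular path are assumptions chosen for illustration; the path is drawn without lifting the pen, which is how the puzzle is usually posed. Notice that two of its corners, (3, 0) and (0, 3), lie outside the grid.

```python
from itertools import product

# The nine dots on a 3 x 3 grid with coordinates 0..2 (an assumed layout).
DOTS = set(product(range(3), repeat=2))

# One four-line path; the corners (3, 0) and (0, 3) are outside the box.
PATH = [(0, 0), (3, 0), (0, 3), (0, 0), (2, 2)]

def on_segment(p, a, b):
    """True if point p lies on the straight segment from a to b."""
    # A zero cross product means p is on the infinite line through a and b.
    cross = (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])
    within_x = min(a[0], b[0]) <= p[0] <= max(a[0], b[0])
    within_y = min(a[1], b[1]) <= p[1] <= max(a[1], b[1])
    return cross == 0 and within_x and within_y

segments = list(zip(PATH, PATH[1:]))
covered = {d for d in DOTS for a, b in segments if on_segment(d, a, b)}

print(len(segments), covered == DOTS)
```

The check reports four segments and confirms that together they pass through all nine dots.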

People may be endowed unequally with confidence, creativity, and power of concentration, but all of these are trainable skills. It’s possible to practice them and improve them, but in order to do so, you will need to see lots of creativity in action and you need lots of open-ended opportunity to experiment.

The seven steps of problem solving are: identify a problem, define goals, brainstorm, consider alternatives, agree on the solution, execute the solution, and evaluate the outcome.

Good problem solvers generally have less drama in their lives and react less emotionally to difficulties and thus are able to follow the steps of problem-solving more coherently.

Become a better problem solver by stimulating the brain with mathematical problems and games, removing drama from your life, and following the steps of problem solving while allowing for random clues to appear.

Solving problems gets better with experience and team leaders must lead by offering solutions and roads to solutions which arise from experience.

In this full lecture, discover methods that teach you to visualize numbers in a whole new light.

Taught by Professor James Tanton, Ph.D.

Let’s say you buy a lottery ticket; what are the chances that you’re going to be rich for the rest of your life?

You walk across a golf course on a stormy day; what are the chances you’ll be hit by lightning?

What are the chances that your investments will allow you to live happily for the rest of your days?

You have a fever; you have a cough. What are the chances that it’s a serious disease rather than something trivial?

This is a transcript from the video series What Are The Chances? Probability Made Clear. Watch it now, on The Great Courses Plus.

All these are real-life examples of situations where we’re confronted with possibilities whose outcomes we do not know. In fact, I would argue that many or most parts of our lives—and the world and trying to understand the world—involve situations where we don’t know what’s going to happen. They involve the uncertain and the unknown.

It would be nice to say, “Well, our challenge in life is to get rid of uncertainty and be in complete control of everything.” That is not going to happen. One of life’s real challenges is to deal with the uncertain and the unknown in some sort of an effective way; and that is the realm of probability.

Probability accomplishes the really amazing feat of giving a meaningful numerical description of things that we admit we do not know, of the uncertain and the unknown. It gives us information that we actually can act on.

For example, when we hear there’s an 80% chance of rain, what do we do? We take an umbrella. Of course, if it doesn’t rain, we say, “Well, there was a 20% chance it wouldn’t rain. That’s okay.” If it rains, we say, “Oh, yes, the prediction was right. There was an 80% chance of rain.”

Probability is a rather subtle kind of a concept because it can come out one way or the other, and still a probabilistic prediction can be viewed as correct—but decisions made on probability have all sorts of ramifications.

In the case of the rain, all we risk is getting wet. But in many areas of making decisions on the basis of probability, there are very serious consequences. When we make medical decisions, for example, we are making decisions that are based on probabilities, and yet they have extremely serious consequences, including life and death consequences.

Back before probability was viewed as being as commonplace as it is today, between 1750 and 1770, there was a smallpox epidemic in Paris for which an inoculation was developed. Unfortunately, the inoculations were rather risky. They reckoned that there was a 1 in 200 chance of death from taking the inoculation, but on the other hand, there was a 1 in 7 chance of dying eventually from the disease. So making that kind of decision is a very dramatic question where we’re weighing probabilities.

If you took that inoculation and you died immediately from smallpox, did you make the right decision or not? Well, of course, you don’t want to be among the 1 in 200 who died from the inoculation. On the other hand, on the basis of probability, it was the right decision. There are many controversies about this kind of thing, and in today’s world, with lawsuits and all, this would be a very serious kind of issue to take on.
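To see why the odds favored inoculation, it can help to put the two quoted risks side by side. This is a rough sketch, not a historical model: the cohort of 1,400 people is hypothetical (picked so both results are whole numbers), and it assumes, for simplicity, that an inoculated person faces only the inoculation risk.

```python
from fractions import Fraction

p_death_inoculation = Fraction(1, 200)   # figure quoted in the text
p_death_smallpox = Fraction(1, 7)        # figure quoted in the text

cohort = 1400  # hypothetical group size, chosen so both results are whole numbers

# Simplifying assumption: inoculation removes the later smallpox risk entirely.
deaths_if_all_inoculated = cohort * p_death_inoculation
deaths_if_none_inoculated = cohort * p_death_smallpox
risk_ratio = p_death_smallpox / p_death_inoculation

print(deaths_if_all_inoculated, deaths_if_none_inoculated, float(risk_ratio))
```

Under these assumptions the comparison is 7 expected deaths against 200, so the disease is roughly 29 times as dangerous as the inoculation, even though any one inoculated person could still be among the unlucky 7.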

Well, in many arenas of life, our understanding of the world comes down to understanding processes and outcomes that are probabilistic in nature, that really come about from random chance, that things are happening by randomness alone. Over the last couple of centuries, the scientific descriptions of our world increasingly have included probabilistic components in them.

Many aspects of physics involve questions of probability: molecules causing things to happen by the aggregate force of probabilistic occurrences, quantum mechanics, thermodynamics. At the very foundations of our knowledge of these fields is the theory of probability.

Biology, genetics, and evolution are all based very centrally on random behavior, as well. In fact, in all of these areas, the goal is to make definite, predictable, measurable statements about what’s going to happen that are the result of describing random behavior.

The description of random behavior is how we, as scientists and mathematicians, define the world. This is a major paradigm shift in the way science has worked for the last 150 years. As time goes on, there continues to be an increase in the role of probability and randomness at the center of scientific descriptions.

Probability gives us a specific statement about what to expect when things happen at random. But how can it be effective when, by definition, random outcomes of one trial or one experiment are completely unknown? Well, if you repeat those trials many, many times and look at them in the aggregate, that’s when you begin to see glimpses of regularity. It’s the job of probability to put a meaningful numerical value on the things that we admit we don’t know.
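The "glimpses of regularity" in repeated trials are easy to see in a simulation. The following is a minimal sketch using a simulated fair coin; the seed and the trial counts are arbitrary choices made for illustration.

```python
import random

def heads_frequency(n_flips: int, rng: random.Random) -> float:
    """Fraction of heads in n_flips simulated fair-coin tosses."""
    heads = sum(rng.random() < 0.5 for _ in range(n_flips))
    return heads / n_flips

rng = random.Random(2024)  # fixed seed so the run is repeatable
for n in (10, 100, 10_000, 100_000):
    # With few flips the fraction can wander; with many it settles near 0.5.
    print(n, round(heads_frequency(n, rng), 4))
```

No single flip is predictable, but the fraction of heads over many flips tends to stay close to one half.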

Probability is a numerical measure of how likely an outcome is when one takes part in a chance process such as rolling dice or choosing an item at random off a menu.

There are many types of probability: experimental, theoretical and subjective probabilities are the most commonly understood.

A coin toss is the most common example of probability. With a 50 percent chance of either heads or tails, we see both sides of the issue.

Simple probability is the probability of a single outcome, such as a particular roll of the dice or a particular item chosen at random from a menu.

When used in statistics, the word population refers to the entirety of the collection of people or things that are of interest. A sample is a subset of the total population.

In general, the goal is to infer information about the whole population from information about the sample. In other words, it’s not in our interest to know only about the people who are asked in the sample. What we’re really interested in is those aspects of the entire population.

This is a transcript from the video series Meaning from Data: Statistics Made Clear. Watch it now, on The Great Courses Plus.

If you choose the sample randomly, the advantage is that using probability you can make inferences about how well the opinions of the sample do, in fact, represent the opinions of the whole population.

On the other hand, if you intentionally choose certain groups to reflect what you believe to be reflective of reality, you may bring your own biases to the selection process, and those biases are then going to be reflected in the people whom you ask. Representative of the whole population means that the sample should have the same characteristics that the whole population does.

The whole concept of choosing the sample randomly is that you have a better chance that the proportion of people in the sample with a certain opinion will be, in fact, the same as the entire population.

The most familiar occasion where this comes up is before an election, when pollsters try to find out what proportion of the voters will vote for the Democratic candidate and what proportion will vote for the Republican candidate.

There are several major pitfalls in the way sampling can be done. In the 1936 U.S. presidential election, the two primary contenders for the presidency were the incumbent, Franklin Delano Roosevelt, and the Republican opponent, Alfred Landon. At the time, the magazine *The Literary Digest* had for several elections conducted polls to predict who would win the coming election. They had successfully predicted the outcomes in several elections, so this was a major poll.

In the 1936 election, *The Literary Digest* sent out 10 million voting surveys, and they received 2.4 million replies. Based on those surveys, *The Literary Digest* predicted that Landon would win in a landslide, with 370 electoral votes to Roosevelt’s 161.

Well, you may not recall reading about President Landon in your American history books. Obviously he did not win the presidency.

In fact, the only correct aspect of *The Literary Digest*’s prediction was that the election was a landslide, but unfortunately for them, the landslide was the other way. Roosevelt won the election with 62 percent of the popular vote and by an incredible 523 electoral votes to 8 for Landon.

Obviously, *The Literary Digest*’s sampling method was not representative of the whole population.

What went wrong? Well, one thing was that *The Literary Digest* got their samples from several different kinds of lists. One list was the subscribers to their own magazine. They also looked at car registration records, and that was an available list of a lot of names, and they sent their surveys to those people. They also used telephones.

The year 1936 was in the middle of the Great Depression, and many people were having financial problems and were cutting back on their budgets. Probably one of the first things to go in tight times would be one’s subscription to *The Literary Digest*. In addition, not many people owned cars or telephones. These were luxury items for a lot of people in 1936. Because of this, the people to whom *The Literary Digest* had sent their survey were likely wealthy people and obviously their opinions were not representative of the population at large.
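A toy simulation shows how a skewed list of names can flip a result. Every number below is invented for illustration (the share of wealthy voters and how each group leans); none of it is historical data. The point is only the mechanism: sampling from the wealthy alone versus sampling from everyone.

```python
import random

rng = random.Random(1936)  # fixed seed so the run is repeatable

# Hypothetical electorate: all of these figures are made up for illustration.
N = 100_000
P_WEALTHY = 0.25            # share with a car, telephone, or subscription
P_LANDON_IF_WEALTHY = 0.65  # assumed lean among the wealthy
P_LANDON_IF_NOT = 0.30      # assumed lean among everyone else

population = []
for _ in range(N):
    wealthy = rng.random() < P_WEALTHY
    p_landon = P_LANDON_IF_WEALTHY if wealthy else P_LANDON_IF_NOT
    population.append((wealthy, rng.random() < p_landon))

def landon_share(voters):
    """Fraction of the given voters who favor Landon."""
    return sum(votes_landon for _, votes_landon in voters) / len(voters)

true_share = landon_share(population)
biased_frame = [v for v in population if v[0]]  # only the wealthy are on the lists
biased_share = landon_share(rng.sample(biased_frame, 2_000))
random_share = landon_share(rng.sample(population, 2_000))

print(round(true_share, 3), round(biased_share, 3), round(random_share, 3))
```

In this made-up electorate Landon is well behind overall, yet the sample drawn from the wealthy-only lists shows him comfortably ahead, while a random sample of the same size lands close to the true share.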

*The Literary Digest* poll’s second pitfall was that it was a voluntary response survey.

The magazine sent out all these surveys, and only some people replied. The problem with this is that sometimes people who send back replies have a particular bias. Instead of sending back replies in the same proportion, maybe some people with a certain opinion are more apt to reply. The bias that can come from voluntary responses may not just give an answer that’s a little off, but it can give a completely erroneous view of reality.

Because of this story, *The Literary Digest*, which otherwise would simply be lost in the dustbin of history, will now live on forever in statistics textbooks as a great example of bias in sampling.

A success that came from this *Literary Digest* fiasco is the story of George Gallup.

At the time, Gallup was a young statistician just starting out, and he did his own poll for the 1936 election. He took a survey of 50,000 people and made two predictions of his own for the election.

- He correctly predicted that Roosevelt would win the election.
- He also predicted that *The Literary Digest* poll would be wrong and estimated how wrong they would be before their poll came out.

He was one of the people who introduced the concept of randomness in political polling as a key feature of sampling techniques. That is absolutely one of the fundamental criteria to look for when you’re evaluating whether a sample survey is, in fact, a good one.

Randomness is a basic ingredient of essentially all of the standard statistical techniques, and the reason is that the analysis of randomness and probability is what allows us to apply mathematics to understanding the results that we get.

The most basic way to get an accurate sample is to take what’s called a simple random sample. As the name implies, you take the entire population you’re interested in, decide how many people you want to survey, randomly select that many from the group, and then get the answer from each member of the selected sample.

Of course, there are lots of problems in getting the answer from that selected sample. But the simple random sample is the gold standard for finding a representative sample.
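Here is what drawing a simple random sample might look like in code. It is a sketch under stated assumptions: the population of 50,000 with 40 percent holding some opinion is hypothetical, the helper names are made up, and the margin of error uses the standard normal approximation from introductory statistics rather than anything specific to this lecture.

```python
import math
import random

def simple_random_sample(population, n, rng):
    """Pick n members so that every member is equally likely to be chosen."""
    return rng.sample(population, n)

def proportion_with_margin(sample):
    """Sample proportion of True values and an approximate 95% margin of error."""
    p = sum(sample) / len(sample)
    margin = 1.96 * math.sqrt(p * (1 - p) / len(sample))
    return p, margin

rng = random.Random(7)  # fixed seed so the run is repeatable

# Hypothetical population: 50,000 people, 40% of whom hold a given opinion.
population = [True] * 20_000 + [False] * 30_000
sample = simple_random_sample(population, 1_000, rng)
p_hat, moe = proportion_with_margin(sample)

print(round(p_hat, 3), round(moe, 3))
```

Because the selection is random, probability tells us how far the sample proportion is likely to stray from the population value, which is exactly the inference a hand-picked sample cannot support.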

Political polling is used both to predict a campaign’s results and to give a candidate or supporters of that candidate a metric by which to measure the candidate’s results. The first poll, called a benchmark, helps a candidate to design a campaign strategy by identifying such factors as the candidate’s overall popularity, the demographics of the people most likely to vote for that candidate, and the issues that matter most to the candidate’s main audience.

An exit poll is conducted after an election. As voters leave the polling station, reporters ask them who they voted for. This is used to predict election results, since the votes can sometimes take a few days to count.

Polling data is used by a political candidate to gather information about how well he/she is resonating with potential voters at the start of and during a campaign. It provides the candidate with information such as the demographic features of the individuals who would most likely vote for the candidate and allows the candidate to test the popularity of various messages. His/her overall approval rating is also a good indicator of whether or not it is worth staying in the campaign, because running a political campaign is very expensive.

Push polls are intended to “push” an issue to the forefront of the voter’s mind. For example, a push poll might ask potential voters to evaluate candidates based on their support of healthcare.

Which would you say is bigger: the complete works of Shakespeare or an ordinary DVD? The complete works of Shakespeare fit in a big book, of roughly 10 million bytes. But any DVD, or any digital camera, for that matter, will hold upwards of four gigabytes, which is 4 billion bytes. A DVD is 400 times bigger. All the printed words in the Library of Congress would be 10 trillion bytes, 10 terabytes. That’s one very large wall full of DVDs, but it’s also about the size of a single high-end personal hard drive. That is, you might carry all the books in the Library of Congress on a single device the size of just one book.

And data is not merely being stored: We access a lot of data over and over. Google alone returns to the web each day, to process another 20 petabytes. What’s that? It’s 20,000 terabytes, 20 million gigabytes, 20 quadrillion bytes. How big do you want to go? Google’s daily processing gets us to one exabyte every 50 days. And 250 days of Google processing may be equivalent to all the words ever spoken by humankind to date, which have been estimated at five exabytes. And nearly one thousand times bigger is the entire content of the World Wide Web, estimated at upwards of one zettabyte, which is 1 trillion gigabytes. That’s 100 million times larger than the Library of Congress. Of course, there is a great deal more that is not on the web.
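The unit conversions in the last two paragraphs are easy to lose track of, so here is a quick arithmetic check. It uses decimal units (1 gigabyte = 10^9 bytes) and takes the sizes quoted in the text at face value; the variable names are just labels for illustration.

```python
# Decimal byte units, as used in the passage.
MB, GB, TB, PB, EB, ZB = (10 ** k for k in (6, 9, 12, 15, 18, 21))

shakespeare = 10 * MB            # complete works, figure from the text
dvd = 4 * GB
library_of_congress = 10 * TB
google_per_day = 20 * PB
all_words_spoken = 5 * EB
web = 1 * ZB

print(dvd // shakespeare)                   # DVD compared with Shakespeare
print(EB // google_per_day)                 # days for Google to process one exabyte
print(all_words_spoken // google_per_day)   # days to match all words ever spoken
print(web // library_of_congress)           # web compared with the Library of Congress
print(web // GB)                            # gigabytes in one zettabyte
```

The results are 400, 50, 250, 100 million, and 1 trillion, matching the figures quoted above.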

This is a transcript from the video series Big Data: How Data Analytics Is Transforming the World. Watch it now, on The Great Courses Plus.

But let’s turn to the velocity of data. Let’s start a clock, to see what this feels like. Not only is there a lot of data, it’s coming at very high rates. High-speed Internet connections offer speeds 1,000 times faster than dial-up modems connected by ordinary phone lines. Here are some things that are happening every minute of the day. YouTube users upload 72 hours of new video content. In the United States alone, there are 100,000 credit card transactions. Google receives over 2 million search queries. And 200 million email messages are sent. It can be hard to wrap one’s mind around such numbers. How much data is being generated? Let’s turn to Facebook. In only 15 minutes, the number of photos uploaded to Facebook is greater than the number of photographs stored in the New York public photo archives. That’s every 15 minutes! Now think about the data over a day, a week, or a month.

Finally, there is variety. One reason for this can stem from the need to look at historical data. But data today may be more complete than data of yesterday. The cost of a gigabyte in the 1980s was about a million dollars. So, a smartphone with 16 gigabytes of memory would be a $16 million device. Today, someone might comment that 16 gigabytes really isn’t that much memory. This is why yesterday’s data may not have been stored, or may not have been stored in a suitable format, compared to what can be stored today. Now, consider satellite imagery. The images come in a large variety of aspect ratios. While I know that a satellite image will contain pixels, I don’t necessarily know what is in the picture, or not in the picture. I don’t necessarily know where to look. I may not even know what to look for.

So, we stand in a data deluge that is showering large **volumes** of data at high **velocities** with a lot of **variety**. With all this data comes information and with that information comes the potential for innovation. Steve Jobs, charismatic co-founder of Apple, was diagnosed with pancreatic cancer in 2003. He became one of the first people in the world to have his entire DNA sequenced, as well as that of his tumor. It cost him a six-figure sum but now he had his entire DNA. Why? When doctors pick medication, they hope the patient’s DNA is sufficiently similar to that of the patients in the drug trial. Steve Jobs’s doctors knew his genetic makeup and could carefully pick treatments. When one treatment became ineffective, they could move to another. While Jobs eventually died from his illness, having all the data and all that information added years to his life.

We all have immense amounts of data available to us every day. Search engines almost instantly return information on what can seem like a boundless array of topics. For millennia, humans have relied on each other to recall information. The Internet is changing that and how we perceive and recall details in the world. Human beings tend to distribute information through what is called a transactive memory system, and we used to do this by asking each other. Now, we also have lots of transactions with smartphones and other computers. They can even talk to us. In a study covered in *Scientific American*, Daniel Wegner and Adrian Ward discuss how the Internet can deliver information quicker than our own memories can. Have you tried to remember something and meanwhile a friend types it into a smartphone, gets the answer, and if it is a place already has directions? In a sense, the Internet is an external hard drive for our memories.

So, we have a lot of data, with more coming. We aren’t just interested in the data; we are looking at data analysis, and we want to learn something valuable we didn’t already know. For example, UPS must decide on a delivery route for packages to save time and gas. Consider 20 drop-off points; which route is the best? Seems simple enough, but checking all possible routes isn’t that easy. You have 20 choices for the first stop, 19 for the second, and so forth. In all, there are about 2 times 10 to the 18th power possible routes. How big is that number? That’s roughly five times the estimated age of the universe measured in seconds. Clearly, we aren’t checking that number of combinations on a computer each time a driver needs a route. Keep in mind, that’s only 20 stops.
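The route count is just 20 × 19 × 18 × … × 1, which is 20 factorial. The sketch below computes it and compares it with the age of the universe expressed in seconds. The age used here (about 13.8 billion years) is an assumed round figure, not one given in the lecture.

```python
import math

stops = 20
routes = math.factorial(stops)   # 20 * 19 * ... * 1 possible orderings of the stops

# Assumed figures, not from the text: universe age of about 13.8 billion years,
# with 365.25 days per year.
universe_age_seconds = 13.8e9 * 365.25 * 24 * 3600

print(f"{routes:,}")
print(routes / universe_age_seconds)
```

The count comes to 2,432,902,008,176,640,000, and the ratio falls between 5 and 6: even checking one route every second since the universe began would not get through the list.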

UPS has about 55,000 drivers every day. Until recently, UPS drivers had a general route to follow. It allowed for decisions on the part of the driver. UPS now has a program called ORION, or On-Road Integrated Optimization and Navigation, to help. It uses math to decide on routes. They can be counterintuitive but save time in the end. It doesn’t find *the* best route, but a lot of research has been done to find good solutions to this problem. Keep in mind, UPS has a harder problem than simply finding a route to save time. They also must consider other variables like promised delivery times. How much can this save? Consider these two numbers. Thirty million dollars: that’s the cost to UPS per year if each driver drives just one more mile each day than necessary. Eighty-five million: the number of miles the analytics tools of UPS are saving per year. Data analysis doesn’t always involve exploring a data set that is given. Sometimes, questions arise and data hasn’t even been gathered. Then, the key is knowing what question to ask, and what data to collect.

As an example, let’s join Oren Etzioni on a flight from Seattle to Los Angeles for his younger brother’s wedding. Wanting to save money, Oren bought his ticket months before the “I dos” were said. During the flight, Oren asked neighboring passengers about their ticket price. Most had paid less, even though many had bought their tickets later. For some of us, this might simply tell us not to worry so much about choosing close to the date of a flight. But Oren was Harvard’s first undergraduate to major in computer science. He graduated in 1986. To him, this was a problem for a computer to solve. He’d seen the world this way before. He helped build MetaCrawler, which was one of the first search engines. InfoSpace bought it. He made a comparison-shopping website, also snatched up. Another startup was bought by Reuters.

So, Oren gathered 12,000 price observations grabbed by his computer programs from a travel website over 41 days. He ended up with something that could save customers money, and not just by comparing current prices. It didn’t know why airlines were pricing the way they did, but it could help predict whether fares were more likely to go up or down in the near future. When it became a venture capital-backed startup called Farecast, it began crunching 200 billion flight-price records. Then? Microsoft bought it in 2008, for $110 million, and integrated it into the Bing search engine. What made it possible to predict future fares? Data, and lots of it. How big and what’s big enough depends, in part, on what you are asking and how much data you can handle. Then, you must consider how you can approach the question. UPS can’t look for the optimal answer. But they can save millions of dollars finding much better answers. Again, they can do this by asking questions only answerable with the data that is streaming in and available in today’s data explosion.

An example of **Big Data** is the aggregation of **petabytes** or millions of personal records of people containing multiple pieces of information pertaining to their identity.

**Big Data** is largely used to get to know a person from the inside out to **understand behavior** in an effort to better sell them things.

**Big Data** is thought to have **four V’s **that pertain to its usefulness. The four V’s are velocity, veracity, variety, and volume.

**Big Data** can be characterized in many ways; however, at the most basic level, data is either **structured, unstructured or semi-structured**. This categorization will determine how much work must go into understanding and using it.


Imagine an experiment in randomness. Take a coin and flip it 200 times, and each time record whether it’s a heads or a tails, putting down Hs for the heads and Ts for the tails. Now, suppose you ask a person to just write down a random list of 200 Hs and Ts, and you put up both lists on a blackboard, one made by actually flipping a coin, and the other made by a human. Even though they may both look like an ocean of Hs and Ts, there is a way to tell which one is truly random, and which is human generated.


The thing to do is look for long streaks of all Hs or all Ts in a row. In the 200 Hs and Ts generated by randomly flipping a coin, you might see at least four or five long streaks: six Hs in a row here, five Ts there, a lot of streaks of many things in a row.


Now consider the list generated by the human being. How often will a person write the same letter more than four times in a row when trying to be random? Well, we sort of resist this, because we don’t think that looks very random. We think we’ve got to sort of alternate (H, T, H, T), and so in a human-generated list you would see very few long strings of Hs or Ts in a row.

This is a transcript from the video series What Are the Chances? Probability Made Clear. Watch it now, on The Great Courses Plus.

As a matter of fact, when you flip a coin 200 times, the probability of having at least one string of six or longer of Hs or Ts is roughly 96 percent—very likely. The probability of having at least one string of five is 99.9 percent—it’s essentially certain. You’d be very unlikely to flip a coin that many times without getting these long strings, and if you actually simulate this on the computer, you’ll see that this plays out, that you just almost always get long strings.
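The streak probabilities quoted above can be checked without simulation. The following is a minimal sketch, assuming a fair coin and independent flips; the function name `prob_run_at_least` is made up for illustration. It tracks the probability of each possible current streak length, one flip at a time, and discards any sequence as soon as it reaches the target streak.

```python
def prob_run_at_least(n_flips, run_len):
    """Probability that n_flips fair coin flips contain a streak of at
    least run_len identical outcomes (all heads or all tails).
    Assumes run_len >= 2 and n_flips >= 1."""
    # state[j] = probability that no long streak has occurred so far
    # and the current streak has length j + 1.
    state = [0.0] * (run_len - 1)
    state[0] = 1.0  # after the first flip the streak has length 1
    for _ in range(n_flips - 1):
        new_state = [0.0] * (run_len - 1)
        # The next flip differs from the last one: streak resets to 1.
        new_state[0] = 0.5 * sum(state)
        # The next flip matches the last one: streak grows by 1.
        for j in range(run_len - 2):
            new_state[j + 1] = 0.5 * state[j]
        # Sequences whose streak would reach run_len are dropped here.
        state = new_state
    return 1.0 - sum(state)


if __name__ == "__main__":
    print(f"streak of 6 or more in 200 flips: {prob_run_at_least(200, 6):.4f}")
    print(f"streak of 5 or more in 200 flips: {prob_run_at_least(200, 5):.4f}")
```

Run as a script, this prints the two probabilities for 200 flips, which can be compared with the rounded figures in the text.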

One of the common misconceptions that a lot of people have about randomness is illustrated by the coin flipping experiment. Let’s say that you flip a coin many times, and just randomly it happened that 10 times in a row you got heads. Well, doesn’t it seem like the next time it’s more apt to be a tails? It does to most people. And the answer is, of course, that the coin doesn’t know what it’s just done. To the coin, every flip is a new flip, and after 10 heads in a row it’s just as likely to come up heads as tails, exactly as it would be if none of those flips had happened.


To demonstrate this, you can simulate the following experiment. Take a coin, and more than a million times, you flip the coin 11 times. Obviously you do this with a computer. Computers are great, by the way; they don’t care. A million times, they’ll just go ahead and do it. So you just do it a million times, and what do you get? To make it easy, you actually repeat the 11-flip experiment 1,024,000 times, because the probability of getting 10 heads in a row is 1 in 1,024. In other words, if you repeat the experiment 1,024,000 times, and each time you flip the coin 11 times, you expect that the first 10 flips will all be heads about 1,000 times.


So you run the computer simulation a first time, and the number of times you get 10 heads in the first simulation is 1,008: extremely close to 1,000. What happened to the 11th coin? Well, 521 times it turned out to be a head also, and 487 times it turned out to be a tail. There’s no memory. Approximately half the time heads, half the time tails.

If you do it again, the first 10 might be heads 983 times, and then the 11th flip heads 473 times and tails 510 times. During a third experiment, 1,031 times it came out heads 10 times in a row, and of those, 502 had the next coin be a heads, and 529 a tails. The coin has no memory. After it’s gotten 10 heads in a row, it’s just as likely to be heads the next time as it was the first time you flipped that coin.
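The experiment described above is easy to reproduce. Here is a minimal sketch, assuming Python’s built-in pseudorandom generator is an acceptable stand-in for a fair coin; the function name `simulate` and the seed value are illustrative choices, not part of the original lecture. Each trial draws 11 random bits, treats the low 10 bits as the first 10 flips and the top bit as the 11th flip.

```python
import random


def simulate(trials=1_024_000, seed=0):
    """Repeat the 11-flip experiment `trials` times.

    Returns (ten_heads, eleventh_heads, eleventh_tails): how often the
    first 10 flips were all heads, and how the 11th flip came out in
    those cases."""
    rng = random.Random(seed)
    first_ten_mask = (1 << 10) - 1  # low 10 bits = first 10 flips
    ten_heads = 0
    eleventh_heads = 0
    for _ in range(trials):
        bits = rng.getrandbits(11)  # 11 fair flips, 1 = heads
        if (bits & first_ten_mask) == first_ten_mask:
            ten_heads += 1
            if bits >> 10:  # top bit = 11th flip
                eleventh_heads += 1
    return ten_heads, eleventh_heads, ten_heads - eleventh_heads


if __name__ == "__main__":
    for seed in range(3):
        total, heads, tails = simulate(seed=seed)
        print(f"10 heads in a row: {total}; 11th flip heads: {heads}, tails: {tails}")
```

The exact counts depend on the seed, so they will differ from the figures quoted in the text, but the count of 10-head runs should land near 1,000 and the 11th flip should split roughly in half.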

There is another counterintuitive aspect of probability, and it’s really interesting to think about what is rare, and how we view rarity in probability. Suppose you got dealt the following hand: the two of spades, the nine of spades, the jack of clubs, the eight of spades, and the five of hearts. Well, it probably doesn’t strike you as an impressive hand, one you write home about, but it is. One out of 2,598,960—that’s the probability of getting that hand.

Now if you were dealt the ace, king, queen, jack, ten of spades—a royal flush in spades—what’s the probability of getting this royal flush in spades? Exactly the same—1 out of 2,598,960—and yet you would write home to your mother about this hand for sure. Your previous hand was just an average hand, and yet in your whole life of playing cards, you know what? You will probably never get that hand again, because its probability is almost zero—1 out of 2,598,960. So this is one of the counterintuitive concepts of probability: that rare events happen all the time, but you may not recognize them as significant.
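The figure 2,598,960 is the number of distinct five-card hands that can be dealt from a standard 52-card deck, when the order of the cards does not matter. A short sketch of the arithmetic follows; the function name `hand_probability` is made up for illustration.

```python
from math import comb


def hand_probability(deck_size=52, hand_size=5):
    """Probability of being dealt one specific hand, order ignored."""
    return 1 / comb(deck_size, hand_size)


if __name__ == "__main__":
    print(comb(52, 5))         # number of distinct five-card hands
    print(hand_probability())  # chance of any one specific hand
```

Every specific hand, ordinary or royal flush, is one of these equally likely combinations, which is why the two probabilities in the text are identical.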


Rare events absolutely happen by chance alone. The most-common rare event that you see mentioned in the newspapers every day is the lottery. The probability of winning the Powerball lottery is approximately 1 out of 146,000,000. This is the big multistate lottery in some states. One out of 146,000,000. That chance is so remote you’d think it would never happen; but it happens regularly. Why? Because a lot of people try. A lot of people buy random numbers and some of them then occasionally win. If you try something that’s rare often enough, then it will actually come to pass.
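To see why a 1-in-146,000,000 event happens regularly, it helps to work out the chance that at least one ticket wins when many are sold. The sketch below assumes every ticket is an independent random pick, which is a simplification, and the ticket counts are hypothetical values chosen only for illustration.

```python
import math

P_WIN = 1 / 146_000_000  # per-ticket odds quoted in the text


def prob_at_least_one_winner(tickets, p=P_WIN):
    """Chance that at least one of `tickets` independent random
    tickets hits the jackpot: 1 - (1 - p) ** tickets."""
    # log1p/expm1 keep precision when p is tiny.
    return -math.expm1(tickets * math.log1p(-p))


if __name__ == "__main__":
    for tickets in (1, 10_000_000, 146_000_000, 500_000_000):  # hypothetical sales
        print(f"{tickets:>11,} tickets: {prob_at_least_one_winner(tickets):.4f}")
```

With as many tickets as there are possible outcomes, the chance of at least one winner is already about 63 percent under these assumptions.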

This concept—that rare things will actually happen if you repeat them enough and you look for them enough—was encapsulated in an observation that was first made by the astronomer Sir Arthur Eddington in 1929, and he was describing some features of the second law of thermodynamics. He wrote the following:

If I let my fingers wander idly over the keys of a typewriter it might happen that my screed made an intelligible sentence. If an army of monkeys were strumming on typewriters they might write all the books in the British Museum. The chance of their doing so is decidedly more favourable than the chance of the molecules returning to one half of the vessel.

However, you can find patterns in random writing, and in fact an enterprising author made a lot of money a few years ago when he wrote *The Bible Code*. What the author of *The Bible Code* did was take the Bible, written in Hebrew, and look for words spelled out by skipping a fixed number of letters between each letter of the word. One example was “Atomic holocaust Japan 1945.” He said that this was an example of how the Bible showed the future.

The truth is that this is just a matter of probability. If you take all possible sequences of different lengths, you can by randomness alone find surprising things, and just to demonstrate it, people debunking this analysis found patterns in *War and Peace* and so on. This is another challenging part of probability, namely that if you look for rare things but you have a lot of places to look, you’ll tend to find them.
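The skip-pattern search described above can be sketched in a few lines. This is an illustrative version only, not the procedure used in *The Bible Code*: it works on any text, ignores everything except letters, and the function name `find_skip_words` and the `max_skip` limit are assumptions made for the example.

```python
def find_skip_words(text, word, max_skip=50):
    """Find places where `word` is spelled out by reading every
    skip-th letter of `text`. Returns a list of (start, skip) pairs,
    with positions counted over letters only."""
    letters = [c.lower() for c in text if c.isalpha()]
    word = word.lower()
    if not word:
        return []
    hits = []
    for skip in range(1, max_skip + 1):
        span = skip * (len(word) - 1)  # distance from first to last letter
        for start in range(len(letters) - span):
            if all(letters[start + i * skip] == word[i] for i in range(len(word))):
                hits.append((start, skip))
    return hits


if __name__ == "__main__":
    sample = "A xylophone, big and extra-clean."
    print(find_skip_words(sample, "abc", max_skip=20))
```

Because every starting position and every skip distance is a separate chance to match, a long text offers an enormous number of places to look, which is the point the paragraph above makes about finding rare things.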

These are some of the challenges of looking at and asking what is random in the world.

In probability, randomness refers to events that occur in no apparent order and are not causally related.

True randomness means that something unfolds purely by chance rather than intentionality, free from human interference.

Cryptography, gambling, statistical sampling, and computer simulation are all purposes for using a random number generator.

Many people claim that they can “outsmart” the lottery or predict winning combinations. People even sell tools to this aim, but these tools are most likely a waste of money. To the best of anyone’s knowledge, the process of choosing the winning lottery numbers operates on the principle of randomness.


With only 10 symbols, we have the machinery to describe new numbers that grow beyond our imagination. Here, we’ll explore the origins of zero and the development of our modern decimal system. With a powerful positional number system in place, humankind was finally equipped with the tools necessary to begin the development of modern mathematics.

Let’s begin with the downside of the ancient additive systems. Most of the systems required the repetition of symbols. For example, the Roman numerals XXIII equal 23, and they’d add up the two Xs (10 each) and then the three Is, and get 23. The Babylonians used dovetails and nails, which they would add up. Although computation with the additive systems was fast using tools such as the abacus, those systems required a very long list of symbols to denote larger and larger numbers, and this was a problem in practice.
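The contrast between additive and positional notation can be made concrete with a small sketch. It assumes a purely additive reading of Roman symbols, as in the XXIII example, and so ignores subtractive forms such as IV; the function names are made up for illustration.

```python
ROMAN_VALUES = {"I": 1, "V": 5, "X": 10, "L": 50, "C": 100, "D": 500, "M": 1000}


def additive_value(numeral):
    """Additive reading: the value is the sum of every symbol."""
    return sum(ROMAN_VALUES[symbol] for symbol in numeral)


def positional_value(digits, base=10):
    """Positional reading: each digit's worth depends on its place."""
    value = 0
    for digit in digits:
        value = value * base + int(digit)
    return value


if __name__ == "__main__":
    print(additive_value("XXIII"), positional_value("23"))
    # An additive numeral can need far more symbols than its decimal form.
    long_numeral = "MMMDCCCLXXXVIII"
    print(len(long_numeral), "symbols for", additive_value(long_numeral))
```

In the additive scheme the order of symbols carries no place value and larger numbers need ever more symbols, whereas the positional scheme reuses the same ten digits at every scale.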

This is a transcript from the video series Zero to Infinity: A History of Numbers. Watch it now, on The Great Courses.

Additive systems made it difficult to look at more arithmetically complicated questions and thus slowed the progress of the study of numbers. In order to move to what we call a positional system, they needed a new number. This inspired a philosophical question: How many items do you see in an empty box? Is your answer a number? This is the question about zero. In the Rhind Papyrus from 1650 B.C.E., the scribe Ahmes referred to numbers as “heaps.” This tradition actually continued through the Pythagoreans, who in the 6th century B.C.E. viewed numbers as “a combination or heaping of units.”


Even Aristotle defined number as an accumulation or heap. Also, the word “three” derived from the Anglo-Saxon word *throp*, again meaning “pile” or “heap.” Well, because we can’t have a heap of zero objects—with zero objects, there would be no heaping at all—zero was not viewed as a number. So this notion of having zero be a quantity didn’t make any sense at all because they were thinking in terms of heaps. This lack of zero caused many challenges. A careless Sumerian scribe could cause ambiguities because, in cuneiform, different spacing between symbols can actually represent different numbers. The Egyptian system, on the other hand, did not require a placeholder like zero, but their additive notation was cumbersome. Again, they had all the symbols together, and they had to add them all up. As a result, in the 2,000 years of the Egyptian numeral system, they made very little progress in arithmetic or, more generally, in mathematics. It’s interesting to see how the notation really drives our understanding, our intuition, and our further quest to consider number.



Zero first appeared as an empty placeholder rather than a number. The Babylonians had a symbol for zero by 300 B.C.E. It was a placeholder rather than a number because, again, they were thinking in terms of heaps, but they needed to distinguish between numbers. The Mayans also had an eye-shaped symbol for zero that they also used only as a placeholder. The evolution of the symbol for zero is actually very difficult to chart. The modern symbol “0” may have arisen from the use of sand tables for calculation, on which pebbles would be placed and moved back and forth for addition or subtraction. When a pebble was removed, it would leave an indentation or a dimple in the sand, which resembles the “0” that we see today. In fact, calculations performed on the sand tables may have actually led to the development of the place-based number systems.


Later, in the 2nd century C.E., Ptolemy used the Greek letter omicron, which looks like an “O,” in fact, to denote “nothing.” So this is the symbol for zero, the “0” that we see—the circle. But I want to make it very clear that Ptolemy did not view this as a number, but merely as the idea of nothing. But you can see, again, that these things were slowly coming together. Zero as a number really occurred in India, most likely.

By the 7th century, the Indian astronomer Brahmagupta offered a treatment of negative numbers and actually understood zero as a number, not just as a placeholder. In fact, he actually studied 0 divided by 0, and 1 divided by 0, and he decided erroneously that 0 divided by 0 equals 0 but just didn’t know what to conclude about 1 divided by 0.

Here again we see a couple of things. First of all, we know today that we can’t divide by 0. If we divide by 0, it does not yield a number, so we leave the realm of number. So we can’t do that (no dividing by 0), and we learn that in school. But we also see a wonderful thing. Brahmagupta, this very important, great mind, was making a mistake, which again is something to be celebrated rather than to feel embarrassed about. He didn’t get it quite right. That’s okay; his contributions were enormous. So finally, humankind expanded its view of number to actually include and embrace zero.


A few words about this “nothing” number in terms of language: from the 6th to the 8th centuries, in Sanskrit there was “śūnya,” which meant “empty,” to represent zero as we think of it. By the 9th century, in Arabic there was “ṣifr.” By 13th-century Latin, there was “zephirum.” From 14th-century Italian there was “zefiro.” By 15th-century English, we have “zero.” So we can see the evolution of just that word.


Because of zero’s power in computation, some viewed it as mysterious and nearly magical. As a result, the word zero has the same origins as another word that means “a hidden or mysterious code,” and that word, of course, is “cipher.” We can see that “cipher” actually came from the mysterious qualities that zero possessed in the eyes of our ancestors.

While it was used as a placeholder for millennia before, **the number zero** is officially thought to have been invented by **Brahmagupta** around the year 628, though this is still mostly scholarly conjecture.

**The number zero** is an integer that sits on the number line between negative 1 and positive 1, and whether it counts as a **natural number** depends on the convention in use. However, as numbers are used to count and zero cannot count anything, it can also be considered not a number!

Technically, **the number zero** is neither positive nor negative, because it is neither greater than nor less than itself. However, in set theory **zero is in the set of non-negative numbers** while also not being in the set of positive numbers. Zero is unique.

**The number zero** does not hold a value. Zero is best thought of as a placeholder and a tool for extending **mathematics**.
