AlphaGo beats Lee Sedol in third consecutive Go game

March 13, 2016

Google’s DeepMind computer program wins $1m in victory marking significant development in artificial intelligence

3013_jpg_w_620_q_20_auto_format_usm_12_f

Go player Lee Sedol, seated right, reviews the game after losing to AlphaGo

Google’s AlphaGo computer program has won a third and decisive encounter with a top-ranked player of the Chinese board game Go in a victory marking significant developments in artificial intelligence.

Lee Sedol, who is the world’s second best player of the strategy game, lost three games in a row in Seoul this week, with the latest AlphaGo victory on Saturday handing Google the best-of-five match.

“I’ve never played a game where I felt this amount of pressure, and I wasn’t able to overcome this pressure,” Lee said at a post-game press conference.

Go has simple rules, but is highly intuitive and complex in practice. Mastering it has been an exceptionally difficult task for even the world’s best IT designers.

“We came here to challenge Lee, to learn from him and see what AG was capable of,” said Demis Hassabis, co-founder of Google’s artificial intelligence business, DeepMind, which created the program.

“AlphaGo controlled the momentum over more than four hours of gameplay, with Lee struggling to maintain territory against the program’s creative approach. Google DeepMind taught AlphaGo to recognise the optimal move in thousands of possible scenarios.”

AlphaGo’s dominance amounts to a significant, and much faster than previously expected, advance in artificial intelligence.

Google co-founder Sergey Brin, who was in Seoul to watch the third match, described Go as a beautiful game and said he was excited the company had been able to “instil that kind of beauty in our computers”.

Michael Redmond, one of the match’s commentators and a professional Go player, said some people initially doubted AlphaGo’s abilities. “After three matches and three straight victories, we are convinced,” he said.

AlphaGo won $1m in prize money, which Google DeepMind said would be donated to charities, including Unicef and Go organisations.

“AlphaGo controlled the game amazingly,” said Fan Hui, the European Go champion who was the first professional player to lose to the program when he played it in October.

Hui said the advances in artificial intelligence appeared to bode well for the future of the ancient game.

“We now have this new way of learning about Go. And look how many people are watching this now. More and more people are interested in Go now.”

Article source

March 13, 2016

It was history in the making, and considerably sooner than anyone expected, whether players or programmers. AlphaGo, the Go program by DeepMind, has effectively won a five-game match after a 3-0 start against one of the world’s best players, Lee Sedol.

...

In order to appreciate just how extraordinary an achievement AlphaGo is, a comparison with chess programs, which have been punishing the elite for decades now, is in order. What is so special or different about Go that somehow resisted programming efforts and genius for so long?

Go, which translates to “encircling game” is a game whose roots and history easily rival those of chess, with written records going back to the 4th century BC. It is played on a 19 x 19 grid, with each player placing stones on the board. Black moves first, and then white, with the pieces never moving from their squares, though they can be removed if captured. The goal, as the translation of its name implies, is to have surrounded a larger total area of the board with one's stones than the opponent by the end of the game.

This ultra-simplified introduction to Go is necessary to understand the complexities and challenges involved in programming it, compared to a game such as chess. Chess programming is dominated by the search and the evaluation function. The evaluation starts with the most fundamental aspect: the differing values of the pieces, while the search is about pruning down the number of moves to calculate and then looking ahead as many moves as possible to reach a quality decision. In Go, both of these are instantly problematic.

The search function in chess engines boils down to selecting a number of moves and steadily looking deeper and deeper. At the beginning of a chess game, White has twenty possible moves. After that, Black also has twenty possible moves. Once both sides have played, there are 400 possible board positions. Go, by contrast, begins with an empty board, where Black has 361 possible opening moves, one at every intersection of the 19 by 19 grid. White can follow with 360 moves. That makes for 129,960 possible board positions after just the first round of moves. It is easy to see that even with the most severe pruning techniques, a program would only be able to see ahead a few moves at best.

...

Next comes the evaluation function, or determining what constitutes a bad position or a good position, to choose between moves. In chess this starts with the value of the pieces, where the king is priceless (capture it and win), the queen is worth nine pawns, a rook is five, and so on. This is then tempered by various well-defined aspects such as isolated pawns, centralized knights, and so forth. Go begins with no difference in value of any of its pieces, and the board situations are so large and complex that simple rules such as a doubled-pawn makes no sense. Now that you understand why chess programming strategies have failed so abysmally in Go, what is the solution?

For a long time, there was none really, and as a result, until 2005-2006, the best programs in the world were weak amateurs at best.... Don’t think for an instant this was due to a lack of resources invested, since the promised payback was huge: tens of millions of Go players in East Asia who would line up to buy a strong program.

The change came with a French programmer Rémi Coulom. Coulom was a programming prodigy who at the age of ten, less than a year after receiving his first computer, had programmed Mastermind. In four years, he had created an AI that could play Connect Four. Othello followed shortly thereafter, and by 18, Coulom had written his first chess program. The Frenchman eventually earned a PhD for work on how neural networks and reinforcement learning can be used to train simulated robots to swim.

Coulom had exchanged ideas with a fellow academic named Bruno Bouzy, who believed that the secret to computer Go might lie in a search algorithm known as Monte Carlo. Rather than having to search every branch of the game tree, Monte Carlo would play out a series of random games from each possible move, and then deduce the value of the move from an analysis of the results.

While Bouzy was unable to make the idea work, Coulom hit upon a novel way of combining the virtues of tree search with the efficiency of Monte Carlo. He christened the new algorithm Monte Carlo Tree Search, or MCTS, and in January of 2006, his program Crazy Stone won its first tournament. He published his landmark concept in a paper that changed Go programs, setting a dividing point for programs before MCTS and those after.

This led to a revolution in Go programs that experienced a massive burst in strength, and now the very best version of Crazy Stone, running on a 64-core PC was able to hold its own against a pro, albeit 65 years old, with only a four-stone handicap in an exhibition game.

...

Where does AlphaGo come in the story? The problem with the Monte Carlo technique is that there was no obvious way forward. Doubling the CPU power does not lead to a significant increase in playing ability.

...

Given this understanding, then even if Google and DeepMind were to somehow get the world’s most powerful supercomputer behind a refined version of Crazy Stone, it might be a bit smarter, and a couple of moves deeper, but nothing a world-class player need concern himself with.... Somehow DeepMind had conjured up some magic that brought Go software from weak master at best, to world beater! The question is simply: how?

DeepMind Technologies, the developer of AlphaGo, was founded by British Artificial Intelligence researcher Demis Hassabis in 2010, and specialized in building general-purpose learning algorithms. Hassabis, a former chess prodigy in his own right, rated no.2 in the world in the under-14 category just 35 Elo behind Judit Polgar who was no.1, has always had a particular fascination for games, and not just chess. He won the world games championship a record five times, being also an expert in Shogi, poker and Diplomacy to name a few.

DeepMind Technologies's stated goal is to "solve intelligence", which they are trying to achieve by combining "the best techniques from machine learning and systems neuroscience to build powerful general-purpose learning algorithms". As opposed to other AIs, such as IBM's Deep Blue or Watson, which were developed for a pre-defined purpose and only function within their scope, DeepMind claims that their system is not pre-programmed: it learns from experience, using only raw pixels as data input. Technically it uses deep learning on a convolutional neural network, with a novel form of Q-learning, a form of model-free reinforcement learning. Their system has been tested on video games, notably early arcade games, such as Space Invaders or Breakout. Without altering the code, the AI begins to understand how to play the game, and after some time plays, for a few games (most notably Breakout), a more efficient game than any human ever could.

David Silver, lead project manager explains how AlphaGo came into existence: "AlphaGo is actually around two years old, if we have to give it an age. It was a research project with myself and Aja Huang, and Chris Maddison, an intern from Google Brain. We wanted to ask this question, whether a neural network using deep learning can actually learn to understand the game of Go well enough to play reasonably. And so this was a pilot research project. We tried some experimental things, we tried a whole bunch of ideas, and around a year ago we published a first paper on this result and we discovered that actually the neural network by itself could perform remarkably well. It could actually reach the level of an amateur dan-level player without any lookahead at all. Without even adding any search tree in. When I saw this result, I was really taken aback. I'm an amateur player myself. I'm not a very strong player, but I'm aware as a player of the importance of reading out situations, and I kind of found it mind-blowing that a neural network without any explicit reading of the positions would be able to understand a position well enough to reach amateur dan level. And at that time I felt that this was something with a lot of potential and I sat down and talked with Demis, the CEO of DeepMind, and I said 'I really think someone is going to take these deep learning techniques and actually achieve the highest levels of play. I think it's really going to happen. This is something that's in the cards now', and he said, 'let's make sure it's us.' And he really powered up the project."

The end-result was published on January 27, 2016, in a paper in the journal Nature, revealing not only the existence of AlphaGo, but its incredible results by then....

The ground-breaking paper started with the statement:

"We introduce a new approach to computer Go that uses value networks to evaluate board positions and policy networks to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte-Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte-Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs.”

As of March, a match with the legendary player Lee Sedol was organized with a prizefund of $1 million. By now, DeepMind has effectively won the match with an incredible start of 3-0....

ARTICLE SOURCE

March 14, 2016

Thread has been merged.

March 14, 2016

Most likely, Lee Sedol was not lost because of his lack of skills in Go, but rather, he was lost because of the weaker mental toughness as compared to a machine. In game 4, Lee Sedol was able to let go of his pressure and win the game.

Sign In

AlphaGo beats Lee Sedol in third consecutive Go game

Recommended Posts

Batu69

Link to comment

Share on other sites

vibranium

Link to comment

Share on other sites

Batu69

Link to comment

Share on other sites

oliverjia

Link to comment

Share on other sites

Archived

Recently Browsing 0 members

nsane.down

Latest News

Browse

Activity