Author: "Sutskever I" / Publication Type: Academic Journals - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Sutskever I"' showing total 3 results

Start Over Author "Sutskever I" Publication Type Academic Journals

3 results on '"Sutskever I"'

1. Mastering the game of Go with deep neural networks and tree search.

Author: Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, and Hassabis D
Subjects: Computers, Europe, Humans, Monte Carlo Method, Reinforcement, Psychology, Games, Recreational, Neural Networks, Computer, Software, Supervised Machine Learning
Abstract: The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses 'value networks' to evaluate board positions and 'policy networks' to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.
Published: 2016
Full Text: View/download PDF

2. Temporal-kernel recurrent neural networks.

Author: Sutskever I and Hinton G
Subjects: Algorithms, Humans, Neuropsychological Tests, Time Factors, Memory, Short-Term, Neural Networks, Computer
Abstract: A Recurrent Neural Network (RNN) is a powerful connectionist model that can be applied to many challenging sequential problems, including problems that naturally arise in language and speech. However, RNNs are extremely hard to train on problems that have long-term dependencies, where it is necessary to remember events for many timesteps before using them to make a prediction. In this paper we consider the problem of training RNNs to predict sequences that exhibit significant long-term dependencies, focusing on a serial recall task where the RNN needs to remember a sequence of characters for a large number of steps before reconstructing it. We introduce the Temporal-Kernel Recurrent Neural Network (TKRNN), which is a variant of the RNN that can cope with long-term dependencies much more easily than a standard RNN, and show that the TKRNN develops short-term memory that successfully solves the serial recall task by representing the input string with a stable state of its hidden units., (Copyright 2009 Elsevier Ltd. All rights reserved.)
Published: 2010
Full Text: View/download PDF

3. Deep, narrow sigmoid belief networks are universal approximators.

Author: Sutskever I and Hinton GE
Subjects: Algorithms, Humans, Nonlinear Dynamics, Learning, Neural Networks, Computer
Abstract: In this note, we show that exponentially deep belief networks can approximate any distribution over binary vectors to arbitrary accuracy, even when the width of each layer is limited to the dimensionality of the data. We further show that such networks can be greedily learned in an easy yet impractical way.
Published: 2008
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Sutskever I"'

1. Mastering the game of Go with deep neural networks and tree search.

2. Temporal-kernel recurrent neural networks.

3. Deep, narrow sigmoid belief networks are universal approximators.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

3 results on '"Sutskever I"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources