Rick's Cafe AI 9:44 am on June 23, 2021
Tags: GANS ( 71 ), Reinforcement Learning

Reward is enough

In this article we hypothesise that intelligence, and its associated abilities, can be understood as subserving the maximisation of reward. Accordingly, reward is enough to drive behaviour that exhibits abilities studied in natural and artificial intelligence, including knowledge, learning, perception, social intelligence, language, generalisation and imitation. This is in contrast to the view that specialised problem formulations are needed for each ability, based on other signals or objectives. Furthermore, we suggest that agents that learn through trial and error experience to maximise reward could learn behaviour that exhibits most if not all of these abilities, and therefore that powerful reinforcement learning agents could constitute a solution to artificial general intelligence. Read More

#gans, #reinforcement-learning

Rick's Cafe AI 3:13 pm on March 20, 2021
Tags: Reinforcement Learning

Novel deep learning framework for symbolic regression

Lawrence Livermore National Laboratory (LLNL) computer scientists have developed a new framework and an accompanying visualization tool that leverages deep reinforcement learning for symbolic regression problems, outperforming baseline methods on benchmark problems.

The paper was recently accepted as an oral presentation at the International Conference on Learning Representations (ICLR 2021), one of the top machine learning conferences in the world. The conference takes place virtually May 3-7.

In the paper, the LLNL team describes applying deep reinforcement learning to discrete optimization — problems that deal with discrete “building blocks” that must be combined in a particular order or configuration to optimize a desired property. The team focused on a type of discrete optimization called symbolic regression — finding short mathematical expressions that fit data gathered from an experiment. Symbolic regression aims to uncover the underlying equations or dynamics of a physical process. Read More

#reinforcement-learning

Rick's Cafe AI 6:48 pm on February 22, 2021
Tags: Reinforcement Learning, Videos ( 368 )

Deep Reinforcement Learning: Neural Networks for Learning Control Laws

Read More

#reinforcement-learning, #videos

Rick's Cafe AI 1:12 pm on February 22, 2021
Tags: Reinforcement Learning, Videos ( 368 )

Reinforcement Learning: Machine Learning Meets Control Theory

Read More

#reinforcement-learning, #videos

Rick's Cafe AI 9:00 pm on February 6, 2021
Tags: Reinforcement Learning

Tips for Running High-Fidelity Deep Reinforcement Learning Experiments

Despite recent incredible algorithmic advances in the field, deep reinforcement learning (DRL) remains notorious for being computationally expensive, prone to “silent bugs”, and difficult to tune hyperparameters. These phenomena make running high-fidelity, scientifically-rigorous reinforcement learning experiments paramount.

In this article, I will discuss a few tips and lessons I’ve learned to mitigate the effects of these difficulties in DRL — tips I never would have learned from a reinforcement learning class. Read More

#reinforcement-learning

Rick's Cafe AI 11:05 am on February 3, 2021
Tags: Podcasts ( 79 ), Reinforcement Learning

Reinforcement Learning At Facebook with Jason Gauci

If you ever wanted to learn about machine learning you could do worse than have Jason Gauci teach you. Jason has worked on YouTube recommendations. He was an early contributor to TensorFlow the open-source machine learning platform. His thesis work was cited by DeepMind. Read More

#podcasts, #reinforcement-learning

Rick's Cafe AI 10:44 am on January 14, 2021
Tags: Reinforcement Learning

Discrete Latent Space World Models for Reinforcement Learning

Sample efficiency remains a fundamental issue of reinforcement learning. Model-based algorithms try to make better use of data by simulating the environment with a model. We propose a new neural network architecture for world models based on a vector quantized-variational autoencoder (VQ-VAE) to encode observations and a convolutional LSTM to predict the next embedding indices. A model-free PPO agent is trained purely on simulated experience from the world model. We adopt the setup introduced by Kaiser et al. (2020), which only allows100Kinteractionswith the real environment, and show that we reach better performance than their SimPLe algorithm in five out of six randomly selected Atari environments, while our model is significantly smaller. Read More

#reinforcement-learning

Rick's Cafe AI 10:31 am on December 11, 2020
Tags: Reinforcement Learning

Deep reinforcement-learning architecture combines pre-learned skills to create new sets of skills on the fly

A team of researchers from the University of Edinburgh and Zhejiang University has developed a way to combine deep neural networks (DNNs) to create a new type of system with a new kind of learning ability. The group describes their new architecture and its performance in the journal Science Robotics.

Deep neural networks are able to learn functions by training on multiple examples repeatedly. To date, they have been used in a wide variety of applications such as recognizing faces in a crowd or deciding whether a loan applicant is credit-worthy. In this new effort, the researchers have combined several DNNs developed for different applications to create a new system with the benefits of all of its constituent DNNs. Read More

#reinforcement-learning

Rick's Cafe AI 11:10 am on December 10, 2020
Tags: Big7 ( 259 ), Reinforcement Learning, Robotics ( 197 )

Alphabet’s Loon hands the reins of its internet air balloons to self-learning AI

Alphabet’s Loon, the team responsible for beaming internet down to Earth from stratospheric helium balloons, has achieved a new milestone: its navigation system is no longer run by human-designed software.

Instead, the company’s internet balloons are steered around the globe by an artificial intelligence — in particular, a set of algorithms both written and executed by a deep reinforcement learning-based flight control system that is more efficient and adept than the older, human-made one. The system is now managing Loon’s fleet of balloons over Kenya, where Loon launched its first commercial internet service in July after testing its fleet in a series of disaster relief initiatives and other test environments for much of the last decade. Read More

#big7, #reinforcement-learning, #robotics

Rick's Cafe AI 9:51 pm on December 4, 2020
Tags: Big7 ( 259 ), Reinforcement Learning

ReBeL: A general game-playing AI bot that excels at poker and more

Combining reinforcement learning with search (RL+Search) has been tremendously successful for perfect-information games. But prior RL+Search algorithms break down in imperfect-information games. We introduce ReBeL, an algorithm that for the first time enables sound RL+Search in imperfect-information games like poker.

ReBeL achieves superhuman performance in heads-up no-limit Texas Hold’em while using far less domain knowledge than any prior poker bot and extends to other imperfect-information games as well, such as Liar’s Dice, for which we’ve open-sourced our implementation.

ReBeL is a major step toward creating ever more general AI algorithms. Read More

#big7, #reinforcement-learning

Rick's Cafe AI

The latest in Artificial Intelligence carefully curated into its own special blend

Tag Archives: Reinforcement Learning

Reward is enough

Novel deep learning framework for symbolic regression

Deep Reinforcement Learning: Neural Networks for Learning Control Laws

Reinforcement Learning: Machine Learning Meets Control Theory

Tips for Running High-Fidelity Deep Reinforcement Learning Experiments

Reinforcement Learning At Facebook with Jason Gauci

Discrete Latent Space World Models for Reinforcement Learning

Deep reinforcement-learning architecture combines pre-learned skills to create new sets of skills on the fly

Alphabet’s Loon hands the reins of its internet air balloons to self-learning AI

ReBeL: A general game-playing AI bot that excels at poker and more