Open source reinforcement learning framework for training AI models

Above: Google’s Mountain View headquarters.

Image Credit: Google

Reinforcement learning — an artificial intelligence (AI) technique that uses rewards (or punishments) to drive agents in the direction of specific goals — trained the systems that defeated Alpha Go world champions and mastered Valve’s Dota 2. And it’s a core part of Google subsidiary DeepMind’s deep Q-network (DQN), which can distribute learning across multiple workers in the pursuit of, for example, achieving “superhuman” performance in Atari 2600 games. The trouble is, reinforcement learning frameworks take time to master a goal, tend to be inflexible, and aren’t always stable.

That’s why Google is proposing an alternative: an open source reinforcement framework based on TensorFlow, its machine learning library. It’s available from Github starting today.

“Inspired by one of the main components in reward-motivated behavior in the brain and reflecting the strong historical connection between neuroscience and reinforcement learning research, this platform aims to enable the kind of speculative research that can drive radical discoveries,” Pablo Samuel Castro and Marc G. Bellemare, researchers on the Google Brain Team, wrote in a blog post. “This release also includes a set of colabs that clarify how to use our framework.”

They and the Google Brain team developed the reinforcement framework with three tenets in mind: flexibility, stability, and reproducibility.

Google reinforcement

Above: A visualization of AI agents trained using reinforcement learning.

Image Credit: Google

To that end, it includes a compact set of well-documented code (15 Python files) focused on the Arcade Learning Environment — a platform for evaluating AI technology with video games — and four distinct machine learning models: the aforementioned DQN; C51; a simplified variant of the Rainbow agent; and the Implicit Quantile Network. In the interest of reproducibility, the code is provided with full test coverage and training data (in JSON and Python pickle formats) across the 60 games supported by the Arcade Learning Environment and follows best practices on standardizing the results for empirical evaluations.

Alongside the release of the reinforcement framework, Google is launching a website that allows developers to quickly visualize training runs for multiple agents. It’s also making available trained models, raw statistics logs, and TensorFlow event files for plotting with TensorBoard, the Mountain View company’s suite of visualization tools for TensorFlow programs.

“Our hope is that our framework’s flexibility and ease-of-use will empower researchers to try out new ideas, both incremental and radical,” Bellemare and Castro wrote. “We are already actively using it for our research and finding it is giving us the flexibility to iterate quickly over many ideas. We’re excited to see what the larger community can make of it.”

Source: Google releases open source reinforcement learning framework for training AI models

Related Blogs:

Transforming Enterprises with
Data & AI Services & Solutions.

ThirdEye delivers Data and AI services & solutions for enterprises worldwide by
leveraging state-of-the-art Data & AI technologies.

Talk to ThirdEye

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

AI & Machine Learning

Generative AI & ChatGPT

Big Data & Engineering

Digital Transformation

Automating Tasks

Know Your Customers

Project Types

Manufacturing

Retail

Healthcare

Energy, Oil & Gas

IT

AdTech

NGO

More...

Transforming Enterprises with
Data & AI Services & Solutions.

Services We Offer

Tailored Solutions

Explore Us

Talk To Us

AI & Machine Learning

Generative AI & ChatGPT

Big Data & Engineering

Digital Transformation

Automating Tasks

Know Your Customers

Project Types

Manufacturing

Retail

Healthcare

Energy, Oil & Gas

IT

AdTech

NGO

More...

Transforming Enterprises with Data & AI Services & Solutions.

Share This Article

Related Posts

Significance and Applications of Edge AI

The Three Pillars for AI Project Success

AI – Past, Present and Future

Patient Appointment Management Optimization with Simulated Annealing

Services We Offer

Tailored Solutions

Explore Us

Talk To Us

Transforming Enterprises with
Data & AI Services & Solutions.