Blog posts

2024

Action Chunking with Transformers (ACT)

10 minutes read

Published: May 14, 2024

This blog covers a SOTA imitation learning model called Action Chunking with Transformers, which can perform versatile tasks with little amount of demonstration data.

Decision Transformer

7 minutes read

Published: April 29, 2024

This blog goes over Decision Transformer, which is an offline-RL method that learns to optimize from pregathered data using a transformer model.

Robotic Transformer (RT-1) [Japanese]

10 minutes read

Published: April 26, 2024

このブログではRobotic TransformerというGoogle Researchが考案した様々なタスクに対応した言語指令ロボット制御モデルを解説します。

Robotic Transformer (RT-1)

10 minutes read

Published: April 24, 2024

This blog quickly covers a SOTA imitation learning model called Robotics Transformer (RT-1), which can perform variaous tasks based on a language instruction.

Actor-Critic Methods (A2C, PPO, DDPG, MA-POCA)

20 minutes read

Published: April 09, 2024

This blog thoroughly covers the Actor-Critic approach, which is a keep concept in RL that allows algorithms to handle continuous action spaces with low variance by using both value and policy networks. Famous Actor-Critic methods like A2C, PPO, DDPG, and SAC are also showcases in the blog.

Vanilla Policy Gradient (VPG)

8 minutes read

Published: January 13, 2024

This blog thoroughly covers the policy gradient method, which is crucial for RL algorithms to handle continous action spaces.

2023

Temporal Difference Learning

3 minutes read

Published: December 18, 2023

This blog quickly goes over temporal difference (TD) learning, which is a vital aspect that makes RL sample efficient.

Tai Inui

Blog posts

2024

Action Chunking with Transformers (ACT)

Decision Transformer

Robotic Transformer (RT-1) [Japanese]

Robotic Transformer (RT-1)

Actor-Critic Methods (A2C, PPO, DDPG, MA-POCA)

Vanilla Policy Gradient (VPG)

2023

Temporal Difference Learning