Blog posts

2024

Action Chunking with Transformers (ACT)

10 minutes read

Published:

This blog covers a SOTA imitation learning model called Action Chunking with Transformers, which can perform versatile tasks with little amount of demonstration data.

Decision Transformer

7 minutes read

Published:

This blog goes over Decision Transformer, which is an offline-RL method that learns to optimize from pregathered data using a transformer model.

Robotic Transformer (RT-1) [Japanese]

10 minutes read

Published:

このブログではRobotic TransformerというGoogle Researchが考案した様々なタスクに対応した言語指令ロボット制御モデルを解説します。

Robotic Transformer (RT-1)

10 minutes read

Published:

This blog quickly covers a SOTA imitation learning model called Robotics Transformer (RT-1), which can perform variaous tasks based on a language instruction.

Actor-Critic Methods (A2C, PPO, DDPG, MA-POCA)

20 minutes read

Published:

This blog thoroughly covers the Actor-Critic approach, which is a keep concept in RL that allows algorithms to handle continuous action spaces with low variance by using both value and policy networks. Famous Actor-Critic methods like A2C, PPO, DDPG, and SAC are also showcases in the blog.

Vanilla Policy Gradient (VPG)

8 minutes read

Published:

This blog thoroughly covers the policy gradient method, which is crucial for RL algorithms to handle continous action spaces.

2023

Temporal Difference Learning

3 minutes read

Published:

This blog quickly goes over temporal difference (TD) learning, which is a vital aspect that makes RL sample efficient.