AI & ML interests
Offline RL datasets
farama-minari/HumanoidStandup-v5-SAC-simple
Reinforcement Learning
• Updated
farama-minari/HumanoidStandup-v5-SAC-medium
Reinforcement Learning
• Updated • 3
farama-minari/HumanoidStandup-v5-SAC-expert
Reinforcement Learning
• Updated • 8
farama-minari/Ant-v5-SAC-expert-fine-tuned
• Updated
farama-minari/Humanoid-v5-TQC-simple
Reinforcement Learning
• Updated • 12
farama-minari/Humanoid-v5-TQC-medium
Reinforcement Learning
• Updated • 10
farama-minari/Humanoid-v5-TQC-expert
Reinforcement Learning
• Updated • 13
farama-minari/Swimmer-v5-PPO-medium
Reinforcement Learning
• Updated • 2 • 1
farama-minari/Swimmer-v5-PPO-expert
Reinforcement Learning
• Updated • 2
farama-minari/Ant-v5-SAC-simple
Reinforcement Learning
• Updated • 2
farama-minari/HalfCheetah-v5-TQC-simple
Reinforcement Learning
• Updated • 23
farama-minari/Hopper-v5-SAC-simple
Reinforcement Learning
• Updated • 40
farama-minari/Hopper-v5-SAC-medium
Reinforcement Learning
• Updated • 35
farama-minari/Hopper-v5-SAC-expert
Reinforcement Learning
• Updated • 52
farama-minari/HalfCheetah-v5-TQC-medium
Reinforcement Learning
• Updated • 25
farama-minari/HalfCheetah-v5-TQC-expert
Reinforcement Learning
• Updated • 29
farama-minari/Hopper-v5-TQC-expert
Reinforcement Learning
• Updated • 1
farama-minari/Pusher-v5-SAC-medium
Reinforcement Learning
• Updated • 1
farama-minari/Pusher-v5-SAC-expert
Reinforcement Learning
• Updated • 18
farama-minari/Reacher-v5-SAC-medium
Reinforcement Learning
• Updated • 3
farama-minari/Reacher-v5-SAC-expert
Reinforcement Learning
• Updated • 12
farama-minari/Ant-v5-SAC-medium
Reinforcement Learning
• Updated • 8
farama-minari/Walker2d-v5-SAC-simple
Reinforcement Learning
• Updated • 21
farama-minari/Walker2d-v5-SAC-medium
Reinforcement Learning
• Updated • 11
farama-minari/Walker2d-v5-SAC-expert
Reinforcement Learning
• Updated • 24
farama-minari/Reacher-v5-SAC-simple
Reinforcement Learning
• Updated • 5
farama-minari/HumanoidStandup-v5-PPO-medium
Reinforcement Learning
• Updated
farama-minari/HumanoidStandup-v5-PPO-simple
Reinforcement Learning
• Updated
farama-minari/Humanoid-v5-SAC-medium
Reinforcement Learning
• Updated • 11
farama-minari/Humanoid-v5-SAC-simple
Reinforcement Learning
• Updated