Deep Reinforcement Learning

Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025)

Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025) ...

AI can cut emissions and congestion in urban transport by nearly 50%

Traffic congestion, fuel consumption, and emissions also offer quantifiable performance indicators, making mobility uniquely ...

Interesting Engineering on MSN

AI-trained quadruped robot walks rough, low-friction terrain without human input

A quadruped robot has learned to walk across slippery, uneven terrain entirely through simulation, ...

Forbes

From Turing To DeepSeek, Reinforcement Learning Soars To AI Summit

Using a bunch of carrots to train a pony and rider. (Photo by: Education Images/Universal Images Group via Getty Images) Andrew Barto and Richard Sutton are the recipients of the Turing Award for ...

EurekAlert!

Deep reinforcement learning optimizes distributed manufacturing scheduling

A recent study published in Engineering presents a significant advancement in manufacturing scheduling. Researchers Xueyan Sun, Weiming Shen, Jiaxin Fan, and their colleagues from Huazhong University ...

VentureBeat

DeepSeek-R1’s bold bet on reinforcement learning: How it outpaced OpenAI at 3% of the cost

DeepSeek-R1's release last Monday has sent shockwaves through the AI community, disrupting assumptions about what’s required to achieve cutting-edge AI performance. Matching OpenAI’s o1 at just 3%-5% ...

Devdiscourse

AI trading systems mimicking human bias show higher risk

Reinforcement learning frames trading as a sequential decision-making problem, where an agent observes market conditions, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results