News
OpenAI o1 is a large language model focused on complex reasoning through reinforcement learning. It outperforms GPT-4o in domains like coding, math, and science by using a chain-of-thought process.
The application of Deep Reinforcement Learning (DRL) in economics has been an area of active research in recent years. A number of recent works have shown how deep reinforcement learning can be used ...
DeepSeek-R1’s Monday release has sent shockwaves through the AI community, disrupting assumptions about what’s required to achieve cutting-edge AI performance. This story focuses on exactly ...
The bird has never gotten much credit for being intelligent. But the reinforcement learning powering the world’s most ...
This study seeks to construct a basic reinforcement learning-based AI-macroeconomic simulator. We use a deep RL (DRL) approach (DDPG) in an RBC macroeconomic model. We set up two learning scenarios, ...
A common approach DeepSeek’s use of reinforcement learning is the main innovation that the company describes in its R1 paper. But DeepSeek is not the only firm experimenting with this technique.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results