OpenAI o1 Reinforcement Learning

News

DeepCoder delivers top coding performance in efficient 14B open model

DeepCoder-14B competes with frontier models like o3 and o1—and the weights, code, and optimization platform are open source.

Unite.AI · 5d

The Rise of Small Reasoning Models: Can Compact AI Match GPT-Level Reasoning?

In recent years, the AI field has been captivated by the success of large language models (LLMs). Initially designed for ...

Developer Tech · 2d

Open-source AI matches coding abilities of proprietary models

To build a robust training set, Agentica and Together AI curated 24,000 high-quality, verifiable coding problems. This ...

5d on MSN

TechKnow: Musk’s Grok-3 vs. China’s DeepSeek

NVIDIA H100s chasing frontier gains, while DeepSeek-R1 delivers similar performance using a fraction of the compute, ...

4d on MSN

The Grok model that Elon Musk went offline for may have just beaten China's DeepSeek-R1

Elon Musk's xAI has introduced Grok-3, surpassing China's DeepSeek-R1 in performance. Grok-3 was trained using 200,000 H100 GPUs, demonstrating a brut ...

Technique · 2d

DeepSeek rattles U.S. Markets

Chinese AI startup DeepSeek on January 20 launched two large language models (LLMs): DeepSeek-R1-Zero and DeepSeek-R1-Distill ...

Meta’s answer to DeepSeek is here: Llama 4 launches with long context Scout and Maverick models, and 2T parameter Behemoth on the way!

While DeepSeek R1 and OpenAI o1 edge out Behemoth on a couple metrics, Llama 4 Behemoth remains highly competitive.

Geeky Gadgets · 5d

New OpenAI PaperBench: Autonomous AI Research Benchmarking

OpenAI has unveiled “PaperBench,” a benchmark designed to evaluate how effectively AI agents can replicate innovative machine learning research. This initiative is a cornerstone of OpenAI’s ...
