Openai O1 Reinforcement Learning

2hon MSN

OpenAI study says punishing AI models for lying doesn't help — It only sharpens their deceptive and obscure workarounds

Hallucinations and outright wrong responses are among the major challenges facing the progression and public interpretation ...

OpenAI Sounds the Alarm : The Hidden Dangers of Controlling AI Thought Processes

OpenAI warns AI labs about the risks of controlling AI thought processes, highlighting dangers like obfuscation and reward ...

18d

AI tries to cheat at chess when it’s losing

While supercomputers—most famously IBM’s Deep Blue —have long surpassed the world’s best human chess players, generative AI ...

MIT Technology Review18d

AI reasoning models can cheat to win chess games

These newer models appear more likely to indulge in rule-bending behaviors than previous generations—and there’s no way to ...

The Verge25d

OpenAI announces GPT-4.5, warns it’s not a frontier AI model

It was previously reported that OpenAI was using its o1 reasoning model ... methods like supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), similar to those ...

Tencent’s Hunyuan T1 AI reasoning model rivals DeepSeek in performance and price

The tech giant’s latest offering leverages large-scale reinforcement learning, rivalling DeepSeek in top benchmark tests.

13d

Frontier AI like o3-mini can cheat to achieve goals and then lie about it

New ChatGPT research from OpenAI shows that reasoning models like o1 and o3-mini can lie and cheat to achieve a goal.

ZDNet19d

OpenAI expands GPT-4.5 rollout. Here's how to access (and what it can do for you)

The o1 version took a bit longer ... such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF). During the livestream, OpenAI took a trip down memory lane ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results