Reinforcement Learning LLM

News

ExtremeTech on MSN1d

Modern AI LLMs can seem almost magical when you use them. But, just like even the best magic tricks, there is an explanation ...

DeepCoder delivers top coding performance in efficient 14B open model

DeepCoder-14B competes with frontier models like o3 and o1—and the weights, code, and optimization platform are open source.

24d

Databricks Has a Trick That Lets AI Models Improve Themselves

Using several recent innovations, the company Databricks will let customers boost the IQ of their AI models even if they don’t have squeaky clean data.

SFGate12d

What is reinforcement learning? An AI researcher explains a key method of teaching machines – and how it relates to training your dog

Turing’s ideas ultimately led to the development of reinforcement learning, a branch of artificial intelligence. Reinforcement learning designs intelligent agents by training them to maximize rewards ...

Business Insider1mon

John Snow Labs Introduces First Commercially Available Medical Reasoning LLM at NVIDIA GTC

March 20, 2025 (GLOBE NEWSWIRE) -- John Snow Labs, the AI for healthcare company, today announced Medical LLM Reasoner ... capabilities through reinforcement learning. Developed with NVIDIA ...

SiliconANGLE23d

Databricks partners with Anthropic and touts breakthrough in reinforcement learning

Separately, Databricks said it has found a new fine-tuning method that leverages Test-time Adaptive Optimization, a type of reinforcement learning that ... Traditional LLM training often relies ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results