Reasoning Models Reinforcement Learning

13d

Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmarks

Ai2 updates its Olmo 3 family of models to Olmo 3.1 following extended RL training to boost performance.

Reinforcement Learning Does NOT Fundamentally Improve AI Models

Reinforcement Learning does NOT make the base model more intelligent; it limits the world of the base model in exchange for better early-pass performance. Graphs show that after pass 1000 the reasoning ...

Geeky Gadgets

New ChatGPT o1-preview reinforcement learning process explained

OpenAI has introduced its latest AI model, ChatGPT o1, a large language model (LLM) that significantly advances the field of AI reasoning. Leveraging reinforcement learning (RL), o1 represents a leap ...

How 2025 Recalibrated AI Models Race

In 2025, large language models moved beyond benchmarks to efficiency, reliability, and integration, reshaping how AI is ...

Geeky Gadgets

ChatGPT o1 AI reasoning and thinking explained

OpenAI has introduced two groundbreaking models, ChatGPT o1 Preview and ChatGPT o1 Mini, which represent a significant shift from their previous GPT series. These models are specifically designed to ...

SiliconANGLE

Beyond autocomplete: Reasoning models raise the bar for generative AI

If you go to ChatGPT.com, choose the o4-mini model from the drop-down menu and enter a prompt, you’ll see a message you’ve probably never seen before. “Thinking,” the chatbot responds as several ...

Unite.AI

The Reinforcement Gap: Why AI Excels at Some Tasks but Stalls at Others

Artificial Intelligence (AI) has achieved remarkable successes in recent years. It can defeat human champions in games like Go, predict protein structures with high accuracy, and perform complex tasks ...

SiliconANGLE

Microsoft releases small but mighty Phi-4 reasoning AI models that outperform larger models

Microsoft Corp. has released three new advanced small language models, extending its “Phi” range of AI models that include reasoning capability. The new model releases ...

VentureBeat

Tencent’s new AI technique teaches language models ‘parallel thinking’

In a new paper, researchers from Tencent AI Lab Seattle and the University of Maryland, College Park, present a reinforcement learning technique that enables large language models (LLMs) to utilize ...

The Brighterside of News on MSN (Opinion)

MIT researchers teach AI models to learn from their own notes

Large language models already read, write, and answer questions with striking skill. They do this by training on vast ...
