site:the-decoder.com - Search News

News

Anthropic says that AI can learn risky behaviors even when the training ...

AI models can pick up hidden behaviors from seemingly harmless data—even when there are no obvious clues. Researchers warn that this might be a fundamental property of neural networks.

the-decoder7d

OpenAI claims a breakthrough in LLM reasoning on complex math problems

OpenAI says its experimental language model has solved International Mathematical Olympiad (IMO) problems at a gold medal level—a possible breakthrough for AI with general reasoning skills. The ...

the-decoder1mon

Math genius Terence Tao says that AI still can't "smell" bad math

Terence Tao, widely regarded as a mathematical prodigy, says that AI still lacks what he calls a mathematical "sense of smell." According to Tao, even when generative AI produces flawed proofs, they ...

the-decoder1mon

Anthropic shares blueprint for Claude Research agent using multiple AI ...

Anthropic has published the technical details behind its new Claude Research agent, which uses a multi-agent approach to speed up and improve complex searches.

the-decoder1mon

OpenAI cuts o3 model prices by 80% and launches o3-pro today

OpenAI has lowered the price of its o3 language model by 80 percent, CEO Sam Altman said. The new cost is $2 per million input tokens and $8 per million output tokens. The move follows Google’s Gemini ...

the-decoder1mon

Meta AI chief scientist LeCun's latest comment reveals deep industry ...

Yann LeCun, Meta's chief AI scientist, has taken a direct shot at Anthropic CEO Dario Amodei on Threads, making clear just how sharply the AI community is split over the future of general artificial ...

the-decoder2mon

Google Deepmind CEO Demis Hassabi says world models are making progress ...

Google Deepmind CEO Demis Hassabis says world models—AI systems that simulate the structure of the real world—are already making surprising progress toward general intelligence.

the-decoder2mon

Anthropic releases Claude 4 with new safety measures targeting CBRN misuse

Anthropic has released its next generation of AI models, Claude Opus 4 and Claude Sonnet 4, and is introducing new safety measures designed to prevent their use in developing chemical, biological, ...

the-decoder2mon

Mistral launches Devstral Small 24B, a new open-source LLM for coding

The French AI startup Mistral has launched Devstral Small 24B, a new open language model built for software development and described as "agentic." ...

the-decoder2mon

Geoffrey Hinton's wildly overconfident AI prediction failed—now it's ...

Geoffrey Hinton, a leading AI researcher and Turing Award winner, now says he was too quick to declare that artificial intelligence would replace radiologists.

the-decoder2mon

OpenAI brings its new GPT-4.1 model to ChatGPT users

OpenAI says GPT-4.1 is particularly strong when it comes to programming tasks and following instructions precisely. In our tests, the model is noticeably less "chatty" than GPT-4o—not being overly ...

the-decoder2mon

OpenAI says its latest models outperform doctors in medical benchmark

OpenAI has released a new benchmark for testing AI systems in healthcare. Called HealthBench, it's designed to evaluate how well language models handle realistic medical conversations. According to ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results