AI News

OpenAI keeps shuffling its executives in bid to win AI agent battle

OpenAI announced yet another reorganization Friday, consolidating certain areas and making company president Greg Brockman the official lead of all things product. In a memo viewed by The Verge, Brockman wrote that since OpenAI’s product strategy for this year is to go all-in on AI agents, the company is combining its products to “invest in […]

Read More »

ArXiv will ban researchers who upload papers full of AI slop

ArXiv, a popular platform for preprint academic research, is taking a new step to attempt to reduce the volume of papers that include AI slop. If a paper has “incontrovertible evidence that the authors did not check the results of LLM generation,” such as hallucinated references or “meta-comments” left by an LLM, authors will be […]

Read More »

Deloitte: Scale ‘autonomous intelligence’ for real growth

Enterprise leaders must progress past generative applications and scale “autonomous intelligence” to capture real growth. Generating text or summarising internal communications offers localised productivity improvements, yet these abilities rarely alter the core cost or revenue structure of a large organisation. Enterprises are now focused on deploying systems capable of independent execution. Leaders are demanding applications […]

The post Deloitte: Scale ‘autonomous intelligence’ for real growth appeared first on AI News.

Read More »

Best AI Agents for Software Development Ranked: A Benchmark-Driven Look at the Current Field

The AI coding agent field in 2026 is more capable, more fragmented, and harder to benchmark than it looks. Claude Code leads on code quality at 87.6% SWE-bench Verified. GPT-5.5 tops Terminal-Bench at 82.7%. But the benchmark OpenAI itself declared contaminated in February 2026 is still being used to rank these tools — including by the labs publishing their own scores.

The post Best AI Agents for Software Development Ranked: A Benchmark-Driven Look at the Current Field appeared first on MarkTechPost.

Read More »

Zyphra Releases ZAYA1-8B-Diffusion-Preview: The First MoE Diffusion Model Converted From an Autoregressive LLM With Up to 7.7x Speedup

Zyphra’s latest release shows that an autoregressive MoE model can be converted into a discrete diffusion model with no systematic loss in evaluation performance. ZAYA1-8B-Diffusion-Preview achieves up to 7.7x inference speedup over autoregression by shifting decoding from memory-bandwidth bound to compute-bound — a key advantage as modern GPUs continue scaling FLOPs faster than memory bandwidth.

The post Zyphra Releases ZAYA1-8B-Diffusion-Preview: The First MoE Diffusion Model Converted From an Autoregressive LLM With Up to 7.7x Speedup appeared first on MarkTechPost.

Read More »

5 Interesting Startup Deals You May Have Missed: A Law Firm Operating System, Building Defense Tech Near The Battlefield, And Cell-Based Milk

Most of the interesting companies that caught our eye in the past month were working on problems in the physical world, often far from the glow of a laptop screen. They include a supplier of cell-based milk, a startup that bills itself as the operating system for modern law firms, and a defense tech startup that makes containerized manufacturing platforms for drones and other equipment near the battlefield.

Read More »

How Chinese short dramas became AI content machines

In a dimly lit bedroom, a frightened young woman is thrown onto a bed by a tall, muscular man. He grabs her hand, and flame-like vines crawl across her body, fusing with her flesh. She levitates, then drops. A dragon-shaped tattoo appears across her chest. “Two months,” the man says. “Give me an heir, or…

Read More »

The promises and pitfalls of personalized health

This is Optimizer, a weekly newsletter sent every Friday from Verge senior reviewer Victoria Song that dissects and discusses the latest gizmos and potions that swear they’re going to change your life. Opt in for Optimizer here. A few days ago, my esthetician was smearing hot wax on my face. The two caterpillars I call […]

Read More »

OpenAI now wants ChatGPT to access your bank accounts

Your trust in AI is about to be put to the test: OpenAI will soon let you give the chatbot direct access to your bank accounts. The new feature announced in preview today will allow users to “securely connect” ChatGPT with Plaid – the bank-to-app bridging platform used by 12,000 financial institutions, including Schwab, Fidelity, […]

Read More »

Does Trump Mobile know how many stripes are on the American flag?

Where’s the Trump phone? We’re going to keep talking about it every week. We’ve reached out, as usual, to ask about the Trump phone’s whereabouts. This week, despite our best hopes, we still don’t have our phone – but we do have some fresh doubts about the company’s patriotic credentials. This has been a momentous […]

Read More »

AI radio hosts demonstrate why AI can’t be trusted alone

Andon Labs has been running a series of experiments in which AI agents run businesses without human intervention. Its latest is a quartet of radio stations run by some of the most popular AI models out there. “Thinking Frequencies” is run by Claude, “OpenAIR” by ChatGPT, “Backlink Broadcast” by Google’s Gemini, and “Grok and Roll […]

Read More »

Cline Releases Cline SDK: An Open-Source Agent Runtime Now Powering Its CLI and Kanban, With IDE Extensions Being Migrated

Cline has extracted its internal agent harness into an open-source TypeScript SDK called @cline/sdk, the same runtime now powering its CLI and Kanban, with VS Code and JetBrains extensions being migrated. The SDK is structured as a four-layer stack — @cline/shared, @cline/llms, @cline/agents, and @cline/core — with native support for plugins, subagents, CRON scheduling, checkpointing, and MCP connectors. On Terminal Benchmark 2.0, Cline CLI scored 74.2% on claude-opus-4.7, compared to Anthropic’s published 69.4% for Claude Code on the same model. Install via npm install @cline/sdk. Requires Node.js 22+.

The post Cline Releases Cline SDK: An Open-Source Agent Runtime Now Powering Its CLI and Kanban, With IDE Extensions Being Migrated appeared first on MarkTechPost.

Read More »