AI Agent
16 articles about AI Agent

OpenAI requests contractors to upload past work for AI agent performance evaluation
OpenAI is asking contractors to upload past work projects—after removing sensitive data—to help evaluate and train office-focused AI agents.
LinkedIn reinstates AI agent startup Artisan after initial ban
AI agent startup Artisan was briefly banned on LinkedIn, and its CEO says the real reason wasn’t what viral posts claimed.
Mirakl's vision for agent-native commerce using AI and ChatGPT Enterprise
Mirakl is using AI agents and ChatGPT Enterprise to speed up documentation, improve customer support, and move toward agent-native commerce with Mirakl Nexus.
Scaling accounting capacity using OpenAI-powered AI agents
Basis uses OpenAI-powered AI agents to help accounting firms save up to 30% of their time and scale advisory and growth capacity.
New tools to support developers in building reliable agents
The platform is adding new tools to help developers and enterprises build useful, reliable agents.
Self-play enables machine learning system to surpass top Dota 2 players
It explains how Dota 2 self-play let an AI rapidly improve from below human level to beating top pros, outperforming supervised learning by generating better data as it learns.
Anthropic secures enterprise deal with Allianz for AI agent development
Anthropic landed its first 2026 enterprise deal with Allianz to build AI agents and provide Claude Code.
Caterpillar partners with Nvidia to integrate AI into construction equipment
Caterpillar is testing Nvidia-powered AI agents in an excavator to add AI capabilities to its construction equipment.
SIMA 2: an AI agent that plays, reasons, and learns in virtual 3D worlds
SIMA 2 is a Gemini-powered AI agent that can reason, learn, and act with you in interactive 3D virtual worlds.
Outtake uses GPT-4.1 and OpenAI o3 to accelerate digital threat resolution
Outtake uses OpenAI’s GPT-4.1 and o3 to run AI agents that detect and fix digital threats 100x faster.
MLE-bench: benchmark for evaluating AI agents on machine learning engineering tasks
MLE-bench is a benchmark that tests how well AI agents can do machine learning engineering tasks.

Tech companies adopt AI platforms amid developer concerns over user interaction
As AI devices emerge as the next tech platform, some app developers worry AI agents will come between them and their users.
BNY expands AI adoption enterprise-wide using OpenAI technology
BNY is rolling out OpenAI-powered AI across the company via its Eliza platform, enabling 20,000+ employees to build agents that boost efficiency and client results.
Introducing instant checkout and agentic commerce protocol in ChatGPT
ChatGPT is introducing instant checkout and an agentic commerce protocol to let people, AI agents, and businesses shop together.
PaperBench: evaluating AI agents' ability to replicate AI research
PaperBench is a benchmark that tests whether AI agents can replicate state-of-the-art AI research.
Introducing custom GPTs with instructions, knowledge, and skills
GPTs let you build custom ChatGPT versions by combining instructions, extra knowledge, and specific skills.
Showing page 1 of 1