AI active agents do not have to call: Here is the evidence

Are Ai's most expensive agents using a scale? It is a hot topic in the world of artificial, and new research from Oppo Agent eventually puting some real numbers – the solution – on the table.
The most impressive AIs of AI can face major activities, with many steps that use power to consult large languages (llms) as GPT-4 and Cloud. But in every way to succeed, the price of running these programs is the key to businesses (and investigators!) To use it more. Include a “valid agent” framework “New agent recipe for the agents that keep about all performance but significantly reduce the cost.
Real Problem: Ai Ei-AI gets the price
Have you ever wondered why your favorite AI helper didn't take all the features of your work at the moment? It's not just technology – it's a payment. Some high-quality cuts require each of the best APIs. Multiply that with thousands of users and, suddenly, “measurement” seems to be like a pipeline dream.
This page The OPPO team saw this coming. Their recent research It is structured in order when the agents meet the cost and, most importantly, how much difficulty is needed to solve daily activities.
GAME-Changer: Accumulation of AI agent
This research is introducing a clear crystalic metric: Cost-of-Pass. Think of “the perfect cost of producing a proper answer in trouble.” Retixes how much you pay for the tokens (every word inside and out of your model) and how good model is getting things at first.
Here is a punchlugree: The most effective models are like Claude 3.7 Sonnet top boards with accuracy, but their passing costs are three to four to GPT-4.1. For simple functions, small models like the SQU3-30B-A3B do little but they call pens.

Major exam: What makes agents call?
1. Backbone Model Choice
Claude 3.7 Sonnet Nails 61.82% accuracy in difficult benchmark but costs $ 3.54 for each successful work. GPT-4.1 drops slowly (53.33%) but only costs $ 0.98. Looking for arenones, quick and cheap? QWen3 declines cost to $ 0.13 for basic tasks.
2. Planning and measuring
You thought that “More Editing” means “better results.” Not so fast. Too many steps equal to high costs, but not increased more to success rate. Moderate acts allowed the agent to try multiple options (best-n) multiple compute for jumps for accuracy.
3. Using tools
Agents can use browsers, search engines, and other tools for new information. Many sources of search helps until the point, but moves fancy like pages-up / page down add cost without much payment. Keeping the tool use simple and very comprehensive jobs.
4. The agent memory
Amazingly, The simplest memory setup – to keep the tracking of actions and views – give the best balance of lower costs and optimal operations. Additional memory modules make slow and more expensive agents, for less profit.
To put everything together: “Working agents” Blueprint
Here is how the effective system of agents explode the code:
- Use a smart model but not excessively (GPT-4.1).
- Limit its steps to avoid endless 'transfusing.'
- Search more (integrate on Google, Wikipedia, and other sources), but do not go for a crazy browser.
- Keep memory dependence and easy.
The result? Effective agents bring 96.7% of the performance of senior succumbents (such as OWL), but under three of the costs! That comes down 28.4% in the bill, without the result of sacrifice.
Why is this important
This study is a wake call: Smart AI does not just mean energy – it's about being helpful. If you build or create agents, measure your expenses and select your ingredients wisely. Don't think it is best all the time. Sometimes, a simple win.
This page Active planning system Is the open source, so you can start trying these ideas now. As AI becomes a complete, effective shape will be key – even if you release agents at first or for a Fortne 500 company.
Fact: NEXT-GEN AI AI can be smart and cheap when you are willing to reconsider how you build it. The efficient agent is not just another Technical DEP-is a roadmap road to do AI everywhere. And who doesn't want that?
Look Paper including GitHub page. Also, feel free to follow it Sane and don't forget to join ours 100K + ml subreddit Then sign up for Our newspaper.
Sana Hassan, a contact in MarktechPost with a student of the Dual-degree student in the IIit Madras, loves to use technology and ai to deal with the real challenges of the world. I'm very interested in solving practical problems, brings a new view of ai solution to AI and real solutions.



