• About Us
  • Disclaimer
  • Contact Us
  • Privacy Policy
Thursday, February 26, 2026
mGrowTech
No Result
View All Result
  • Technology And Software
    • Account Based Marketing
    • Channel Marketing
    • Marketing Automation
      • Al, Analytics and Automation
      • Ad Management
  • Digital Marketing
    • Social Media Management
    • Google Marketing
  • Direct Marketing
    • Brand Management
    • Marketing Attribution and Consulting
  • Mobile Marketing
  • Event Management
  • PR Solutions
  • Technology And Software
    • Account Based Marketing
    • Channel Marketing
    • Marketing Automation
      • Al, Analytics and Automation
      • Ad Management
  • Digital Marketing
    • Social Media Management
    • Google Marketing
  • Direct Marketing
    • Brand Management
    • Marketing Attribution and Consulting
  • Mobile Marketing
  • Event Management
  • PR Solutions
No Result
View All Result
mGrowTech
No Result
View All Result
Home Technology And Software

8 billion tokens a day forced AT&T to rethink AI orchestration — and cut costs by 90%

Josh by Josh
February 26, 2026
in Technology And Software
0
8 billion tokens a day forced AT&T to rethink AI orchestration — and cut costs by 90%



When your average daily token usage is 8 billion a day, you have a massive scale problem.

READ ALSO

AI recession: A memo laid out how AI could kill jobs. Wall Street panicked.

New Webb Telescope photos show off the Exposed Cranium Nebula

This was the case at AT&T, and chief data officer Andy Markus and his team recognized that it simply wasn’t feasible (or economical) to push everything through large reasoning models.

So, when building out an internal Ask AT&T personal assistant, they reconstructed the orchestration layer. The result: A multi-agent stack built on LangChain where large language model “super agents” direct smaller, underlying “worker” agents performing more concise, purpose-driven work.

This flexible orchestration layer has dramatically improved latency, speed and response times, Markus told VentureBeat. Most notably, his team has seen up to 90% cost savings.

“I believe the future of agentic AI is many, many, many small language models (SLMs),” he said. “We find small language models to be just about as accurate, if not as accurate, as a large language model on a given domain area.”

Most recently, Markus and his team used this re-architected stack along with Microsoft Azure to build and deploy Ask AT&T Workflows, a graphical drag-and-drop agent builder for employees to automate tasks.

The agents pull from a suite of proprietary AT&T tools that handle document processing, natural language-to-SQL conversion, and image analysis. “As the workflow is executed, it's AT&T’s data that's really driving the decisions,” Markus said. Rather than asking general questions, “we're asking questions of our data, and we bring our data to bear to make sure it focuses on our information as it makes decisions.”

Still, a human always oversees the “chain reaction” of agents. All agent actions are logged, data is isolated throughout the process, and role-based access is enforced when agents pass workloads off to one another.

“Things do happen autonomously, but the human on the loop still provides a check and balance of the entire process,” Markus said.

Not overbuilding, using ‘interchangeable and selectable’ models

AT&T doesn’t take a "build everything from scratch" mindset, Markus noted; it’s more relying on models that are “interchangeable and selectable” and “never rebuilding a commodity.” As functionality matures across the industry, they’ll deprecate homegrown tools in lieu of off the shelf options, he explained.

“Because in this space, things change every week, if we're lucky, sometimes multiple times a week,” he said. “We need to be able to pilot, plug in and plug out different components.”

They do “really rigorous” evaluations of available options as well as their own; for instance, their Ask Data with Relational Knowledge Graph has topped the Spider 2.0 text to SQL accuracy leaderboard, and other tools have scored highly on the BERT SQL benchmark.

In the case of homegrown agentic tools, his team uses LangChain as a core framework, fine-tunes models with standard retrieval-augmented generation (RAG) and other in-house algorithms, and partners closely with Microsoft, using the tech giant’s search functionality for their vector store.

Ultimately, though, it’s important not to just fuse agentic AI or other advanced tools into everything for the sake of it, Markus advised. “Sometimes we over complicate things,” he said. “Sometimes I've seen a solution over engineered.”

Instead, builders should ask themselves whether a given tool actually needs to be agentic. This could include questions like: What accuracy level could be achieved if it was a simpler, single-turn generative solution? How could they break it down into smaller pieces where each piece could be delivered “way more accurately”?, as Markus put it.

Accuracy, cost and tool responsiveness should be core principles. “Even as the solutions have gotten more complicated, those three pretty basic principles still give us a lot of direction,” he said.

How 100,000 employees are actually using it

Ask AT&T Workflows has been rolled out to 100,000-plus employees. More than half say they use it every day, and active adopters report productivity gains as high as 90%, Markus said.

“We're looking at, are they using the system repeatedly? Because stickiness is a good indicator of success,” he said.

The agent builder offers “two journeys” for employees. One is pro-code, where users can program Python behind the scenes, dictating rules for how agents should work. The other is no-code, featuring a drag-and-drop visual interface for a “pretty light user experience,” Markus said.

Interestingly, even proficient users are gravitating toward the latter option. At a recent hackathon geared to a technical audience, participants were given a choice of both, and more than half chose low code. “This was a surprise to us, because these people were all very competent in the programming aspect,” Markus said.

Employees are using agents across a variety of functions; for instance, a network engineer may build a series of them to address alerts and reconnect customers when they lose connectivity. In this scenario, one agent can correlate telemetry to identify the network issue and its location, pull change logs and check for known issues. Then, it can open a trouble ticket.

Another agent could then come up with ways to solve the issue and even write new code to patch it. Once the problem is resolved, a third agent can then write up a summary with preventative measures for the future.

“The [human] engineer would watch over all of it, making sure the agents are performing as expected and taking the right actions,” Markus said.

AI-fueled coding is the future

That same engineering discipline — breaking work into smaller, purpose-built pieces — is now reshaping how AT&T writes code itself, through what Markus calls "AI-fueled coding."

He compared the process to RAG; devs use agile coding methods in an integrated development environment (IDE) along with “function-specific” build archetypes that dictates how code should interact.

The output is not loose code; the code is “very close to production grade,” and could reach that quality in one turn. “We've all worked with vibe coding, where we have an agentic kind of code editor,” Markus noted. But AI-fueled coding “eliminates a lot of the back and forth iterations that you might see in vibe coding.”

He sees this coding technique as “tangibly redefining” the software development cycle, ultimately shortening development timelines and increasing output of production-grade code. Non-technical teams can also get in on the action, using plain language prompts to build software prototypes.

His team, for instance, has used the technique to build an internal curated data product in 20 minutes; without AI, building it would have taken six weeks. “We develop software with it, modify software with it, do data science with it, do data analytics with it, do data engineering with it,” Markus said. “So it's a game changer.”



Source_link

Related Posts

AI recession: A memo laid out how AI could kill jobs. Wall Street panicked.
Technology And Software

AI recession: A memo laid out how AI could kill jobs. Wall Street panicked.

February 26, 2026
New Webb Telescope photos show off the Exposed Cranium Nebula
Technology And Software

New Webb Telescope photos show off the Exposed Cranium Nebula

February 26, 2026
Everyone Speaks Incel Now | WIRED
Technology And Software

Everyone Speaks Incel Now | WIRED

February 26, 2026
Nvidia has another record quarter amid record capex spends
Technology And Software

Nvidia has another record quarter amid record capex spends

February 25, 2026
Anthropic just released a mobile version of Claude Code called Remote Control
Technology And Software

Anthropic just released a mobile version of Claude Code called Remote Control

February 25, 2026
Trump made tax day more complicated. ChatGPT and Claude can make it easier.
Technology And Software

Trump made tax day more complicated. ChatGPT and Claude can make it easier.

February 25, 2026
Next Post
Amity Park Walkthrough Guide – Followchain

Amity Park Walkthrough Guide - Followchain

POPULAR NEWS

Trump ends trade talks with Canada over a digital services tax

Trump ends trade talks with Canada over a digital services tax

June 28, 2025
Communication Effectiveness Skills For Business Leaders

Communication Effectiveness Skills For Business Leaders

June 10, 2025
15 Trending Songs on TikTok in 2025 (+ How to Use Them)

15 Trending Songs on TikTok in 2025 (+ How to Use Them)

June 18, 2025
App Development Cost in Singapore: Pricing Breakdown & Insights

App Development Cost in Singapore: Pricing Breakdown & Insights

June 22, 2025
Google announced the next step in its nuclear energy plans 

Google announced the next step in its nuclear energy plans 

August 20, 2025

EDITOR'S PICK

Clarins launches its mobile app to elevate its customer relationship, in partnership with Merkle

Clarins launches its mobile app to elevate its customer relationship, in partnership with Merkle

December 20, 2025
How Much Does It Cost in 2025?

How Much Does It Cost in 2025?

August 20, 2025
Position Yourself as a Health Expert Journalists Actually Call

Position Yourself as a Health Expert Journalists Actually Call

December 5, 2025
Trump Mobile is promoting its smartphone with terribly edited photos of other brands’ products

Trump Mobile is promoting its smartphone with terribly edited photos of other brands’ products

August 22, 2025

About

We bring you the best Premium WordPress Themes that perfect for news, magazine, personal blog, etc. Check our landing page for details.

Follow us

Categories

  • Account Based Marketing
  • Ad Management
  • Al, Analytics and Automation
  • Brand Management
  • Channel Marketing
  • Digital Marketing
  • Direct Marketing
  • Event Management
  • Google Marketing
  • Marketing Attribution and Consulting
  • Marketing Automation
  • Mobile Marketing
  • PR Solutions
  • Social Media Management
  • Technology And Software
  • Uncategorized

Recent Posts

  • Open Banking in Australia: Guide for Enterprise Businesses
  • What Different Generations Want from B-to-B Events
  • Trump claims tech companies will sign deals next week to pay for their own power supply
  • What Is Keyword Search Volume? + 5 Free Tools to Check It
  • About Us
  • Disclaimer
  • Contact Us
  • Privacy Policy
No Result
View All Result
  • Technology And Software
    • Account Based Marketing
    • Channel Marketing
    • Marketing Automation
      • Al, Analytics and Automation
      • Ad Management
  • Digital Marketing
    • Social Media Management
    • Google Marketing
  • Direct Marketing
    • Brand Management
    • Marketing Attribution and Consulting
  • Mobile Marketing
  • Event Management
  • PR Solutions