Caterpillar’s CEO had a problem. Jim Umpleby had stepped into the chief executive role in 2017 with a vision of achieving more profitable growth by selling more services and parts to the company’s heavy-equipment customers. Because offering real-time fleet management information services and selling parts online would depend on digital technologies, he set up a new division called Cat Digital. But the division’s head soon had unwelcome news for Umpleby: The company didn’t know its customers well enough to deliver on its goal. Customer data was siloed, fragmented, and, in the case of secondhand equipment, often entirely lacking.
That problem is one shared by countless leaders who see how digital can enable a growth strategy but are stymied by a legacy of fragmented, incomplete, and inconsistent data assets. They are frequently told that getting their data in order is a prerequisite for taking full advantage of new tools like artificial intelligence but too often see data transformation as an IT modernization project. They delegate the work to IT leaders and evaluate success based on cost, speed, and tool adoption. But when data is treated as IT infrastructure rather than as an enterprise asset, its impact is predictably limited. Companies that gain real value from their data actively involve the top management team in data transformation.
At Caterpillar, Umpleby and his team did more than simply sign off on the proposal by Ogi Redzic, chief digital officer of Cat Digital, to build a new enterprise digital data platform. In 2019, they demonstrated long-term commitment by giving Redzic’s team a generous three years to build the platform, dubbed Helios, that would allow the company, its customers, and its dealers to see consistent and complete fleet information across all applications and interactions. Executives went on to redesign data governance, elevating data ownership to senior leaders. By 2025, the Helios platform was supporting e-commerce, fleet management, and predictive and preventive maintenance — and Caterpillar had grown its services revenue from $14 billion in 2016 to $24 billion in 2024.
What Really Drives Data Transformation
Caterpillar’s experience underscores a critical lesson: Data transformation is not a purely technical exercise. Top company leaders must set a goal for the transformation in terms of business outcomes; give executives responsibility for data; commit resources to building an enterprise data platform; give all stakeholders a voice in the transformation; and direct strategic investments that take advantage of new data capabilities. Here, we’ll explore each of these areas and the lessons learned from Caterpillar’s experience.
1. Set (and monitor) an aggressive target for new revenue from the company’s use of data. One particularly consequential action that Umpleby took helped to focus data transformation efforts early on: In 2019, he set the bold goal of reaching $28 billion in services revenues by 2026. That narrowed the company’s attention on a small set of priority data initiatives and on closely tracking how data transformation led to increased services revenues.
Leaders initially prioritized developing a new tool, VisionLink, to help customers manage their fleets, and enabling customers to order parts via new e-commerce capabilities. Setting these priorities helped the team that was building the Helios platform focus on cleaning up the customer, customer contact, and equipment data needed for those solutions. Those three data types were known as “the trifecta” because they were key to answering important questions like “Which contact working for which customer should receive a sales or service offer?” and “Which contact working for which customer is responsible for replacing which machines?”
Top leaders regularly monitored Cat Digital’s progress on those priority initiatives using three measures of value. Value enablement was measured by the number of accurate trifecta records on the platform. Value created for customers and dealers was assessed based on the number of VisionLink users, how often the application was used, and how quickly that usage was growing. Value realization was tracked as revenue attributable to the new solutions, such as parts sales. This value-based reporting approach helped business leaders connect progress made in data initiatives to impact on the corporate income statement.
By establishing clear business goals, leaders keep decision-making aligned with strategic objectives. Because foundational work can take years, tracking and reporting progress that has been made is important to sustain stakeholders’ patience.
2. Give senior business executives ownership of data. Data becomes fragmented and inconsistent in many organizations because no one is accountable for its life cycle, quality, or reusability. At Caterpillar, establishing data as a corporate asset for which senior leaders were accountable focused attention on data quality. It also tied executives closely to the employees responsible for maintaining data products and related solutions.
To implement the new data-ownership policy, Cat Digital and the corporate IT team identified 14 enterprise data domains (such as engineering, finance, and human resources) and recruited about a dozen vice presidents across the company to own them. Some leaders represented functions such as finance or procurement, while others represented business units that had a strong stake in a particular data domain. Redzic took on customer, contact, and equipment data because those domains were key to services revenue. The vice presidents received information monthly about the quality of their data and were kept informed about the applications that used their data.
Cat Digital also established the role of data product owner — individuals who tracked value enablement and value creation metrics and were expected to drive data reuse and make data investments pay off. (See “Before and After Caterpillar’s Data Transformation.”) On the tech side, data product owners made sure that widely needed data, such as customer master lists and fleet information, were designed as reusable components. Owners of customer data came from the business side, bringing valuable domain expertise to the work of defining quality standards.