50% of firms are using AI or ML to spot data quality issues
How does your firm stack up?
About half of firms are using AI or machine learning-based techniques to spot data quality issues.
Of 10 major banks and asset managers surveyed for our ongoing 2025 Automation in Data Management Benchmark, 50% have used AI or machine learning to identify data quality issues in the past year.
An additional 30% of firms are working to do the same. And the firms that successfully spotted data quality issues using AI or ML remediated less than 25% of issues using the same techniques, meaning more than three-quarters of issues were remediated manually.
There’s a long way to go before banks and asset managers fully automate their data functions.
If you’re interested in how your firm compares, our benchmark is still open. Reach out to me at emmahilary.gould@infopro-digital.com to receive a link.
Only firms that complete the survey will receive the full results.
The sample is small, but the finding is significant. Our research shows that data management offices vary widely in how they automate, but almost all agree they are under intense pressure from executives and their boards to automate more.
Still, “tremendous” progress has been made automating core data governance, says Junaid Farooq, the founder of Pegasus 19 Consulting, which is currently working with a large bank’s data and AI office.
Data quality, one of the “pillars” of core data governance, is used by industry groups like the EDM Association to measure and standardize data management. Quality assesses the accuracy, completeness, consistency, timeliness, validity, and uniqueness of datasets.
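To make a few of those dimensions concrete, here is a minimal Python sketch of how completeness, uniqueness, and validity might be scored on a handful of records. The records, field names, and the ISIN-style format check are illustrative assumptions, not drawn from the benchmark or from any EDM Association specification, and the pattern checks format only, not the check digit.

```python
import re

# Hypothetical sample records; field names and values are illustrative only.
RECORDS = [
    {"id": "A1", "isin": "US0378331005", "price": 189.5},
    {"id": "A2", "isin": None, "price": 42.0},
    {"id": "A2", "isin": "GB0002634946", "price": 12.3},
    {"id": "A4", "isin": "BAD-ISIN", "price": 7.1},
]

# Assumed format rule: two letters, nine alphanumerics, one digit.
ISIN_PATTERN = re.compile(r"[A-Z]{2}[A-Z0-9]{9}[0-9]")


def populated(records, field):
    """Return the non-empty values of a field across all records."""
    return [r[field] for r in records if r.get(field) not in (None, "")]


def completeness(records, field):
    """Share of records in which the field is populated."""
    return len(populated(records, field)) / len(records) if records else 0.0


def uniqueness(records, field):
    """Share of populated values that are distinct."""
    values = populated(records, field)
    return len(set(values)) / len(values) if values else 0.0


def validity(records, field, pattern):
    """Share of populated values that match the expected format."""
    values = populated(records, field)
    if not values:
        return 0.0
    return sum(1 for v in values if pattern.fullmatch(v)) / len(values)


if __name__ == "__main__":
    print("completeness(isin):", completeness(RECORDS, "isin"))
    print("uniqueness(id):", uniqueness(RECORDS, "id"))
    print("validity(isin):", validity(RECORDS, "isin", ISIN_PATTERN))
```

Accuracy, consistency, and timeliness usually need a reference source or timestamps to score, so they are left out of this sketch.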
The bank Farooq is working with has programmed data quality agents to perform profiling (an exercise that evaluates the accuracy of data), write data quality rules, and detect anomalies in datasets, such as missing fields, fat-finger errors, or abbreviations where there should be full words or phrases.
“AI is very good at searching a large set of data, analyzing it, and summarizing it. When you can do that, identifying anomalies in your data from a data governance perspective is reliable, low-hanging fruit,” Farooq says.
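The three kinds of anomaly Farooq mentions (missing fields, fat-finger errors, and abbreviations) can also be illustrated with plain rules. The sketch below is a simple rule-based check, not the AI agents the bank has built; the field names, the abbreviation list, and the tenfold threshold for a suspected fat-finger error are all assumptions made for the example.

```python
from statistics import median

# Hypothetical configuration; none of these values come from the article.
REQUIRED_FIELDS = ("counterparty", "country", "notional")
TEXT_FIELDS = ("counterparty", "country")
ABBREVIATIONS = {"Ltd": "Limited", "Corp": "Corporation", "UK": "United Kingdom"}
FAT_FINGER_RATIO = 10.0  # flag values 10x above or below the median


def find_anomalies(records):
    """Return (row index, field, issue) tuples for simple rule violations."""
    anomalies = []
    notionals = [
        r["notional"] for r in records
        if isinstance(r.get("notional"), (int, float))
    ]
    typical = median(notionals) if notionals else None

    for index, record in enumerate(records):
        # Rule 1: required fields must be populated.
        for field in REQUIRED_FIELDS:
            if record.get(field) in (None, ""):
                anomalies.append((index, field, "missing"))

        # Rule 2: a value far from the median may be a fat-finger error.
        value = record.get("notional")
        if typical and isinstance(value, (int, float)):
            ratio = value / typical
            if ratio >= FAT_FINGER_RATIO or ratio <= 1 / FAT_FINGER_RATIO:
                anomalies.append((index, "notional", "possible fat-finger"))

        # Rule 3: known abbreviations where a full word is expected.
        for field in TEXT_FIELDS:
            text = record.get(field)
            if isinstance(text, str) and any(
                token in ABBREVIATIONS for token in text.split()
            ):
                anomalies.append((index, field, "abbreviation"))
    return anomalies


# Hypothetical trade records used only to exercise the rules.
TRADES = [
    {"counterparty": "Acme Corporation", "country": "United Kingdom", "notional": 1_000_000},
    {"counterparty": "Beta Ltd", "country": "UK", "notional": 1_200_000},
    {"counterparty": "", "country": "France", "notional": 900_000},
    {"counterparty": "Gamma Limited", "country": "Germany", "notional": 100_000_000},
    {"counterparty": "Delta Limited", "country": "Spain", "notional": 1_100_000},
]

if __name__ == "__main__":
    for anomaly in find_anomalies(TRADES):
        print(anomaly)
```

Fixed rules like these only catch the patterns someone thought to write down, which is part of why firms are looking to AI to write and extend such rules.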
Benchmarking is a new initiative on WatersTechnology to bring our readership trusted, independent information.
We will run more benchmarks next year covering how technology and data are used in financial markets.
If you are not a data professional but have ideas for future benchmarks, reach out at emmahilary.gould@infopro-digital.com.
Copyright Infopro Digital Limited. All rights reserved.