Golden Copy: Untapped Goldmine
Unstructured data has been around for a long time, but its importance is emerging in new ways
More firms and providers in the financial industry are trying to harness the value of data analytics and apply it to their data management. One facet of reference data is emerging though, as a giant "goldmine" for the application of data analytics, and that is unstructured data.
This type of data is probably nothing new, as Australian data-development expert Leif Hanlen describes it: "50 million Word documents saved over the last 10 years and put in a box" ... cannot automatically "turn out to be as good as the accounting machinery they have been using since the 1970s." Hanlen is a business development executive at the Commonwealth Scientific Industrial Research Organization (CSIRO), a digital operations consultancy, backed by Australian national and state governments, and CSIRO's Data61 subsidiary.
Unstructured data has started to bubble up as a matter of concern and interest in Inside Reference Data's coverage this year. In March, ISITC executive Jeff Zoller pointed to unstructured data about investment behavior and patterns, as well as industry analysts' commentary, and social and economic behavior, potentially supporting the disruption of data-management technology. In May, MUFG Canada Branch chief information and operations officer Ron Lee, speaking about how his firm is consolidating systems, noted this effort would increase the capability to work with unstructured databases.
The task for analytics of unstructured data is not to build a brand new goldmine, but to extract elements of information from that unstructured data
Leif Hanlen, Commonwealth Scientific Industrial Research Organization
Legal entity identifier (LEI) work is also addressing unstructured data, as Karla McKenna, head of standards at the Global Legal Entity Identifier Foundation, noted in June, saying entity legal form information – a data element in the LEI repository – remains in "unstructured form."
When addressing unstructured data, Hanlen counsels that firms should realize there are two different types. "Unstructured data sitting inside the enterprise – in the customer relationship-management system, in fields called ‘other' – is like a hole in the ground that's yet to become a goldmine," he says. External unstructured data from sources such as social media and market surveillance is already being tapped for its golden value, as Hanlen describes.
Data analytics can come into play for the former type of unstructured data, he says. "The task for analytics of unstructured data is not to build a brand new goldmine, but to extract elements of information from that unstructured data."
CSIRO and Data61 have worked on analyzing payments, documents predicting market outcomes and price signals based on the likelihood of future events. The challenge Hanlen finds in working with such unstructured data of a predictive rather than measuring nature is placing that data in a credible spectrum, which can be used to anticipate the future or a likely outcome.
Unstructured data may be a goldmine, but it appears to be one that industry experts are just beginning to figure out how to tap, for a variety of purposes.
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@waterstechnology.com or view our subscription options here: http://subscriptions.waterstechnology.com/subscribe
You are currently unable to print this content. Please contact info@waterstechnology.com to find out more.
You are currently unable to copy this content. Please contact info@waterstechnology.com to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@waterstechnology.com
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@waterstechnology.com
More on Data Management
Growing pains: Why good data and fortitude are crucial for banks’ tech projects
The IMD Wrap: Max examines recent WatersTechnology deep dives into long-term technology projects at several firms and the role data plays in those efforts.
Investing in the invisible, ING plots a tech renaissance
Voice of the CTO: Less than a year in the job, Daniele Tonella delves into ING’s global data platform, gives his thoughts on the future of Agile development, and talks about the importance of “invisible controls” for tech development.
Optiver relies on BMLL market data for quant strategy
The market-maker has built its trading business on top of BMLL’s Level 3 data. But the collaboration is young, and the pair have grand plans to make options the next quant frontier.
Bloomberg expands IBVAL; the SIPs and 24/5 trading; Broadridge’s agentic play, and more
The Waters Cooler: State Street embraces interop, Citi’s CIO outlines the XiNG risk platform, power companies explore alternative nuclear supply options to datacenters, and more.
As costs rise, buy-side CIOs urge caution on AI
Conference attendees encouraged asset managers to tread carefully when looking to deploy AI-driven solutions, citing high cost pressures.
XiNG: Inside Citi’s all-encompassing risk platform
Voice of the CTO: Citi’s chief information officer, Jon Lofthouse, explains how and why the bank has extended its enterprise-wide risk platform so that every trade in any asset class goes through it.
Demand for private markets data turns users into providers
Buy-side firms seeking standardized, user-friendly datasets are turning toward a new section of the alternatives market to get their fix—each other.
LSEG-AWS extend partnership, Deutsche Bank’s AI plans, GenAI (and regular AI) concerns, and more
The Waters Cooler: Nasdaq and MTFs bicker about data fees, Craig Donohue to take the reins at Cboe, and Clearwater closes its Beacon deal, in this week’s news roundup.