
JP Morgan Data Scientist Eyes Insights from Internal Data
The bank’s asset management arm believes that trawling its home waters for data will land a valuable catch. Risk.net’s Faye Kilburn speaks to the data scientist at its helm.
The firm’s three flagship real estate funds—which buy individual properties rather than portfolios—manage roughly $50 billion in assets globally. Over the years, the funds have naturally amassed research on the US real estate market that could now bear fruit for the rest of the business, says Ravit Mandell, chief data scientist for JPMAM’s intelligent digital solutions division, who joined the asset manager to lead its Big Data and deep learning efforts last year from JP Morgan’s corporate and investment bank, where she was head of the quantitative market making and swap trading desk, responsible for macro market strategy, the strategic investments portfolio, Big Data analytics, and market structure initiatives.
“Whenever [the funds] buy a mall, we collect data on the per-square-foot purchase cost, the foot traffic, when was the last time they changed the air conditioning or painted the parking lot, so we have a lot of internal data in the US,” she says.
As well as providing this data to its own investment team to steer the analysts on the real estate sector and the health of the broader US economy, the firm may give it to institutional clients to inform their broader real estate allocations. Using data mining techniques, the firm aims to find previously unexploited uses for data held within the group.

Knowing Where to Look
The asset manager initially held alignment sessions to determine what intellectual property exists in the firm. “We have hired data strategists and engineers to find data internally. Someone, somewhere always has it,” Mandell says.
The hunt for internal data is part of a broader effort to build a data science culture at JPMAM, where fundamental analysts and traders incorporate machine learning and non-traditional forms of data in their daily decision making.
The firm’s data efforts are housed in its so-called “data lab,” currently staffed by 21 data scientists and engineers—most of whom have been hired in the past three months—though Mandell expects headcount to double over the next 18 months.
The group plans to apply machine learning analysis to proprietary information and internal data from its investment bank parent, and is looking into applying sentiment analysis to more than 100,000 company research reports written by JPMAM analysts, who are required to write reports on any company they meet. Mandell says this could provide proprietary insight for JPMAM’s investment process.
“The investment bank has been around for 150 years, and has collected so much data. As a bank-owned asset manager, we have a lot of internal data. But when it comes to data mining, the question is how to make all of that data useful,” she adds.
War of the Words
To date, the firm has been using so-called “bag of words” language processing to analyze text in earnings call transcripts, primarily to predict revisions in earnings estimates for its emerging markets and Asia-Pacific equities, behavioral finance equities and beta strategies teams. This method assigns sentiment scores to individual words but due to its “binary” nature, cannot handle some complexities of human language. For instance, it fails to judge the neutral sentiment of a phrase like “it’s not the worst,” Mandell says.
So, the asset manager is rolling out a new natural language processing model based on recurrent neural networks used by Google and Apple’s Siri. The technology can memorize inputs and make predictions—as opposed to “feedforward” neural networks, which are primarily used for pattern recognition—and will be able to understand the sentiment of phrases in analyst reports and other documents, in the context of the fund manager’s business. To train the neural network, the firm has built JP Morgan-specific dictionaries, which feed the language processing models. For example, the new models will be able to assign a negative sentiment to a term like “higher deficits,” whereas previous models would incorrectly deduce the term to be both good (higher) and bad (deficits).
Mandell says the new model will allow the firm to extract information from research notes, broker reports and filings to help anticipate company earnings revisions; gain insight from corporate filings (e.g., 10-Ks, 10-Qs, prospectuses, etc.) to enhance its research efforts; and use temporal clustering and sentiment analysis on news to better understand what’s being discussed and where consensus is forming.
For five years, JP Morgan operated a centralized data science unit that oversaw data architecture and strategy for the entire group in one place. However, the group is now diversifying to integrate data science capabilities into its four business lines: central investment banking, corporate client banking, asset and wealth management, and technology. “The math behind it has existed for many years but now we have the compute power to figure things out that we couldn’t two years ago,” Mandell says.
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@waterstechnology.com or view our subscription options here: http://subscriptions.waterstechnology.com/subscribe
You are currently unable to print this content. Please contact info@waterstechnology.com to find out more.
You are currently unable to copy this content. Please contact info@waterstechnology.com to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@waterstechnology.com
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@waterstechnology.com
More on Data Management
Waters Wavelength Ep. 314: Capco’s Bertie Haskins
Bertie Haskins, executive director and head of data for Apac and Middle East at Capco, joins to discuss the challenges of commercializing data.
Nasdaq, AWS offer cloud exchange in a box for regional venues
The companies will leverage the experience gained from their relationship to provide an expanded range of services, including cloud and AI capabilities, to other market operators.
Bank of America reduces, reuses, and recycles tech for markets division
Voice of the CTO: When it comes to the old build, buy, or borrow debate, Ashok Krishnan and his team are increasingly leaning into repurposing tech that is tried and true.
Navigating the tariffs data minefield
The IMD Wrap: In an era of volatility and uncertainty, what datasets can investors employ to understand how potential tariffs could impact them, their suppliers, and their portfolios?
Project Condor: Inside the data exercise expanding Man Group’s universe
Voice of the CTO: The investment management firm is strategically restructuring its data and trading architecture.
Tariffs, data spikes, and having a ‘reasonable level of paranoia’
History doesn’t repeat itself, but it rhymes. Covid brought a “new normal” and a multitude of lessons that markets—and people—are still learning. New tariffs and global economic uncertainty mean it’s time to apply them, ready or not.
HSBC’s former global head of market data to grow Expand Research consulting arm
The business will look to help pull together the company’s existing data optimization offerings.
Stocks are sinking again. Are traders better prepared this time?
The IMD Wrap: The economic indicators aren’t good. But almost two decades after the credit crunch and financial crisis, the data and tools that will allow us to spot potential catastrophes are more accurate and widely available.