There are a number of systems that can capture voice data and many others that can retrieve it, store it, and analyze it. However, there is no single standard defining how institutional voice data needs to be captured. Firms have ample amounts of voice metadata detailing where a conversation began and where it ended, but one of the issues the industry faces is that there is no single agreement on how to represent voice data in a unified way. This has resulted in consumers of that data having to integrate and manage a wide array of unstructured data formats.
As customers became more interested – and in some cases compelled – to capture information from voice communications, it became clear that this issue needed to be addressed.
The FINOS Voice Metadata Standardization working group has taken a number of steps to try to unify the metadata. However, in order for these steps to be enacted and become valuable to users, the standard needs to be adopted across the industry to ensure the process is uniform.
While the simple facet of the equation is capturing and storing voice data in a standard-based method, the more difficult element is transcribing the data and then structuring it so that firms can make more informed trading decisions and leverage it for regulatory purposes.
There are numerous vendors that have expertise in natural language processing (NLP), data structuring, and transcribing data. It will be the responsibility of the producers of such data to make it available via standards-based APIs whereby owners of this data can easily make it available to their vendors.