Nomad Data offers data discovery, allowing companies to search for specific types of data in natural language. Schneider highlights the value of previously "worthless" data, such as millions of consumer records, company records, or government filings, which LLMs can now infer in seconds. He also mentions the growing trend of large media companies licensing their data to OpenAI and other LLM companies, and the increasing number of corporations, including automotive manufacturers and insurers, selling their data on the Nomad platform.
Key takeaways:
- Large Language Models (LLMs) are increasingly requiring highly-specific datasets for AI inference, creating a growing market for data discovery and matching services.
- Nomad Data, a New York City company, uses its own LLMs to match over 2,500 data vendors to data buyers, helping companies find specific types of data they need for their own LLM inference use cases.
- While training data is important, the constant feeding of live data through a trained AI model for inference is just as crucial, especially for large companies looking to take advantage of generative AI.
- Nomad Data is signing up media companies and other corporations as data vendors, with the aim of licensing their data to OpenAI and other LLM companies for training and inference purposes.