For AI Platforms, What Is Good Data?

Share this 

Share on facebook
Share on twitter
Share on linkedin

Last summer, my colleague Tony Vlismas warned us about conflating AI data with commodities, highlighting that data is a means to creating the commodity, not the commodity itself. That’s not to say you don’t need “good data.”

“Artificial intelligence is a bankable buzzword, but its use pales in comparison to the most recent hype-laden phrase: data. Data is a big part of how AI works in all of its implementations, and given data’s importance, many opinion pieces and quick-hit news stories try to simplify what exactly it is we mean when we say data, treating it as an obtuse object rather than the complicated and multifaceted element of a AI it actually is. This does a disservice to the varying types of data different AI implementations require for success.”

Many platforms and solutions in a myriad of industries boast about their data and how it can help you or your business, but it’s a blanket term for heaps of unqualified information.

How you qualify and use data is just as important as the data itself.

Not A Commodity

In “Data Is Not The New Oil,” authors Jocelyn Goldfein and Ivy Nguyen of Zetta Venture Partners remark on the dangers of basing investment decisions on quantity of data alone:

“In all the enthusiasm for big data, it’s easy to lose sight of the fact that all data is not created equal. Startups and large corporations alike boast about the volume of data they’ve amassed, ranging from terabytes of data to quantities surpassing all of the information contained in the Library of Congress. Quantity alone does not make a “data moat.”

They go on to define what a successful data moat entails, including the accessibility of the data, the time it takes to make it useful, its cost, uniqueness, breadth, and perishability (data durability).

“A truly defensible data moat doesn’t come from just amassing the largest volume of data. The best data moats are tied to a particular problem domain, in which unique, fresh, data compounds in value as it solves problems for customers.”

Bad Data = Useless Technology

In “If Your Data Is Bad, Your Machine Learning Tools Are Useless,” Thomas C. Redman too rails against the perils of poor data quality.

“To properly train a predictive model, historical data must meet exceptionally broad and high quality standards. First, the data must be right: It must be correct, properly labeled, de-deduped, and so forth. But you must also have the right data — lots of unbiased data, over the entire range of inputs for which one aims to develop the predictive model. Most data quality work focuses on one criterion or the other, but for machine learning, you must work on both simultaneously.”

According to Redman, a vast amount of today’s data fails to meet a standard of quality that makes it useful for application – not necessarily because the data is bad per se, but instead it wasn’t checked and cleansed to remove the consequences of expectations, poor calibration, and human bias and error: “Increasingly-complex problems demand not just more data, but more diverse, comprehensive data.”

Redman’s advice for a quality review program are comprehensive, and those curious should read the full article to start implementing them for their own organizations and efforts.

None of these recommendations are easy quick fixes, but those using data to inform their AI and machine learning efforts need to make sure their tools are fuelled by useful data. This is especially true in customer support and cognitive care, where users are often already approaching automated systems with negative connotations. There is little room for frustration or error.

0 replies on “For AI Platforms, What Is Good Data?”

Let’s supercharge your virtual agent together

With outstanding customer experience being the most important differentiator, make sure your virtual agent is the one leaving the competition in the dust.

Wysdom is a conversational AI optimization platform that enables higher performing, lower cost conversational virtual agents, continually learning and delivering high-quality customer experiences.

© Copyright Wysdom 2021. All rights reserved.

Artiom Kreimer

VP, Product & Analytics

Artiom has spent 10 years in software and mobile engineering, specializing in quality assurance and customer service. He has worked in testing and QA at both startups and in enterprises such as Clickfree, TELUS, and Freescale Semiconductor.

Michel Benitah

VP, Optimization & Delivery

Michel has 20 years of experience in leading the successful delivery of Conversational AI and Natural Language Customer Care solutions to some of the largest financial, telco, healthcare, utilities, and retail enterprises throughout North America. 


Prior to joining Wysdom, Michel spent 20 years at Nuance Communications, holding senior management and leadership positions within the enterprise division, most recently as director of the Toronto office and professional services team.

Frederic Lam

SVP, Sales

Fred brings in 25 years of international experience in sales and business development across North America, the Caribbean, Asia-Pacific, Europe, and the Middle-East.


Prior to Wysdom.AI, he held sales leadership positions at Oracle, Redknee, and Movius/Glenayre, successfully growing revenues in both large and small organizations. Fred has also been involved in the start-up community in the earlier stages of his career as an Investment Manager with SP Capital and was an alternate director on a few investee companies.

Karen Chan

Chief Engineering Officer, Co-Founder

With 20 years of experience in software and mobile, Karen has held senior technical roles at 5 startups, including Wysdom.AI, Clickfree, Mobile Diagnostix (HP), Teamatic, and Virtualthere.

Karthik Balakrishnan

Chief Technology Officer

Karthik has over a decade of hands-on, proven global expertise in emerging technologies and implementing intricate platforms and solutions for telecommunications and enterprise during his time at Amdocs, with senior positions in their India, Cyprus, America, and Canada offices.

Nitin Singhal

Chief Operating Officer

Nitin has over 20 years of success in global executions of business technology, driving operational efficiency and digital scalability for some of the world’s largest enterprise clients. 


Nitin spent 16 years at Redknee holding executive positions in Research and Development, Customer Operations, Partner Alliances, and most recently as COO.

Jeff Brunet​

President, Co-Founder

Jeff has more than 20 years of experience in the startup world, founding and growing 4 software companies: AracNet, Mobile Diagnostix (HP), ClickFree, and Wysdom.AI. 


His in-depth understanding of software development and the challenges in making new technologies successful in the startup world prove invaluable as he serves on the boards of XMG, SurfEasy (Opera), Locationary (Apple), Groupie, and as an advisor to Pushlife (Google), LogMeIn (IPO) and HP. 


Jeff holds 23 issued patents in the wireless and consumer electronics spaces and is the lead inventor on 30+ pending patents.

Ian Collins​

CEO, Co-Founder

Ian has founded and grown 6 technology companies over the past 20 years, primarily in the enterprise software space including Wyrex, Mobile Diagnostix (HP), Clickfree, and most recently Wysdom.AI. 


Ian invests, mentors, and sits on the boards of several startups in the Toronto area.