Data Quality for AI in Healthcare

Xavier Health February 2021

Bradley Merrill Thompson, Member of the Firm in the Health Care & Life Sciences practice, in the firm’s Washington, DC, office, co-authored a whitepaper titled “Data Quality for AI in Healthcare,” a collaborative piece developed under the leadership of the Xavier Health program at Xavier University in partnership with industry professionals.

Following is an excerpt:

Over the past few years, Artificial Intelligence (AI), and more specifically Machine Learning (ML) technology has experienced rapid adoption in the healthcare space as tools for diagnosis and decision-making. Such tools are intended to address challenges in the health care system to both process and the application of rapidly proliferating medical findings to practice, as well as to deliver on the promise of personalized and precision medicine.

The classic computer science idiom of “garbage in = garbage out” certainly holds true for ML systems; the quality of the data used to develop and test the system has a significant impact on the quality of the system output. There are many examples in the press where poor data quality led to poor recommendations and even led to instances of discrimination against certain patient populations, due to bias in the data that was used to develop the system. This paper, developed by the Xavier GMLP (Good Machine Learning Practices) Working Team, attempts to describe and address these potential data quality issues.

Learn more about new learnings and best practices in AI in health care by downloading the free whitepaper.