AEPD-EDPS Joint Paper – 10 Misunderstandings about Machine Learning

The EU has identified artificial intelligence (AI) as one of the most relevant technologies of the 21st century and highlighted its importance in the strategy for the EU’s digital transformation. With its wide range of applications, AI can contribute in areas as disparate as treating chronic diseases, fighting climate change, and anticipating cybersecurity threats.

https://edps.europa.eu/data-protection/our-work/publications/papers/2022-09-20-aepd-edps-joint-paper-10-misunderstandings-about-machine-learning_en

  • MISUNDERSTANDING: Correlation implies causality.
    • Fact: Causality requires more than finding correlations.
  • MISUNDERSTANDING: When developing machine learning systems, the greater the variety of data, the better.
    • Fact: ML training datasets must meet accuracy and representativeness thresholds.
  • MISUNDERSTANDING: ML needs completely error-free training datasets.
    • Fact: Well-performing ML systems require training datasets above a certain quality threshold.
  • MISUNDERSTANDING: The development of ML systems requires large repositories of data or the sharing of datasets from different sources.
    • Fact: Federated learning allows the development of ML systems without sharing training datasets.
  • MISUNDERSTANDING: ML models automatically improve over time.
    • Fact: Once deployed, an ML model's performance may deteriorate, and it will not improve unless the model receives further training.
  • MISUNDERSTANDING: Automatic decisions taken by ML algorithms cannot be explained.
    • Fact: A well-designed ML model can produce decisions understandable to all relevant stakeholders.
  • MISUNDERSTANDING: Transparency in ML violates intellectual property and is not understood by the user.
    • Fact: It is possible to provide meaningful transparency to AI users without harming intellectual property.
  • MISUNDERSTANDING: ML systems are less subject to human biases.
    • Fact: ML systems are subject to different types of bias, some of which stem from human biases.
  • MISUNDERSTANDING: ML can accurately predict the future.
    • Fact: ML system predictions are only accurate when future events reproduce past trends.
  • MISUNDERSTANDING: Individuals are able to anticipate the possible outcomes that ML systems can derive from their data.
    • Fact: The ability of ML to find non-evident correlations in data can lead to the discovery of new data, unknown to the data subject.
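
The federated learning point in the list above can be made concrete with a small sketch. The code below is not taken from the joint paper; it is a minimal illustration, assuming a one-parameter linear model (y ≈ w·x) and a plain weighted-averaging scheme in the style of federated averaging. The function names `local_update` and `federated_average`, the learning rate, and the toy datasets are all hypothetical choices made for this example.

```python
from typing import List, Tuple

# One client's private data: (x, y) pairs that never leave the client.
Dataset = List[Tuple[float, float]]


def local_update(w: float, data: Dataset, lr: float = 0.01, steps: int = 5) -> float:
    """Run a few gradient-descent steps on one client's own data.

    Assumed model: y_hat = w * x, with mean squared error as the loss.
    """
    for _ in range(steps):
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w -= lr * grad
    return w


def federated_average(clients: List[Dataset], rounds: int = 50) -> float:
    """Server loop: only the weight w is exchanged, not the raw (x, y) pairs."""
    w_global = 0.0
    total = sum(len(d) for d in clients)
    for _ in range(rounds):
        # Each client starts from the current global weight and trains locally.
        local_weights = [local_update(w_global, d) for d in clients]
        # The server combines the results, weighting each client by its sample count.
        w_global = sum(len(d) * w for d, w in zip(clients, local_weights)) / total
    return w_global


# Toy data (illustrative only): every client's points lie on y = 3x.
clients = [
    [(1.0, 3.0), (2.0, 6.0)],
    [(3.0, 9.0)],
    [(0.5, 1.5), (1.5, 4.5), (2.5, 7.5)],
]
w = federated_average(clients)
print(round(w, 3))
```

In this sketch the server only ever sees the weight each client returns, which is the sense in which the training datasets are not shared. Real deployments involve much larger models and additional safeguards that this example does not attempt to show.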