15  :books: Big data and machine learning

15.1 Introduction

15.1.1 Overview

15.1.2 Learning objectives

  • key dates and vocabulary from the Artificial Intelligence community

  • machine learning vs. statistics

  • trade-off between accuracy and generalizability

15.2 Big data

Big data is a combination of structured, semi-structured and unstructured data collected by organizations that can be mined for information. The term also refers to data sets that are too large or complex to be dealt with by traditional data-processing application software.

15.3 Development of Artificial Intelligence

15.4 Machine Learning vs. Statistics

15.5 Learning From Data: Accuracy or Generalizability?

15.5.1 Train / test split

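We can split the original data into a training set, used to fit the model, and a testing set, held back to estimate how well the model generalizes to data it has not seen.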
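
As a minimal sketch of such a split, the snippet below shuffles the data and holds back a fraction of it for testing. The function name `split_train_test`, the 25% test fraction and the seed are illustrative assumptions, not values taken from this chapter; in practice a library helper such as scikit-learn's `train_test_split` is normally used instead.

```python
import random


def split_train_test(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of `rows` and split it into (train, test) lists.

    Hypothetical helper for illustration only.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(rows)                   # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)   # seeded shuffle for reproducibility
    n_test = round(len(shuffled) * test_fraction)
    # The first n_test shuffled rows become the test set, the rest the training set.
    return shuffled[n_test:], shuffled[:n_test]


data = list(range(20))                      # toy data set of 20 observations
train, test = split_train_test(data, test_fraction=0.25, seed=42)
print(len(train), len(test))                # 15 5
```

Shuffling before splitting matters when the rows are stored in a meaningful order (for example sorted by date or by class), since otherwise the test set would not be representative of the training set.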