Search⌘ K

Overview of dataset

Explore the structure of a dataset using Amazon's Top 50 bestselling books as an example. Understand how tabular data is organized in CSV files and the significance of each column in this real-world dataset. This lesson prepares you to analyze and interpret data effectively.

A dataset is the collection of data. In the case of tabular data, a dataset is arranged in tables. Each column in a table represents a variable, and each row acts as a record of the data.

Example dataset

Below, we can see a very basic medical chart created by a doctor to keep the information of patients.

Medical information in tabular form
Medical information in tabular form

Notice that the chart has three columns:

  • Name
  • Age
  • Weight

The chart has six rows. The first row of the chart holds Carol, 21, and 55 kg. Instead of writing, Carol is 2121 ...