How Is Data Measured ?



Data is measured by the number of bits it takes to represent it. All information in a computer can be represented as a binary number consisting solely of 0’s and 1’s. Each 0 or 1 in a number is a bit. A bit is the smallest unit of storage in computers. Since computers work in binary (Base 2), this means that all the important numbers that differentiate between different data sizes will be powers of 2.  

A byte is a collection of 8 bits. Take a moment to examine the table below to get a feel for the difference between data measurements and their relative sizes to one another. 

Unit

Equivalent to

Abbreviation

Real-World Example

Byte

8 bits

B

1 character in a string

Kilobyte

1024 bytes

KB

A page of text (~4 kilobytes)

Megabyte

1024 Kilobytes

MB

1 song in MP3 format (~2-3 megabytes)

Gigabyte

1024 Megabytes

GB

~300 songs in MP3 format

Terabyte

1024 Gigabytes

TB

~500 hours of HD video

Petabyte

1024 Terabytes

PB

10 billion Facebook photos

Exabyte

1024 Petabytes

EB

~500 million hours of HD video

Zettabyte

1024 Exabytes

ZB

All the data on the internet in 2019 (~4.5 ZB)

The amount of data in the world is exploding and growing at an incredible pace every year. This growth is largely the result of the over 4.6 billion people around the world connected to the Internet. Now that smartphones and other Internet-connected devices have become common, they generate a staggering amount of new data. Many experts believe that the size of all the data on the Internet will swell to 175 ZB by the end of 2025!

The size of the dataset you’re working with usually determines which tool, spreadsheets or SQL, is best suited for the task. Spreadsheets often start to have performance issues as dataset sizes increase beyond a few megabytes. SQL databases are much better at working with larger datasets that have billions of rows with sizes measured in gigabytes. The dataset’s size still matters here--larger datasets will take longer for queries to complete, depending on the query’s content and the number of rows SQL has to process to complete the query.

Comments

Most Popular