Compression (Cambridge (CIE) O Level Computer Science): Revision Note

Exam code: 2210

Written by: Robert Hampton

Reviewed by: James Woodhouse

Updated on 13 May 2025

The Need For Compression

What is compression?

Compression is reducing the size of a file so that it takes up less space on secondary storage
The impact of compression is:
- Less storage space required
- Less bandwidth required
- Shorter transmission time
Compression can be achieved using two methods, lossy and lossless

Lossy Compression

What is lossy compression?

Lossy compression is when data is lost in order to reduce the size on secondary storage
Lossy compression is irreversible
Lossy can greatly reduce the size of a file but at the expense of losing quality
Lossy is only suitable for data where reducing quality is acceptable, for example images, video and sound
In photographs, lossy compression will try to group similar colours together, reducing the amount of colours in the image without compromising the overall quality of the image

3 images of a husky dog, 1 with no compression, 1 at 50% and 1 at 80%

In the images above, lossy compression is applied to a photograph and dramatically reduces the file size
Data has been removed and the overall quality has been reduced, however it is acceptable as it is difficult to visually see a difference
Lossy compressed photographs take up less storage space which means you can store more and they are quicker to share across a network

Lossless Compression

What is lossless compression?

Lossless compression is when data is encoded in order to reduce the size on secondary storage
Lossless compression is reversible, the file can be returned to its original state
Lossless can reduce the size of a file but not as dramatically as lossy
Lossless can be used on all data but is more suitable for data where a loss in quality is unacceptable, for example documents
In a document, lossless compression algorithms such as run length encoding (RLE) can be used to analyse the contents looking for patterns and repetition.

What is run length encoding?

Run length encoding (RLE) is a form of lossless data compression that condenses identical elements into a single value with a count
For a text file, "AAAABBBCCDAA" is compressed to "4A3B2C1D2A"
The string has four 'A's, followed by three 'B's, two 'C's, one 'D', and two 'A's
RLE is used in bitmap images to compress sequences of the same colour
For example, a line in an image with 5 red pixels followed by 3 blue pixels could be represented as "5R3B"

Lossless file formats

Diagram to show loss in file size from DOCX to PDF

In the image above, lossless compression is automatically applied to document formats such as DOCX and PDF with a different rate of success
When you open a lossless compressed document the decompression process reverses the algorithms and returns the data back to its original state
Lossless compressed documents take up less storage space which means you can store more and they are quicker to share across a network

Worked Example

An email is sent containing a sound file.

Lossy compression is used to compress the sound file.

Explain two reasons why using lossy compression is beneficial. [4]

How to answer this question

What are the differences between lossy and lossless?
Can you state two differences? [2 marks]
Can you say why each point is a benefit? [2 marks]

Answer

Lossy will decrease the file size [1]
...so it can sent via email quicker [1]
Lossy means data is lost [1]
...the difference is unlikely to be noticed by humans [1]

Unlock more, it's free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Test yourself Flashcards

Did this page help you?

Previous:Calculating File SizesNext:Data Packets

Compression (Cambridge (CIE) O Level Computer Science): Revision Note

The Need For Compression

What is compression?

Lossy Compression

What is lossy compression?

Lossless Compression

What is lossless compression?

What is run length encoding?

Lossless file formats

Unlock more, it's free!

Join the 100,000+ Students that ❤️ Save My Exams

1. Data Representation

Number Systems

Computers & Binary

Number Systems

Converting Between Binary & Denary

Converting Between Hexadecimal & Denary

Converting Between Hexadecimal & Binary

Uses of Hexadecimal

Binary Addition

Binary Shifts

Two's Complement

Text, Sound & Images

Character Sets

Representing Sound

Representing Images

Data Storage & Compression

Units of Data Storage

Calculating File Sizes

Compression

2. Data Transmission

Types & Methods of Data Transmission

Data Packets

Packet Switching

Data Transmission

Universal Serial Bus (USB)

Methods of Error Detection

Error Checking

Error Detection Methods

Check Digits

Automatic Repeat Query (ARQ)

Encryption

Encryption

3. Hardware

Computer Architecture

The CPU & Microprocessors

Von Neumann Architecture

The Fetch-Decode-Execute Cycle

Characteristics of the CPU

CPU Instruction Sets

Embedded Systems

Input & Output Devices

Input Devices

Output Devices

Sensors

Data Storage

Primary Storage

Secondary Storage

Virtual Memory

Cloud Storage

Network Hardware

Network Interface Card (NIC)

MAC Addresses & IP Addresses

Router

4. Software

Types of Software & Interrupts

System Software & Application Software

The Purpose & Functionality of Operating Systems

Hardware, Firmware & the Operating System

Interrupts

Types of Programming Language, Translators & Integrated Development Environments (IDEs)

Levels of Programming Languages

Assembly Language

Translators, Compilers & Interpreters

Tools & Facilities in IDEs

5. The Internet & its Uses

The Internet & the World Wide Web

The Internet & the World Wide Web

Network Protocols

Web Browser