Introduction
Do you want a document that is secured, difficult to edit, easily compressed and portable? If so, please stick with PDFs. But at a time when the world is moving toward collaborative practices with data at the core of this movement, how can we keep it locked in a Portable Document Format, difficult to reach?If you have faced the challenge of having to extract data from a pdf, this tutorial is for you. If you haven’t, read on anyway because if you want to keep up with the growing trend of data, extracting is something you should know.
This module provides a brief overview of the different techniques used to extract data from PDFs, with a focus on introducing Tabula, a free open-source tool build for this specific task.
Modules Objectives
- Discovering the different techniques available for extracting PDFs
- Learning how to use Tabula to extract data from PDFs
Prerequisites
- None
