How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer (2024)

Updated 3 years ago 4 min read

How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer (1)

Scientific results or analyses are often visualized in journals in the form of graphs. As researchers, we are interested in studying these published visuals and want to further analyze and improve the results. However, the associated raw data with the published graphs are not always available in the papers. Sometimes, we might want to correlate our revised results with historical plots, and there is no numerical data published along with reports.

It brings us to the point where we need to somehow reverse-engineer graphs and extract the data from them in the numerical format. This process of reverse engineering is the digitization of data. In other words, we are retrieving the information embedded inside the graphics.

Extracting Data from Graphs or Images using PlotDigitizer

Extracting data from published graphs or images is not a simple process and could consume a significant amount of time without the right tools. PlotDigitizer is one such professional tool that is capable of extracting data from graphs.

PlotDigitizer is freemium software; the online version is free with limited functionality, while the offline version is paid. It is available for all operating systems.

How to use PlotDigitizer to extract data from Graphs?

The first step in the digitization is to get the graph in the image file format. If the visual is in document formats, like PDF, you can capture the screenshot of the graph. Or, if the scientific visual is in the physical format, you can scan the document and then crop out all unnecessary portions of the image.

PlotDigitizer has an in-built image editing tool kit with which you can crop, flip, rotate, or scale the image as required. For example, if the image is slightly tilted, you can use the rotate tool to align the image appropriately.

Besides standard XY, PlotDigitizer supports several types of graphs, e.g., polar, ternary, bar, column, pie/doughnut, map. Moreover, you can also calculate the distance, angle, and area on the image.

Here are simple steps for extracting data from XY graphs:

Step 1: Find the graph in an image format

The first, foremost step is to get a graph in image file formats, such as PNG, JPG, JPEG, SVG, GIF, TIFF.

Here, we have taken the following graphs as a sample image.

The curve below represents the solubility of oxygen (expressed on the y-axis) in water with temperature (expressed on the x-axis). The image is taken from ResearchGate.

How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer (2)

Step 2: Upload the image to PlotDigitizer

Upload or drag-drop the image in PlotDigitizer.

How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer (3)

Step 3: Adjusting the image

We can adjust the image to make it fit properly. For example, crop out the unwanted parts or rotate the image to make it align with the screen, or scale it to increase/decrease the resolution. In our case, the image is perfect; there is no need for any modifications.

Step 4: Calibrating the axes

Once the image is uploaded, the four points will appear around the center of the view. These are nothing but the x and y coordinates of the individual axes. Drag these points to the extremes of the plot. Here x1 and x2 are dragged, placed at 0 and 90, while y1 and y2 are dragged, placed at 0 and 16. You can use the zoom panel to increase the accuracy while calibrating the axes.

How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer (4)

Both axes are linear scales. Now, enter the values of the points x1, x2, y1, and y2, i.e., 0, 90, 0, and 16.

How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer (5)

Note: We have placed the points to the extremes to improve the accuracy, but it is not necessary. Also, the points do not necessarily have to be set on the axes. You can drop on them in-between the plot.

Now, the entire graph is calibrated. You can see the coordinate for the mouse’s cursor below the zoom panel. If you want, you hide x1, x2, y1, and y2 by clicking on the lock icon on the top of the window.

Step 5: Extracting the data points from the plot

Finally, we can extract data points from the curve. Mark various points on the curve and their respective values are recorded on the side panel.

How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer (6)

You can collect as many points as you want. Also, instead of manual extraction, you can use the automatic extraction feature to collect a large number of data points.

Step 6: Exporting the extracted data points

In the end, we can export the extracted points into other formats, like CSV, MS Excel, JSON, MatLab, Array.

Here, we have discussed the XY graph, but for every other type of graph, the process of digitization is very similar.

How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer (2024)

FAQs

How to Extract Data from Graphs or Images in Scientific Papers? — PlotDigitizer? ›

You can take screenshot of the Graph plotted and feed it as a image to Origin (or any other plot digitizer). I have used this web based tool to extract numerical data from plot images. It has provided me data with great accuracy from an image of a plot with a few simple clicks.

How to get data from a plot in a paper? ›

You can take screenshot of the Graph plotted and feed it as a image to Origin (or any other plot digitizer). I have used this web based tool to extract numerical data from plot images. It has provided me data with great accuracy from an image of a plot with a few simple clicks.

How do you extract data from data science? ›

How does Data Extraction work?
  1. Identifying Data Sources.
  2. Source Connection.
  3. Query or Retrieval.
  4. Data Transformation and Loading.
  5. Web Scraping.
  6. API-Based Extraction.
  7. Text Extraction (Natural Language Processing – NLP)
  8. OCR.
Jan 5, 2024

How to extract data from a chart? ›

You can use an OCR to extract the values and can even try fine-tuning it to capture the symbols. You can correlate the bounding boxes to find the values. Then you can use simple logic to capture the symbols and correlate them on the axis for the value. You can also use an LLM to understand the semantics of the chart.

How can I extract an image? ›

Navigate to the web page containing the image you want to extract. Take a screenshot of the page by pressing the Print Screen key on your keyboard (Windows) or Command+Shift+3 (Mac). Open the image files editing program, such as Paint or Photoshop, and paste the screenshot into the program.

How do I extract data from a document? ›

There are two ways to extract data from documents manually:
  1. Manual Data Entry: This method involves manually entering data from documents into a digital format. ...
  2. Copy-pasting: Data is manually copied from documents and pasted into the desired digital format.
Mar 12, 2024

What are the steps of data extraction? ›

Let's dive deeper into each component to gain a comprehensive understanding of the process.
  • Step 1: Identify Data Sources & Extract Relevant Data. The raw data is the data you extract from its source, such as a database or web page. ...
  • Step 2: Transform Data Into a Usable Format. ...
  • Step 3: Load Data Into the Target System.
Jul 29, 2023

What is an example of extracting data? ›

Extracting data from multiple sources occurs in different ways; for example, web scraping helps you extract user, product, and financial data off web pages; SQL helps extract data from database management systems, while APIs and webhooks enable data extraction from SaaS data integration tools like StreamSets.

What is an example of data extraction? ›

Types of Data Extraction

Customer Data: This is the kind of data that helps businesses and organizations understand their customers and donors. It can include names, phone numbers, email addresses, unique identifying numbers, purchase histories, social media activity, and web searches, to name a few.

What is the program to extract data points from a graph? ›

Opensource computer vision assisted software to help extract numerical data from images of plots, maps and much more.

Which command is used for extract data from a table? ›

SELECT statements

An SQL SELECT statement retrieves records from a database table according to clauses (for example, FROM and WHERE ) that specify criteria.

How do I extract data points from a graph image in Excel? ›

Extracting Data from Chart or Graph
  1. Step 1 : Consider an Excel sheet where you have a chart similar to the below image.
  2. Step 2 : Then click on Insert, select Module, and copy the below code into the text box.
  3. Step 3 : Then save the sheet as a macro−enabled template and click F5 to run the module.
Jul 20, 2023

How to extract data from chart image in Excel? ›

Then select the chart you want to extract data from and press Alt + F11 keys simultaneously, and a Microsoft Visual Basic for Applications window pops. 3. Click Insert > Module, then paste below VBA code to the popping Module window. VBA: Extract data from chart.

Can Excel pull data from an image? ›

Click Data > Data From Picture > browse and select the picture file > Open. The image should only depict the data you want to import. If necessary, crop the image.

Top Articles
Latest Posts
Article information

Author: Chrissy Homenick

Last Updated:

Views: 5771

Rating: 4.3 / 5 (74 voted)

Reviews: 81% of readers found this page helpful

Author information

Name: Chrissy Homenick

Birthday: 2001-10-22

Address: 611 Kuhn Oval, Feltonbury, NY 02783-3818

Phone: +96619177651654

Job: Mining Representative

Hobby: amateur radio, Sculling, Knife making, Gardening, Watching movies, Gunsmithing, Video gaming

Introduction: My name is Chrissy Homenick, I am a tender, funny, determined, tender, glorious, fancy, enthusiastic person who loves writing and wants to share my knowledge and understanding with you.