FREYA Logo FREYA WP2 User Story 8 As a longitudinal study, I want to be able to deduplicate the metrics/impact for our data, so that I can see the impact of our study’s data as a whole.

Scientific datasets may be composed of individual components, whereby the parent and each component are identified by a different DOI and hence can be cited, viewed and downloaded individually. In order to assess the reuse such datasets, their authors must be able to aggregate views, downloads and citations metrics across all the dataset components.

This notebook uses the DataCite GraphQL API to retrieve all parts of the dataset: 2014 TCCON Data Release dataset, so that its overall impact can be quantified.

Goal: By the end of this notebook, for a given dataset with constituent parts, you should be able to display:

Install libraries and prepare GraphQL client

Define and run GraphQL query

Define the GraphQL query to retrieve 2014 TCCON Data Release dataset.

Run the above query via the GraphQL client

Display total dataset metrics

Display total number of citations, views and downloads of 2014 TCCON Data Release dataset, aggregated across all the parts.

Plot metric counts per part

Plot stacked bar plot showing how the individual parts of 2014 TCCON Data Release dataset contribute their metric counts to the corresponding aggregated total.