Environmental Data Asset and Report Generation Management System

Name
Kristo Hõrrak
Abstract
In this thesis, the primary objective was to implement a data management solution for the generation of time-series environmental data assets and reports. Overall, the system should aid in preparing for a more open-science approach to asset generation by allowing versioning, metadata annotation, and the ability to trace how and why data assets were created. In order to facilitate collaboration on such data-driven work, a technology stack consisting of multiple open source solutions were chosen. The solution should enable researchers to perform a variety of duties, including data collection, metadata injection, data versioning, and asset materialization, with relative ease. The implemented system makes it apparent how data assets are dependent on one another and how, by utilizing the tools, users can determine what type of data they are working with. As a result of implementing such a solution, an environmental data asset and report system was developed for two Tahkuse environmental measurement station gas analyzers, allowing for the generation of multiple data assets and reports in a clear, comprehensible, validated, versioned, and visually understandable way.
Graduation Thesis language
English
Graduation Thesis type
Master - Data Science
Supervisor(s)
Urmas Hõrrak, Priit Adler
Defence year
2023
 
PDF