TSDat: An open_source Data Standardization Framework for Marine Energy and Beyond

Many organizations are tasked with the collection
and processing of large quantities of data from various
measurement devices. Data reported from these sources are often
not interoperable with datasets and software used by analysts and
other organizations in the same domain, introducing barriers for
collaboration on large-scale projects. This poses a particular
problem for cross-device comparisons and machine learning
applications, which rely on large quantities of data from multiple
sources. To address these challenges, the open-source Time-Series
Data Pipelines (Tsdat) Python framework was developed by
Pacific Northwest National Laboratory, with strategic guidance
and direction provided by the National Renewable Energy
Laboratory and Sandia National Laboratories to facilitate
collaboration and accelerate advancements in the marine energy
domain through the development of an open-source ecosystem of
tools. This paper will describe the Tsdat framework and the data
standards within which it operates. A beta version of Tsdat has
been released and is being used by several projects in marine
energy, wind energy, and building energy systems.