Computer Science

DGChain: Data control version for trustworthy reproducibility with Blockchain

Published on - The Sixth International Conference on Blockchain Computing and Applications (BCCA 2024)

Authors: Jose Armando Hernandez Gonzalez

This work presents the DGChain (Data-Git-for Blockchain) project. This Python package allows version control of data in blockchain and IPFS based on a DAO (decentralized autonomous organization) for managing data in the development cycles of reproducible computational scientific research. Analyzes the benefits of using this Blockchain-Based Decentralized Architecture to mediate collaborative interactions between developers compared to existing solutions. Presents a use case in developing a medical research project and typical IRIS example to offer the traceability of changes and provenance of metadata, data, and code in Data / Software Version Control systems through management of intrinsic hash-based persistent, immutable CIDs (Content Identifier) recorded in Merkle trees in the development cycle of its main products, publication, software source code, and Data to guarantee reproducibility and trustworthiness in computational scientific research using DGChain.