Roadmap

LibreDataHub is under active development with several planned features to improve user experience and extend platform capabilities.

General features

User Interface

  • Homepage improvements: Enhanced interface with application filtering by category and user profile
  • Improved file management: Password-protected folders and better file organization
  • Project dashboard: Research project tracking with metrics and statistics

New Applications

  • Marimo: Reactive and interactive Python notebook system
  • Dash: Python framework for creating analytical web applications
  • OnlyOffice: Complete office suite accessible via browser
  • Draw.io: Diagram and schema tool
  • Gitea/GitLab: Integration of Git repository managers
  • Superset: Open-source business intelligence platform
  • OpenWebUI: Graphical interface for language models (LLM)

Databases

  • Solr support: Integration of Apache Solr search engine
  • Neo4J support: Integration of Neo4J graph database

Infrastructure and Security

  • Shared resource management: GPU support for compute-intensive tasks
  • Encrypted backups: Encrypted database backups
  • Automated S3 backups: Integration with Rekkord.org for automated backups
  • Platform observability: Log analysis, monitoring and alerts to ensure long-term system stability

Orchestration and Workflows

  • Data processing orchestration: Data processing orchestration across services and tools
  • Dashboard sharing: Dashboard sharing for collaborative analysis and reporting

Features for Education

Several features are planned to support teaching activities:

  • Virtual classroom creation: Virtual classroom linked to Git
  • Automated testing: Automated testing via continuous integration (CI)
  • Code review: Code reviews directly on GitLab
  • Automated distribution and collection: Automated distribution and collection of assignments
  • Improved documentation: Improved documentation and developer onboarding

Features for Research

The focus is on extending the technical infrastructure to support:

  • Signal processing: Signal (waveform) analysis for biomedical data processing
  • Federated research: Federated research across multiple LDH instances
  • User-friendly file management: User-friendly file management for non-developers
  • Project dashboard: Project dashboard for research project tracking

Integration with InterHop Tools

  • Interoperability with InterHop software: Seamless integration with InterHop’s ecosystem, including Goupile.fr (eCRF) and Linkr, InterHop’s low-code platform
  • Advanced LLM integration: Strong integration between GPU, large language models (LLM) and graphical interface similar to OpenWebUI to facilitate advanced research workflows

Demonstration Environment

  • Complete demonstration environment: Complete demonstration environment, including demonstration notebooks, sample datasets, Airflow jobs and dashboards to help users quickly explore and understand the platform
  • Easily accessible demo instance: A publicly accessible demo instance will be available, allowing users to test LibreDataHub without installation and account creation. This non-HDS (not authorized for health data) version provides a quick way to explore the platform’s features and capabilities before deploying your own instance.

Compliance and Standards

  • EHDS alignment: LibreDataHub is well aligned with the upcoming European Health Data Space (EHDS) framework and could serve as a federated platform in this context

Funding

LibreDataHub development has been entirely volunteer-driven until now. The project has submitted an application to the NGI Zero Commons Fund to accelerate the development of these features.