Roadmap
LibreDataHub is under active development with several planned features to improve user experience and extend platform capabilities.
General features
User Interface
- Homepage improvements: Enhanced interface with application filtering by category and user profile
- Improved file management: Password-protected folders and better file organization
- Project dashboard: Research project tracking with metrics and statistics
New Applications
- Marimo: Reactive and interactive Python notebook system
- Dash: Python framework for creating analytical web applications
- OnlyOffice: Complete office suite accessible via browser
- Draw.io: Diagram and schema tool
- Gitea/GitLab: Integration of Git repository managers
- Superset: Open-source business intelligence platform
- OpenWebUI: Graphical interface for language models (LLM)
Databases
- Solr support: Integration of Apache Solr search engine
- Neo4J support: Integration of Neo4J graph database
Infrastructure and Security
- Shared resource management: GPU support for compute-intensive tasks
- Encrypted backups: Encrypted database backups
- Automated S3 backups: Integration with Rekkord.org for automated backups
- Platform observability: Log analysis, monitoring and alerts to ensure long-term system stability
Orchestration and Workflows
- Data processing orchestration: Data processing orchestration across services and tools
- Dashboard sharing: Dashboard sharing for collaborative analysis and reporting
Features for Education
Several features are planned to support teaching activities:
- Virtual classroom creation: Virtual classroom linked to Git
- Automated testing: Automated testing via continuous integration (CI)
- Code review: Code reviews directly on GitLab
- Automated distribution and collection: Automated distribution and collection of assignments
- Improved documentation: Improved documentation and developer onboarding
Features for Research
The focus is on extending the technical infrastructure to support:
- Signal processing: Signal (waveform) analysis for biomedical data processing
- Federated research: Federated research across multiple LDH instances
- User-friendly file management: User-friendly file management for non-developers
- Project dashboard: Project dashboard for research project tracking
Integration with InterHop Tools
- Interoperability with InterHop software: Seamless integration with InterHop’s ecosystem, including Goupile.fr (eCRF) and Linkr, InterHop’s low-code platform
- Advanced LLM integration: Strong integration between GPU, large language models (LLM) and graphical interface similar to OpenWebUI to facilitate advanced research workflows
Demonstration Environment
- Complete demonstration environment: Complete demonstration environment, including demonstration notebooks, sample datasets, Airflow jobs and dashboards to help users quickly explore and understand the platform
- Easily accessible demo instance: A publicly accessible demo instance will be available, allowing users to test LibreDataHub without installation and account creation. This non-HDS (not authorized for health data) version provides a quick way to explore the platform’s features and capabilities before deploying your own instance.
Compliance and Standards
- EHDS alignment: LibreDataHub is well aligned with the upcoming European Health Data Space (EHDS) framework and could serve as a federated platform in this context
Funding
LibreDataHub development has been entirely volunteer-driven until now. The project has submitted an application to the NGI Zero Commons Fund to accelerate the development of these features.