LABIOS
a label-based i/o system
labios (label-based i/o system) is a groundbreaking distributed storage framework that introduces a novel data labeling paradigm to address the challenges of modern high-performance computing (hpc) and big data storage systems. as scientific applications become increasingly data-intensive and storage requirements grow more complex, labios provides a unified solution that enables efficient data access, storage elasticity, and computational storage capabilities.
patents
our innovative approach to label-based i/o has resulted in groundbreaking patents that protect our intellectual property and demonstrate the uniqueness of our solution:
Label-Based Data Representation I/O Process and System
what makes labios special?
labios revolutionizes storage access through its innovative label-based paradigm. just as a shipping label contains all the necessary information for package delivery, labios data labels encapsulate both the data and instructions for its handling. this approach enables:
- asynchronous i/o operations
- dynamic storage resource allocation
- intelligent data placement and movement
- computational storage capabilities
- seamless integration of different storage backends
behind the innovation
labios emerged from the recognition that modern applications require a more flexible and efficient approach to data storage and access. funded by the national science foundation, the project aims to create a unified storage system that can support diverse and often conflicting i/o workloads under a single scalable platform.
key innovations
- label-based abstraction: novel data representation that bundles operations with data
- decoupled architecture: separation of data, metadata, and instruction paths
- storage elasticity: dynamic resource provisioning based on workload characteristics
- active storage: support for in-situ and in-transit data analytics
- high performance: significant speedups for real scientific applications
- transparency: no modification needed to existing applications
real-world impact
labios is making significant contributions across various scientific domains:
- scientific computing: supporting applications like cm1 with efficient i/o operations
- data analytics: enabling seamless integration of analysis with computation
- deep learning: optimizing data movement for training and inference workflows
- workflow management: improving data sharing between workflow stages
technical architecture
labios consists of several key components working together:
- label manager: creates and manages data labels
- distributed queue: handles label scheduling and distribution
- worker pool: executes labels independently
- storage bridge: connects to external storage systems
- warehouse: manages intermediate data storage
looking forward
labios continues to evolve with exciting developments in:
- enhanced support for computational storage
- advanced label scheduling algorithms
- deeper integration with workflow systems
- extended support for diverse storage backends
join the labios community
labios is an open-source project welcoming contributions from both academic and industrial researchers:
- repository: github - labios project
- documentation: comprehensive guides and technical details
- research papers: latest findings and technical innovations
acknowledgements 🙏
the development of labios has been made possible through the support of the national science foundation (nsf). we’re grateful to our collaborators at the illinois institute of technology, sandia national laboratories, lawrence livermore national laboratory, and various research institutions whose expertise has been instrumental in advancing this project.
Interested in learning more about LABIOS or discussing potential collaborations? Feel free to reach out!
Related Publications
2024
- In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis SC , Nov 2024
- In Proceedings of the International Parallel and Distributed Processing Symposium , May 2024
2022
- In Proceedings of the 29th International Conference on High Performance Computing, Data, and Analytics , Dec 2022
- sc22 LabStor: A modular and extensible platform for developing high-performance, customized I/O stacks in userspaceIn Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , Nov 2022
2021
- cluster21 pMEMCPY: a simple, lightweight, and portable I/O library for storing data in persistent memoryIn Proceedings of the International Conference on Cluster Computing , Sep 2021
- In Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing , Jun 2021
2020
- In Proceedings of the ACM Transactions on Storage , Oct 2020
2019
- In Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing. ║ Best Paper Award ║ , Jun 2019
- In Proceedings of the 19th International Symposium on Cluster, Cloud and Grid Computing , May 2019
2018
- In Proceedings of the 25th International Conference on High Performance Computing , Dec 2018