LABIOS

a label-based i/o system

labios (label-based i/o system) is a groundbreaking distributed storage framework that introduces a novel data labeling paradigm to address the challenges of modern high-performance computing (hpc) and big data storage systems. as scientific applications become increasingly data-intensive and storage requirements grow more complex, labios provides a unified solution that enables efficient data access, storage elasticity, and computational storage capabilities.

patents

our innovative approach to label-based i/o has resulted in groundbreaking patents that protect our intellectual property and demonstrate the uniqueness of our solution:

Active

Label-Based Data Representation I/O Process and System

Anthony KougkasHariharan DevarajanXian-He Sun
Issue Date: Dec 2, 2021
Domain: HPC, Big Data Analytics, ML/AI
Assignee: USA Patent Office
Patent Overview
Contact Us About Licensing


what makes labios special?

labios revolutionizes storage access through its innovative label-based paradigm. just as a shipping label contains all the necessary information for package delivery, labios data labels encapsulate both the data and instructions for its handling. this approach enables:

  • asynchronous i/o operations
  • dynamic storage resource allocation
  • intelligent data placement and movement
  • computational storage capabilities
  • seamless integration of different storage backends

behind the innovation

labios emerged from the recognition that modern applications require a more flexible and efficient approach to data storage and access. funded by the national science foundation, the project aims to create a unified storage system that can support diverse and often conflicting i/o workloads under a single scalable platform.

key innovations

  • label-based abstraction: novel data representation that bundles operations with data
  • decoupled architecture: separation of data, metadata, and instruction paths
  • storage elasticity: dynamic resource provisioning based on workload characteristics
  • active storage: support for in-situ and in-transit data analytics
  • high performance: significant speedups for real scientific applications
  • transparency: no modification needed to existing applications

real-world impact

labios is making significant contributions across various scientific domains:

  • scientific computing: supporting applications like cm1 with efficient i/o operations
  • data analytics: enabling seamless integration of analysis with computation
  • deep learning: optimizing data movement for training and inference workflows
  • workflow management: improving data sharing between workflow stages

technical architecture

labios consists of several key components working together:

  • label manager: creates and manages data labels
  • distributed queue: handles label scheduling and distribution
  • worker pool: executes labels independently
  • storage bridge: connects to external storage systems
  • warehouse: manages intermediate data storage

looking forward

labios continues to evolve with exciting developments in:

  • enhanced support for computational storage
  • advanced label scheduling algorithms
  • deeper integration with workflow systems
  • extended support for diverse storage backends

join the labios community

labios is an open-source project welcoming contributions from both academic and industrial researchers:

  • repository: github - labios project
  • documentation: comprehensive guides and technical details
  • research papers: latest findings and technical innovations

acknowledgements 🙏

the development of labios has been made possible through the support of the national science foundation (nsf). we’re grateful to our collaborators at the illinois institute of technology, sandia national laboratories, lawrence livermore national laboratory, and various research institutions whose expertise has been instrumental in advancing this project.


Interested in learning more about LABIOS or discussing potential collaborations? Feel free to reach out!

Related Publications

2024

  1. Luke Logan, Anthony Kougkas, and Xian-He Sun
    In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis SC , Nov 2024
  2. Neeraj Rajesh, Keith Bateman, Jean Luca Bez, Suren Byna, Anthony Kougkas, and Xian-He Sun
    In Proceedings of the International Parallel and Distributed Processing Symposium , May 2024

2022

  1. Keith Bateman, Neeraj Rajesh, Jaime Cernuda Garcia, Luke Logan, Jie Ye, Stephen Herbein, Anthony Kougkas, and Xian-He Sun
    In Proceedings of the 29th International Conference on High Performance Computing, Data, and Analytics , Dec 2022
  2. Luke Logan, Jaime Cernuda Garcia, Jay Lofstead, Xian–He Sun, and Anthony Kougkas
    In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , Nov 2022

2021

  1. Luke Logan, Jay Lofstead, Patrick Widener, Xian-He Sun, and Anthony Kougkas
    In Proceedings of the International Conference on Cluster Computing , Sep 2021
  2. Neeraj Rajesh, Hariharan Devarajan, Jaime Cernuda Garcia, Keith Bateman, Luke Logan, Jie Ye, Anthony Kougkas, and Xian-He Sun
    In Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing , Jun 2021

2020

  1. Anthony Kougkas, Hariharan Devarajan, and Xian-He Sun
    In Proceedings of the ACM Transactions on Storage , Oct 2020

2019

  1. Anthony Kougkas, Hariharan Devarajan, Jay Lofstead, and Xian-He Sun
    In Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing.Best Paper Award ║ , Jun 2019
  2. Hariharan Devarajan, Anthony Kougkas, and Xian-He Sun
    In Proceedings of the 19th International Symposium on Cluster, Cloud and Grid Computing , May 2019

2018

  1. Hariharan Devarajan, Anthony Kougkas, Prajwal Challa, and Xian-He Sun
    In Proceedings of the 25th International Conference on High Performance Computing , Dec 2018