akougkas.io
[ io = input/output ]
Assistant Research Professor
Data Management & Storage
Distributed Systems
Input/Output for HPC & AI/ML
Illinois Tech
Chicago, IL
I am an Assistant Research Professor in Computer Science at Illinois Tech, where I serve as the Deputy Director of the Gnosis Research Center. My work at Illinois Tech and as a Guest Research Scientist at Argonne National Laboratory involves developing cutting-edge data management and storage solutions. With a strong focus on High-Performance Computing (HPC) and Artificial Intelligence (AI), I lead innovative projects and guide the next generation of computer scientists. Always curious and ready to explore.
* for collaboration opportunities, please reach out via email
Interests
HPC Storage & I/O
- Multi-tiered storage architectures, hierarchical data buffering & prefetching, distributed & parallel I/O optimization.
- Asynchronous I/O techniques, I/O scheduling strategies, bottleneck detection & optimization, efficient resource utilization.
Data Management for Workflows
- Workflow execution optimizations, tiered data streaming engines, context-aware active storage, data prefetching algorithms.
- Data management for large datasets, prevention of I/O interference, advanced data compression techniques.
Systems for AI & Scientific ML
- Advanced data management for AI/ML frameworks, AI-driven I/O optimizations, exascale-ready storage.
- Integration of AI/ML with HPC systems, context-aware active storage, task-driven frameworks & data labeling.
Projects
NEW IOWarp
: IOWarp bridges HPC, AI, and Advanced Data Management for extracting insights from data. It accelerates data-driven research and enhances adaptability in modern AI/ML computing.
Hermes
: A multi-tiered I/O buffering platform for HPC environments, utilizing hierarchical data structures to optimize data placement across memory and storage hierarchies.
IRIS
: A unified data access framework merging HPC and analytics and AI, enabling integrated data and metadata management, and intelligent data placement.
ChronoLog
: A high-performance, distributed log storage system that manages activity and log workloads using physical time for event ordering, reducing contention, and enabling elastic scaling across storage tiers. Supports plugins, SQL-like engines, and TensorFlow.
Coeus
: A framework that accelerates scientific insights through enriched metadata management, optimizing queries by leveraging AI/ML to balance computation and storage, and enhancing query performance with tiered data placement and staging.
DTIO
: A task-driven I/O framework for the hyperconvergence of HPC, AI, and cloud, offering scalable, distributed I/O optimization, enhanced data movement, and improved fault tolerance.
LABIOS
: A novel, energy-efficient label-based I/O system for HPC environments, providing solutions such as multi-tenancy, resource scheduling, and seamless integration with diverse storage pools.
DeepIO
: A scalable I/O runtime for AI workflows, optimizing DNN model updates between training and inference tasks. It balances inference quality, training duration, and throughput using novel caching, versioning, and asynchronous data transfer techniques.
Contributions
Publications: Over 50 peer-reviewed publications in the fields of data management and storage.
Patents: “Label-Based Data Representation I/O Process and System”, US 2021/0374152 A1.
Funding: Strong track record in fundraising with over $12M of federal grants for my research.
Mentoring: Co-advising ten doctoral students in data management and AI-powered storage.
Recognition: Best Paper Awards at CCGrid’21 for DLIO and at HPDC’19 for LABIOS papers.
Software: Deployed open-source software like DLIO (now in Apache MLPerf storage).
Teaching: Active educator, emphasizing hands-on, lab-focused experiences for all students.
Latest Posts
Dec 30, 2024 | 2024 Reflections: Year in Review |
---|---|
Dec 07, 2024 | Change Hurts! Does it have to? |
Nov 26, 2024 | Reflecting on SC24 |
Selected Publications
- In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis SC , Nov 2024
- sc22 LabStor: A modular and extensible platform for developing high-performance, customized I/O stacks in userspaceIn Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , Nov 2022
- In Proceedings of the 21st International Symposium on Cluster, Cloud and Internet Computing ║ Best Paper Award ║ , May 2021
- In Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing. ║ Best Paper Award ║ , Jun 2019
- In Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing , Jun 2018
- In Proceedings of the 12th International Conference on e-Science , Jun 2016