akougkas.io

[ io = input/output ]

avatar_profile.png
Anthony Kougkas

assistant research professor


data management & storage

distributed & parallel systems

input/output for hpc & ai/ml



illinois tech

chicago, il

email

google scholar

github

linkedIn

i am an assistant research professor in computer science at illinois tech, where i serve as the associate director of the gnosis research center. my work at illinois tech and as a guest research faculty at argonne national laboratory involves developing cutting-edge data management and storage solutions. with a strong focus on high-performance computing (hpc) and artificial intelligence (ai), i lead innovative projects and guide the next generation of computer scientists. Always curious and ready to explore.

interests

hpc storage & i/o
  • multi-tiered storage architectures, hierarchical data buffering & prefetching, distributed & parallel i/o optimization.
  • asynchronous i/o techniques, i/o scheduling strategies, bottleneck detection & optimization, efficient resource utilization.
data management for workflows
  • workflow execution optimizations, tiered data streaming engines, context-aware active storage, data prefetching algorithms.
  • data management for large datasets, prevention of i/o interference, advanced data compression techniques.
systems for ai & scientific ml
  • advanced data management for ai/ml frameworks, ai-driven i/o optimizations, exascale-ready storage.
  • integration of ai/ml with hpc systems, context-aware active storage, task-driven frameworks & data labeling.

contributions

publications: advancing the fields of HPC data management and storage with over 50 peer-reviewed publications.

patents: US patent for the “Label-Based Data Representation I/O Process and System”, US 2021/0374152 A1.

funding: strong track record in acquiring research funding, successfully securing over $8M of federal grants.

mentoring: co-advising eight doctoral students in data streaming, AI-powered I/O, and programmable storage.

awards: Best Paper Awards at CCGrid’21 for the DLIO and at HPDC’19 for the LABIOS distributed data platform.

software: deployed open-source software like Hermes, ChronoLog, and DLIO (now in Apache MLPerf suite).

teaching: active learning, engaging teaching environment, emphasizing hands-on, lab-focused experiences.

projects

hermes: a state-of-the-art i/o buffering platform for hpc environments, utilizing hdf5 data structures to optimize data placement across memory and storage hierarchies, including nvram and ssds. 2.86m

iris: a unified data access framework merging hpc and analytics and ai, enabling integrated data and metadata management, and intelligent data placement. 515k

chronolog: a high-performance, distributed log storage system that manages activity and log workloads using physical time for event ordering, reducing contention, and enabling elastic scaling across storage tiers. supports plugins, sql-like engines, and tensorflow. 2.67m

coeus: a framework that accelerates scientific insights through enriched metadata management, optimizing queries by leveraging ai/ml to balance computation and storage, and enhancing query performance with tiered data placement and staging. 750k

dtio: a task-driven i/o framework for the hyperconvergence of hpc, ai, and cloud, offering scalable, distributed i/o optimization, enhanced data movement, and improved fault tolerance. 750k

labios: an innovative, energy-efficient label-based i/o system for multi-tiered hpc environments, providing solutions such as multi-tenancy, resource scheduling, and seamless integration with diverse storage pools. 600k

deepio: a scalable i/o runtime for ai workflows, optimizing dnn model updates between training and inference tasks. it balances inference quality, training duration, and throughput using novel caching, versioning, and asynchronous data transfer techniques. 135k

contact

for collaboration opportunities, please reach out via email or connect on linkedin.

selected publications

  1. Leveraging burst buffer coordination to prevent I/O interference
    Anthony Kougkas , Matthieu Dorier , Rob Latham , Rob Ross , and Xian-He Sun
    In eScience’16: Proceedings of the 12th International Conference on e-Science , Jun 2016
  2. Hermes: a heterogeneous-aware multi-tiered distributed I/O buffering system
    Anthony Kougkas , Hariharan Devarajan , and Xian-He Sun
    In HPDC’18: Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing , Jun 2018
  3. Labios: A distributed label-based i/o system
    Anthony Kougkas , Hariharan Devarajan , Jay Lofstead , and Xian-He Sun
    In HPDC’19: Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing.Best Paper Award ║ , Jun 2019
  4. DLIO: A data-centric benchmark for scientific deep learning applications
    Hariharan Devarajan , Huihuo Zheng, Anthony Kougkas , Xian-He Sun , and Venkatram Vishwanath
    In CCGrid’21: Proceedings of the 21st International Symposium on Cluster, Cloud and Internet ComputingBest Paper Award ║ , May 2021
  5. LabStor: A modular and extensible platform for developing high-performance, customized I/O stacks in userspace
    Luke Logan, Jaime Cernuda Garcia , Jay Lofstead , Xian–He Sun, and Anthony Kougkas
    In SC’22: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , Nov 2022