SANDS lab

We develop techniques and algorithms for building and managing key networked systems that are worthy of society’s trust. Our core interests lie in improving the modern computing environment where distributed systems and computer networks are a pervasive component.

We focus on bridging the gap between the abstractions that users (i.e., software developers, cloud providers, or network operators) need and what a performant, scalable, dependable and deployable system can achieve in practice.
We build prototypes that directly improve the lives of real users.
We seek solutions based on theoretically grounded arguments while also gaining insights into constraints and trade-offs in the design space.

Our goal is to enrich the human knowledge of how to build future-proof systems that can stand the test of time.

News

Aug'23: Chen-Yu (Elton) has defended his PhD thesis titled “Tackling the Communication Bottlenecks of Distributed Deep Learning Training Workloads” and will next join Bytedance (USA). Congratulations!
Mar'23: Arnaud has defended his PhD thesis titled “Verification and Privacy Techniques for Improving the Trustworthiness of Neural Networks” and will next join Nokia Bell Labs. Congratulations!
LineFS wins the Best Paper Award at SOSP'21.
Rethinking gradient sparsification as total error minimization accepted as spotlight paper (top 3%) at NeurIPS'21.
We organize a tutorial on Network-Accelerated Distributed Deep Learning at SIGCOMM'21.
In our NSDI'21 paper we demonstrated how to accelerate distributed ML via in-network aggregation with SwitchML. In our upcoming SIGCOMM'21 paper introducing OmniReduce, we are advancing streaming aggregation to leverage the sparsity of large models’ gradient vectors to accelerate training.
In the GRACE project, we survey popular gradient compression techniques for distributed deep learning and perform a comprehensive comparative evaluation. Read our ICDCS'21 paper.

Marco Canini

Associate Professor of Computer Science

Computer, Electrical and Mathematical Sciences & Engineering

King Abdullah University of Science and Technology (KAUST)

Principal Investigator

Marco’s research area is cloud computing, distributed systems and networking. His current interest is in designing better systems support for AI/ML and provide practical implementations deployable in the real-world.

Marco is an associate professor of Computer Science at King Abdullah University of Science and Technology (KAUST). Marco obtained his PhD from the University of Genoa in 2009 after spending the last year as a visiting student at the University of Cambridge. He was a postdoctoral researcher at EPFL and then a Senior Research Scientist at Deutsche Telekom Innovation Labs & TU Berlin. In 2013, he assumed an assistant professor position at UCLouvain, and in 2016 he became an Assistant Professor at KAUST. He was promoted to the rank of Associate Professor at KAUST in 2019. He also held positions at Intel Research, Google and Microsoft Research.

Teaching

Projects

SIDCo

An Efficient Statistical-Based Gradient Compression Technique for Distributed Training Systems

OmniReduce

Efficient Sparse Collective Communication

GRACE

GRAdient ComprEssion for distributed deep learning

FairFL

A Systems Approach to Tackling Fairness in Federated Learning

DC2

Delay-aware Communication Control for Distributed ML

SwitchML

Scaling Distributed Machine Learning with In-Network Aggregation

DAIET

In-Network Computation is a Dumb Idea Whose Time Has Come

Previous major projects focusing on SDN and programmable networks include:

Group

Faculty

Marco Canini

Associate Professor of Computer Science

Distributed Systems, Networking, Machine Learning, Cloud Computing

Research Staff

Amandio Faustino

Research Software Engineer

Mubarak Ojewale

Postdoc

Students

Achref Rebai

MS/PhD Student

Boris Radovic

PhD Student

Jihao Xin

PhD Student

Juyi Lin

MS/PhD Student

Mohammed K. Aljahdali

PhD Student

Norah Alballa

PhD Student

Salma Kharrat

PhD Student

Tongzhou Gu

MS/PhD Student

Vladyslav Shumanskyy

PhD Student

Alumni

Ahmed M. Abdelmoniem Sayed

Alumni

Postdoc 2019, Research Scientist 2020-2021, now Assistant Professor at QMUL

Amedeo Sapio

Alumni

Postdoc 2018-19, now Software Engineer at Intel

Arnaud Dethise

PhD Student

PhD 2023, now Research Scientist at Nokia Bell Labs

Atal Sahu

Alumni

MS 2020, now Data Scientist at Regology

Chen-Yu Ho

PhD Student

PhD 2023, joining Bytedance (USA)

Dan Levin

Alumni

PhD 2014, co-founder and CEO of Stacktile GmbH

Fatimah Zohra

Alumni

MS 2020, now PhD Student at KAUST

Hassan Alsibyani

Alumni

MS 2018, now Technical Lead at Wasphi

Jiawei Fei

Alumni

PhD with the sponsorship from China Scholarship Council (CSC) 2021

Lalith Suresh

Alumni

PhD 2016, now Researcher at VMware Research

Marco Chiesa

Alumni

Postdoc 2015-2017, now Associate Professor at KTH

M. Bilal

Alumni

PhD 2022, now Senior Engineer at Unbabel

Omar Alama

Alumni

Research Software Engineer 2020-21, now MSc in CE student at CMU

Omar Zawawi

Alumni

MS 2023, now Software Engineer at Mozn

Thanh Dang Nguyen

Alumni

Postdoc 2015-16, now Research Engineer at University of Chicago

Waleed Reda

Alumni

PhD 2022, now Postdoctoral Researcher at Microsoft Research

Yousef Alowayed

Alumni

MS 2018, now Software Engineer at Google

Publications

See all publications

Ahmed M. Abdelmoniem Sayed, Atal Sahu, Marco Canini, Suhaib Fahmy (2023). REFL: Resource-Efficient Federated Learning. Proceedings of EuroSys'23.

PDF

M. Bilal, Marco Canini, Rodrigo Fonseca, Rodrigo Rodrigues (2023). With Great Freedom Comes Great Opportunity: Rethinking Resource Allocation for Serverless Functions. Proceedings of EuroSys'23.

PDF

Norah Alballa, Marco Canini, Rodrigo Fonseca, Rodrigo Rodrigues (2023). A First Look at the Impact of Distillation Hyper-Parameters in Federated Knowledge Distillation. Proceedings of EuroMLSys'23.

PDF

Shuo Liu, Qiaoling Wang, Junyi Zhang, Wenfei Wu, Qinliang Lin, Yao Liu, Meng Xu, Marco Canini, Ray Chak Chung Cheung, Jianfei He (2023). In-Network Aggregation with Transport Layer Transparency for Distributed Training. Proceedings of ASPLOS'23.

PDF

Samuel Horváth, Chen-Yu Ho, Ľudovít Horváth, Atal Sahu, Marco Canini, Peter Richtárik (2022). Natural Compression for Distributed Deep Learning. Proceedings of MSML'22.

PDF

See all publications

Open Positions

I’m always looking for bright and enthusiastic people to join my group. If you are looking to do a PhD with me, thank you for your interest, but please read this first. If you don’t I will know, and I’m afraid I will have to ignore your message.

New positions: We invite applications for a post-doctoral researcher interested in working on optimizing distributed machine learning (ML) systems. See details here.
Internships: KAUST Visiting Student Research Program (VSRP) projects on High-efficiency AI and ML distributed systems at Big-Learning scales and on Making ML-based Networked Systems more Trustworthy

SANDS lab

News

Marco Canini

Associate Professor of Computer Science

Principal Investigator

Contact

Teaching

Projects

Group

Faculty

Associate Professor of Computer Science

Research Staff

Research Software Engineer

Postdoc

Students

MS/PhD Student

PhD Student

PhD Student

MS/PhD Student

PhD Student

PhD Student

PhD Student

MS/PhD Student

PhD Student

Alumni

Alumni

Alumni

PhD Student

Alumni

PhD Student

Alumni

Alumni

Alumni

Alumni

Alumni

Alumni

Alumni

Alumni

Alumni

Alumni

Alumni

Alumni

Publications

Open Positions