Publications
You can also find my articles on my Google Scholar profile.
Conferences
- “GRANNY: Granular Management of Compute-Intensive Applications in the Cloud.” In USENIX Symp. on Networked Systems Design and Implementation (NSDI), Apr 2025. [PDF]
- “ORC: Increasing Cloud Memory Density via Object Reuse with Capabilities.” In Symp. on Operating Systems Design and Implementation (OSDI), Jul 2023. [PDF]
- “Translation Pass-Through for Near-Native Paging Performance in VMs.” In USENIX Annual Technical Conf. (ATC), Jul 2023. [PDF]
- “CAP-VMs: Capability-Based Isolation and Sharing in the Cloud.” In Symp. on Operating Systems Design and Implementation (OSDI), Jul 2022. [PDF] [presentation]
- “Reconsidering OS Memory Optimizations in the Presence of Disaggregated Memory.” In Intl. Symp. on Memory Management (ISMM), Jun 2022. [doi] [PDF]
- “Slashing the Disaggregation Tax in Heterogeneous Data Centers with FractOS.” In European Conf. on Computer Systems (EuroSys), Apr 2022. [doi] [PDF]
- “CubicleOS: A Library OS with Software Componentisation for Practical Isolation.” In Intl. Conf. on Arch. Support for Programming Languages & Operating Systems (ASPLOS), Apr 2021. [doi] [PDF]
- “Spons & Shields: Practical Memory Isolation for Hardware Enclaves.” In Intl. Conf. on Virtual Execution Environments (VEE), Apr 2021. [doi] [PDF]
- “Using SMT to accelerate nested virtualization.” In Intl. Symp. on Computer Architecture (ISCA), Jun 2019. [doi] [PDF]
- “DATS - Data Containers for Web Applications.” In Intl. Conf. on Arch. Support for Programming Languages & Operating Systems (ASPLOS), Mar 2018. [doi] [PDF]
- “Direct Inter-Process Communication (dIPC): Repurposing the CODOMs Architecture to Accelerate IPC.” In European Conf. on Computer Systems (EuroSys), Apr 2017. [doi] [PDF]
- “Automatic Parallelization of Kernels in Shared-Memory Multi-GPU Nodes.” In Intl. Conf. on Supercomputing (ICS), Jun 2015. [doi] [PDF]
- “Coherence Protocol for Transparent Management of Scratchpad Memories in Shared Memory Manycore Architectures.” In Intl. Symp. on Computer Architecture (ISCA), Jun 2015. [doi] [PDF]
- “CODOMs: Protecting Software with Code-centric Memory Domains.” In Intl. Symp. on Computer Architecture (ISCA), Jun 2014. [doi] [PDF]
- “Auto-Tunning of Data Communication on Heterogeneous Systems.” In Intl. Symp. on Embedded Multicore Socs (MCSoC), Sep 2013. [doi] [PDF]
- “Hardware/Software Coherence Protocol for the Coexistence of Caches and Local Memories.” In Intl. Conf. for High Performance Computing, Networking, Storage and Analysis (SC), Nov 2012. [doi] [PDF]
- “DiDi: Mitigating The Performance Impact of TLB Shootdowns Using A Shared TLB Directory.” In Intl. Conf. on Parallel Arch. and Compilation Techniques (PACT), Oct 2011. [doi] [PDF]
- “Predictive Runtime Code Scheduling for Heterogeneous Architectures.” In Intl. Conf. on High Performance Embedded Architectures and Compilers (HiPEAC), Jan 2009. [doi] [PDF]
Journals
- “Hardware-Software Coherence Protocol for the Coexistence of Caches and Local Memories.” IEEE Transaction on Computers, Jan 2015. [doi] [PDF]
- “The Low-Power Architecture Approach Towards Exascale Computing.” Journal of Computational Science, Nov 2013. [doi] [PDF]
Patents
- “Logical-to-physical block mapping inside the disk controller: accessing data objects without operating system intervention.” US10331591B2 (granted Jun 25th 2019), Sep 2014.
Workshops
- “Enabling Cloud-Scale Distributed Capabilities.” In Workshop on Heterogeneous Composable and Disaggregated Systems (HCDS), Mar 2025. [PDF]
- “Caladan: A Distributed Meta-OS for Data Center Disaggregation.” In Systems for Post-Moore Architectures (SPMA), Feb 2020. [PDF]
- “One Interface to Rule them All: A Hardware/Software Co-Design for Disaggregated Computing.” In Systems for Multi-core and Heterogeneous Architectures (SFMA), Mar 2019. [PDF]
- “Comparison based sorting for systems with multiple GPUs.” In Workshop on General Purpose Processor Using Graphics Processing Units (GPGPU), Nov 2013.
- “The low-power architecture approach towards exascale computing.” In Workshop on scalable algorithms for large-scale systems (ScalA), Nov 2011. [doi] [PDF]
- “The Cost of IPC: and Architectural Analysis.” In Workshop on the Interaction between Operating Systems and Computer Architecture (WIOSCA), Jun 2007.
- “Building a Global System View for Optimization Purposes.” In Workshop on the Interaction between Operating Systems and Computer Architecture (WIOSCA), Jun 2006.
Posters
- “Automatic Execution of Single-GPU Computations across Multiple GPUs.” In Intl. Conf. on Parallel Arch. and Compilation Techniques (PACT), Aug 2014. [doi] [PDF]
- “A Flexible Multi-Grain Full-System Tracing Framework.” In Advanced Computer Architecture and Compilation for Embedded Systems (ACACES-08), Poster Session, Jul 2008.
- “Support for Dynamically Adaptable Heterogeneous Applications.” In Advanced Computer Architecture and Compilation for Embedded Systems (ACACES-07), Poster Session, Jul 2007.
- “An Experimental Framework for Whole System Optimization.” In Advanced Computer Architecture and Compilation for Embedded Systems (ACACES-06), Poster Session, Jul 2006.
- “Opportunities for Global Optimization: Breaking the Boundaries Across System Components.” In EuroSys 2006, Poster Session, Jan 2006.
Technical Reports
- “Hardware/Software Coherence in Hybrid Memory Models.” Technical Report UPC-DAC-RR-CAP-2011-21, Jul 2011.