Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
RecSSD: near data processing for solid state drive based recommendation inference
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 717–729https://doi.org/10.1145/3445814.3446763Neural personalized recommendation models are used across a wide variety of datacenter applications including search, social media, and entertainment. State-of-the-art models comprise large embedding tables that have billions of parameters requiring ...
- research-articleApril 2021
Mind mappings: enabling efficient algorithm-accelerator mapping space search
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 943–958https://doi.org/10.1145/3445814.3446762Modern day computing increasingly relies on specialization to satiate growing performance and efficiency requirements. A core challenge in designing such specialized hardware architectures is how to perform mapping space search, i.e., search for an ...
- ArticleApril 2021
CutQC: using small Quantum computers for large Quantum circuit evaluations
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 473–486https://doi.org/10.1145/3445814.3446758Quantum computing (QC) is a new paradigm offering the potential of exponential speedups over classical computing for certain computational problems. Each additional qubit doubles the size of the computational state space available to a QC algorithm. ...
- research-articleApril 2021
Logical abstractions for noisy variational Quantum algorithm simulation
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 456–472https://doi.org/10.1145/3445814.3446750Due to the unreliability and limited capacity of existing quantum computer prototypes, quantum circuit simulation continues to be a vital tool for validating next generation quantum computers and for studying variational quantum algorithms, which are ...
- ArticleApril 2021
SIMDRAM: a framework for bit-serial SIMD processing using DRAM
- Nastaran Hajinazar,
- Geraldo F. Oliveira,
- Sven Gregorio,
- João Dinis Ferreira,
- Nika Mansouri Ghiasi,
- Minesh Patel,
- Mohammed Alser,
- Saugata Ghose,
- Juan Gómez-Luna,
- Onur Mutlu
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 329–345https://doi.org/10.1145/3445814.3446749Processing-using-DRAM has been proposed for a limited set of basic operations (i.e., logic operations, addition). However, in order to enable full adoption of processing-using-DRAM, it is necessary to provide support for more complex operations. In this ...
-
- research-articleApril 2021
Qraft: reverse your Quantum circuit and know the correct program output
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 443–455https://doi.org/10.1145/3445814.3446743Current Noisy Intermediate-Scale Quantum (NISQ) computers are useful in developing the quantum computing stack, test quantum algorithms, and establish the feasibility of quantum computing. However, different statistically significant errors permeate ...
- research-articleApril 2021
Training for multi-resolution inference using reusable quantization terms
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 845–860https://doi.org/10.1145/3445814.3446741Low-resolution uniform quantization (e.g., 4-bit bitwidth) for both Deep Neural Network (DNN) weights and data has emerged as an important technique for efficient inference. Departing from conventional quantization, we describe a novel training approach ...
- research-articleApril 2021
Rhythmic pixel regions: multi-resolution visual sensing system towards high-precision visual computing at low power
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 573–586https://doi.org/10.1145/3445814.3446737High spatiotemporal resolution can offer high precision for vision applications, which is particularly useful to capture the nuances of visual features, such as for augmented reality. Unfortunately, capturing and processing high spatiotemporal visual ...
- research-articleApril 2021Best Paper
Computing with time: microarchitectural weird machines
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 758–772https://doi.org/10.1145/3445814.3446729Side-channel attacks such as Spectre rely on properties of modern CPUs that permit discovery of microarchitectural state via timing of various operations. The Weird Machine concept is an increasingly popular model for characterization of emergent ...
- research-articleApril 2021
Warehouse-scale video acceleration: co-design and deployment in the wild
- Parthasarathy Ranganathan,
- Daniel Stodolsky,
- Jeff Calow,
- Jeremy Dorfman,
- Marisabel Guevara,
- Clinton Wills Smullen IV,
- Aki Kuusela,
- Raghu Balasubramanian,
- Sandeep Bhatia,
- Prakash Chauhan,
- Anna Cheung,
- In Suk Chong,
- Niranjani Dasharathi,
- Jia Feng,
- Brian Fosco,
- Samuel Foss,
- Ben Gelb,
- Sara J. Gwin,
- Yoshiaki Hase,
- Da-ke He,
- C. Richard Ho,
- Roy W. Huffman Jr.,
- Elisha Indupalli,
- Indira Jayaram,
- Poonacha Kongetira,
- Cho Mon Kyaw,
- Aaron Laursen,
- Yuan Li,
- Fong Lou,
- Kyle A. Lucke,
- JP Maaninen,
- Ramon Macias,
- Maire Mahony,
- David Alexander Munday,
- Srikanth Muroor,
- Narayana Penukonda,
- Eric Perkins-Argueta,
- Devin Persaud,
- Alex Ramirez,
- Ville-Mikko Rautio,
- Yolanda Ripley,
- Amir Salek,
- Sathish Sekar,
- Sergey N. Sokolov,
- Rob Springer,
- Don Stark,
- Mercedes Tan,
- Mark S. Wachsler,
- Andrew C. Walton,
- David A. Wickeraad,
- Alvin Wijaya,
- Hon Kwan Wu
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 600–615https://doi.org/10.1145/3445814.3446723Video sharing (e.g., YouTube, Vimeo, Facebook, TikTok) accounts for the majority of internet traffic, and video processing is also foundational to several other key workloads (video conferencing, virtual/augmented reality, cloud gaming, video in ...
- research-articleApril 2021
Orchestrated trios: compiling for efficient communication in Quantum programs with 3-Qubit gates
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 375–385https://doi.org/10.1145/3445814.3446718Current quantum computers are especially error prone and require high levels of optimization to reduce operation counts and maximize the probability the compiled program will succeed. These computers only support operations decomposed into one- and two-...
- research-articleApril 2021
DiAG: a dataflow-inspired architecture for general-purpose processors
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 93–106https://doi.org/10.1145/3445814.3446703The end of Dennard scaling and decline of Moore's law has prompted the proliferation of hardware accelerators for a wide range of application domains. Yet, at the dawn of an era of specialized computing, left behind the trend is the general-purpose ...
- ArticleApril 2021
When application-specific ISA meets FPGAs: a multi-layer virtualization framework for heterogeneous cloud FPGAs
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 123–134https://doi.org/10.1145/3445814.3446699While field-programmable gate arrays (FPGAs) have been widely deployed into cloud platforms, the high programming complexity and the inability to manage FPGA resources in an elastic/scalable manner largely limits the adoption of FPGA acceleration. ...
- ArticleApril 2021
Statistical robustness of Markov chain Monte Carlo accelerators
ASPLOS '21: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsApril 2021, pp 959–974https://doi.org/10.1145/3445814.3446697Statistical machine learning often uses probabilistic models and algorithms, such as Markov Chain Monte Carlo (MCMC), to solve a wide range of problems. Probabilistic computations, often considered too slow on conventional processors, can be accelerated ...