Showing results 1 to 19 of 19
Accelerator-centric Deep Learning Systems for Enhanced Scalability, Energy-Efficiency, and Programmability Rhu, Minsoo, 23rd Asia and South Pacific Design Automation Conference ASP-DAC 2018, SIGDA, 2018-01-25 |
ARK: Fully Homomorphic Encryption Accelerator with Runtime Data Generation and Inter-Operation Key Reuse Kim, Jongmin; Lee, Gwangho; Kim, Sangpyo; Sohn, Gina; Kim, John Dongjun; Rhu, Minsoo; Ahn, Jung Ho, The 55th IEEE/ACM International Symposium on Microarchitecture, MICRO 2022, pp.1237 - 1254, IEEE/ACM, 2022-10-05 |
Bandwidth bottleneck in network-on-chip for high-throughput processors Kim, Jiho; Cho, Sanghun; Rhu, Minsoo; Bakhoda, Ali; Aamodt, Tor M; Kim, John Dongjun, 2020 ACM International Conference on Parallel Architectures and Compilation Techniques, PACT 2020, pp.157 - 158, Institute of Electrical and Electronics Engineers Inc., 2020-10-05 |
Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning Kwon, Youngeun; Rhu, Minsoo, 51st Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2018, pp.148 - 161, IEEE Computer Society, 2018-10-22 |
BTS: An Accelerator for Bootstrappable Fully Homomorphic Encryption Kim, Sangpyo; Kim, Jongmin; Kim, Michael Jaemin; Jung, Wonkyung; Kim, John Dongjun; Rhu, Minsoo; Ahn, Jung Ho, The 49th IEEE/ACM International Symposium on Computer Architecture (ISCA-49), pp.711 - 725, IEEE/ACM, 2022-06 |
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations Hwang, Ranggi; Kim, Taehun; Kwon, Youngeun; Rhu, Minsoo, 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA), pp.968 - 981, IEEE/ACM, 2020-06-03 |
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks Rhu, Minsoo; O'Connor, Mike; Chatterjee, Niladrish; Pool, Jeff; Kwon, Youngeun; Keckler, Steve, 24th IEEE International Symposium on High Performance Computer Architecture, HPCA 2018, pp.78 - 91, IEEE Computer Society, 2018-02-26 |
DiVa: An Accelerator for Differentially Private Machine Learning Park, Beomsik; Hwang, Ranggi; Yoon, Dongho; Choi, Yoonhyuk; Rhu, Minsoo, The 55th IEEE/ACM International Symposium on Microarchitecture, MICRO 2022, pp.1200 - 1217, IEEE/ACM, 2022-10-05 |
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks Hwang, Ranggi; Kang, Minhoo; Lee, Jiwon; Kam, Dongyun; Lee, Youngjoo; Rhu, Minsoo, The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA-29), pp.42 - 55, IEEE, 2023-02-27 |
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference Choi, Yujeong; Kim, Yunseong; Rhu, Minsoo, The 27th IEEE International Symposium on High-Performance Computer Architecture (HPCA-27), pp.493 - 506, IEEE Computer Society, 2021-03-02 |
NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units Hyun, Bongjoon; Kwon, Youngeun; Choi, Yujeong; Kim, John Dongjun; Rhu, Minsoo, The 25th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-25), pp.1109 - 1124, ACM, 2020-03-20 |
PARIS and ELSA: An Elastic Scheduling Algorithm for Reconfigurable Multi-GPU Inference Servers Kim, Yunseong; Choi, Yujeong; Rhu, Minsoo, 59th ACM/IEEE Design Automation Conference, DAC 2022, pp.607 - 612, ACM/IEEE/ESDA, 2022-06-10 |
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units Choi, Yujeong; Rhu, Minsoo, 26th IEEE International Symposium on High Performance Computer Architecture, HPCA 2020, pp.220 - 233, IEEE, 2020-02-24 |
SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage Processing Architectures, Lee, Yunjae; Chung, Jinha; Rhu, Minsoo, 49th IEEE/ACM International Symposium on Computer Architecture, ISCA 2022, IEEE/ACM, 2022-06 |
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training Kwon, Youngeun; Lee, Yunjae; Rhu, Minsoo, The 27th IEEE International Symposium on High-Performance Computer Architecture (HPCA-27), pp.235 - 248, IEEE Computer Society, 2021-03-01 |
TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning Kwon, Youngeun; Lee, Yunjae; Rhu, Minsoo, The 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp.740 - 753, IEEE/ACM, 2019-10-15 |
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards Kwon, Youngeun; Rhu, Minsoo, 49th IEEE/ACM International Symposium on Computer Architecture, ISCA 2022, pp.860 - 873, IEEE/ACM, 2022-06 |
Trident: A Hybrid Correlation-Collision GPU Cache Timing Attack for AES Key Recovery Ahn, Jaeguk; Jin, Cheolgyu; Kim, Jiho; Rhu, Minsoo; Fei, Yunsi; Kaeli, David; Kim, John, 27th Annual IEEE International Symposium on High Performance Computer Architecture, HPCA 2021, pp.332 - 344, IEEE Computer Society, 2021-03-02 |
TRiM: Enhancing Processor-Memory Interfaces with Scalable Tensor Reduction in Memory Park, Jaehyun; Kim, Byeongho; Yun, Sungmin; Lee, Eojin; Rhu, Minsoo; Ahn, Jung Ho, 54th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2021, pp.268 - 281, IEEE/ACM, 2021-10-20 |
Discover