Browse "EE-Conference Papers(학술회의논문)" by Author Kwon, Youngeun

Showing results 1 to 7 of 7

1
Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning

Kwon, Youngeun; Rhu, Minsoo, 51st Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2018, pp.148 - 161, IEEE Computer Society, 2018-10-22

2
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations

Hwang, Ranggi; Kim, Taehun; Kwon, Youngeun; Rhu, Minsoo, 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA), pp.968 - 981, IEEE/ACM, 2020-06-03

3
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks

Rhu, Minsoo; O'Connor, Mike; Chatterjee, Niladrish; Pool, Jeff; Kwon, Youngeun; Keckler, Steve, 24th IEEE International Symposium on High Performance Computer Architecture, HPCA 2018, pp.78 - 91, IEEE Computer Society, 2018-02-26

4
NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units

Hyun, Bongjoon; Kwon, Youngeun; Choi, Yujeong; Kim, John Dongjun; Rhu, Minsoo, The 25th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-25), pp.1109 - 1124, ACM, 2020-03-20

5
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training

Kwon, Youngeun; Lee, Yunjae; Rhu, Minsoo, The 27th IEEE International Symposium on High-Performance Computer Architecture (HPCA-27), pp.235 - 248, IEEE Computer Society, 2021-03-01

6
TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning

Kwon, Youngeun; Lee, Yunjae; Rhu, Minsoo, The 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp.740 - 753, IEEE/ACM, 2019-10-15

7
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards

Kwon, Youngeun; Rhu, Minsoo, 49th IEEE/ACM International Symposium on Computer Architecture, ISCA 2022, pp.860 - 873, IEEE/ACM, 2022-06

rss_1.0 rss_2.0 atom_1.0