Both sides previous revisionPrevious revisionNext revision | Previous revision |
start [2024/03/22 16:33] – geraldod | start [2024/06/28 19:14] (current) – [Lectures (tentative schedule, time zone: GMT-3)] geraldod |
---|
Recent PIM products and prototypes place compute units near the memory arrays. New memory interfaces like CXL (Compute Express Link) aid the enablement of compute-capable memories. At the same time, academia and industry are actively exploring other types of PIM by, e.g., exploiting the analog operation of DRAM, SRAM, flash memory, and emerging non-volatile memories, and hybrid PIM architectures that combine processing capabilities of different types and at different parts of the memory/storage hierarchy. | Recent PIM products and prototypes place compute units near the memory arrays. New memory interfaces like CXL (Compute Express Link) aid the enablement of compute-capable memories. At the same time, academia and industry are actively exploring other types of PIM by, e.g., exploiting the analog operation of DRAM, SRAM, flash memory, and emerging non-volatile memories, and hybrid PIM architectures that combine processing capabilities of different types and at different parts of the memory/storage hierarchy. |
| |
| {{:memory_centric_comp_banner.jpeg?400 |}} |
| |
PIM can improve performance and energy efficiency for many modern applications, enabling a commercially viable way of dealing with huge amounts of data bottlenecking our computing systems, which is especially exacerbated by workloads like AI/ML and genomics. In fact, workloads like large language model training and inference can potentially be “killer applications'' for PIM. | PIM can improve performance and energy efficiency for many modern applications, enabling a commercially viable way of dealing with huge amounts of data bottlenecking our computing systems, which is especially exacerbated by workloads like AI/ML and genomics. In fact, workloads like large language model training and inference can potentially be “killer applications'' for PIM. |
| |
| |
==== Livestream ==== | ==== Livestream ==== |
[[|Zoom livestream]] | [[https://www.youtube.com/watch?v=KV2MXvcBgb0 |YouTube livestream]] |
| {{youtube>KV2MXvcBgb0?large}} |
| |
==== Organizers ==== | ==== Organizers ==== |
^ Name ^ E-mail ^ | ^ Name ^ E-mail ^ |
|[[https://geraldofojunior.github.io/|Geraldo F. Oliveira]]| <geraldod@safari.ethz.ch> | | |[[https://geraldofojunior.github.io/|Geraldo F. Oliveira]]| <geraldod@safari.ethz.ch> | |
|[[https://people.inf.ethz.ch/omutlu/index.html|Onur Mutlu]]| <onur.mutlu@safari.ethz.ch> | | |
|[[https://ee.ethz.ch/the-department/people-a-z/person-detail.MjM3MTk5.TGlzdC8zMjc5LC0xNjUwNTg5ODIw.html|Dr. Mohammad Sadrosadati]] | <mohammad.sadrosadati@safari.ethz.ch> | | |[[https://ee.ethz.ch/the-department/people-a-z/person-detail.MjM3MTk5.TGlzdC8zMjc5LC0xNjUwNTg5ODIw.html|Dr. Mohammad Sadrosadati]] | <mohammad.sadrosadati@safari.ethz.ch> | |
|[[http://ataberkolgun.com/|Ataberk Olgun]] | <ataberk.olgun@safari.ethz.ch> | | |[[http://ataberkolgun.com/|Ataberk Olgun]] | <ataberk.olgun@safari.ethz.ch> | |
|[[https://people.inf.ethz.ch/omutlu/index.html|Professor Onur Mutlu]]| <onur.mutlu@safari.ethz.ch> | | |[[https://people.inf.ethz.ch/omutlu/index.html|Professor Onur Mutlu]]| <onur.mutlu@safari.ethz.ch> | |
| |
===== Agenda ===== | ===== Agenda (June 29, 2024) ===== |
==== Lectures (tentative schedule) ==== | ==== Lectures (tentative schedule, time zone: GMT-3) ==== |
| |
- Introduction: PIM as a paradigm to overcome the data movement bottleneck. | * 9:00am-9:20am, Prof. Onur Mutlu / Geraldo F. Oliveira, “Memory-centric Computing: Introduction to PIM as a Paradigm to Overcome the Data Movement Bottleneck.” |
- Workload analysis and system bottlenecks. | * Workload analysis and system bottlenecks. |
- PIM taxonomy: technology, location, and nature of computation (e.g., PNM (processing-near-memory) and PUM (processing-using-memory). | * PIM taxonomy: technology, location, and nature of computation (e.g., PNM (processing-near-memory) and PUM (processing-using-memory). |
- Advances in different types of PIM at different parts of the memory/storage systems. | * Advances in different types of PIM at different parts of the memory/storage systems. |
- Example real-world PNM systems: UPMEM PIM, Samsung HBM-PIM & CXL-PNM, SK Hynix AiM & CMS 2.0, Samsung AxDIMM, Alibaba PNM, Mythic. | * 9:30am-9:50am, Prof. Minsoo Rhu, Invited Talk #1. |
- PUM systems for bulk bitwise operations in simulated and off-the-shelf memory technologies (DRAM, SRAM, and NVM). | |
- Programming techniques and tools for PIM systems. | * Coffee break (10:00am-10:20am) |
- Infrastructures for doing PIM Research (simulation, real systems, FPGA prototypes). | |
- Research challenges and opportunities in PIM systems, with a focus on enabling adoption in the real world. | * 10:30am-10:50am, Dr. Mohammad Sadr, “Processing-Near-Memory: Real PNM Architectures.” |
| * Example real-world PNM systems: UPMEM PIM, Samsung HBM-PIM & CXL-PNM, SK Hynix AiM & CMS 2.0, Samsung AxDIMM, Alibaba PNM, Mythic. |
| * 11:00am-11:20am, Geraldo F. Oliveira, "PUM Systems for Bulk Bitwise Operations." |
| * PUM systems for bulk bitwise operations in simulated and off-the-shelf memory technologies (DRAM, SRAM, and NVM). |
| * 11:30am-11:50am, Prof. Saugata Ghose, Invited Talk #2. |
| * 12:00pm-12:20pm, Geraldo F. Oliveira, "Programming Techniques, Infrastructure, and Research Challenges for PIM." |
| * Programming techniques and tools for PIM systems. |
| * Infrastructures for doing PIM Research (simulation, real systems, FPGA prototypes). |
| * Research challenges and opportunities in PIM systems, with a focus on enabling adoption in the real world. |
| * 12:20pm-12:30pm, Geraldo F. Olveira, "Closing Remarks." |
| |
==== Tutorial Materials ==== | ==== Tutorial Materials ==== |
| |
| |
^ Time ^ Speaker ^ Title ^ Materials ^ | ^ Time ^ Speaker ^ Title ^ Materials ^ |
| TBA | TBA | TBA | TBA | | | 09:00am-09:20am | Prof. Onur Mutlu / Geraldo F. Oliveira | Memory-Centric Computing |{{geraldo-isca24-lecture1-memory-centric-computing-beforelecture.pdf|(PDF)}} {{geraldo-isca24-lecture1-memory-centric-computing-beforelecture.pptx|(PPT)}}| |
| | 09:30am-09:50am | Professor Minsoo Rhu | Memory-Centric Computing Systems – For AI and Beyond |{{|(PDF)}} {{|(PPT)}}| |
| | 10:00am-10:20am | N/A | Coffee Break | | |
| | 10:30am-10:50am | Dr. Mohammad Sadr | Processing-Near-Memory: Real PNM Architectures |{{sadr-isca24-lecture2-processing-near-memory-beforelecture.pdf|(PDF)}} {{sadr-isca24-lecture2-processing-near-memory-beforelecture.pptx|(PPT)}}| |
| | 11:00am-11:20am | Geraldo F. Oliveira | Processing-Using-Memory for Bulk Bitwise Operations |{{geraldo-isca24-lecture3-processing-using-memory-beforelecture.pdf|(PDF)}} {{geraldo-isca24-lecture3-processing-using-memory-beforelecture.pptx|(PPT)}}| |
| | 11:30am-11:50am | Professor Saugata Ghose | RACER and ReRAM Processing-Using-Memory |{{|(PDF)}} {{|(PPT)}}| |
| | 12:00pm-12:00pm | Geraldo F. Oliveira | Programming Techniques, Infrastructure, and Research Challenges for PIM |{{geraldo-isca24-lecture4-adoption-programmability-beforelecture.pdf|(PDF)}} {{geraldo-isca24-lecture4-adoption-programmability-beforelecture.pptx|(PPT)}}| |
| | 12:20pm-12:30pm | Geraldo F. Oliveira | Closing Remarks |{{geraldo-isca24-lecture5-conclusion-remarks-beforelecture.pdf|(PDF)}} {{geraldo-isca24-lecture5-conclusion-remarks-beforelecture.pptx|(PPT)}}| |
| |
==== Learning Materials ==== | ==== Learning Materials ==== |
| |
=== Recommended Materials === | === Recommended Materials === |
* Gómez-Luna, J., and Mutlu, O., Data-Centric Architectures: Fundamentally Improving Performance and Energy (227-0085-37L), ETH Zürich, Fall 2022. | |
* [[https://safari.ethz.ch/projects_and_seminars/fall2022/doku.php?id=processing_in_memory | Course Website]] | |
* [[https://youtube.com/playlist?list=PL5Q2soXY2Zi8KzG2CQYRNQOVD0GOBrnKy | Lecture Playlist]] | |
* Mutlu, O., Ghose, S., Gómez-Luna, J., and Ausavarungnirun, R. A Modern Primer on Processing in Memory. In Emerging Computing: From Devices to Systems, 2023. | * Mutlu, O., Ghose, S., Gómez-Luna, J., and Ausavarungnirun, R. A Modern Primer on Processing in Memory. In Emerging Computing: From Devices to Systems, 2023. |
* [[https://arxiv.org/pdf/2012.03112.pdf | PDF (arXiv)]] | |
* Gómez-Luna, J., El Hajj, I., Fernandez, I., Giannoula, C., Oliveira, G. F., and Mutlu, O. Benchmarking a New Paradigm: Experimental Analysis and Characterization of a Real Processing-in-Memory System. IEEE Access, 2022. | * Gómez-Luna, J., El Hajj, I., Fernandez, I., Giannoula, C., Oliveira, G. F., and Mutlu, O. Benchmarking a New Paradigm: Experimental Analysis and Characterization of a Real Processing-in-Memory System. IEEE Access, 2022. |
* [[https://arxiv.org/pdf/2105.03814.pdf | PDF (arXiv)]] | * [[https://arxiv.org/pdf/2105.03814.pdf | PDF (arXiv)]] |
* [[https://arxiv.org/pdf/2111.00082.pdf | PDF (arXiv)]] | * [[https://arxiv.org/pdf/2111.00082.pdf | PDF (arXiv)]] |
* [[https://github.com/cmu-safari/pidram | Repository (GitHub)]] | * [[https://github.com/cmu-safari/pidram | Repository (GitHub)]] |
| * Oliveira, G. F., Gómez-Luna, J., Orosa, L., Ghose, S., Vijaykumar, N., Fernandez, I., Sadrosadati, M., Mutlu, O. DAMOV: A New Methodology and Benchmark Suite for Evaluating Data Movement Bottlenecks. IEEE Access, 2021. |
| * [[https://arxiv.org/pdf/2105.03725.pdf | PDF (arXiv)]] |
| * [[https://github.com/CMU-SAFARI/DAMOV | Repository (GitHub)]] |
| * Luo, H., Tu, Y. C., Bostancı, F. N., Olgun, A., Ya, A. G., Mutlu, O. Ramulator 2.0: A Modern, Modular, and Extensible DRAM Simulator. IEEE CAL, 2023. |
| * [[https://arxiv.org/pdf/2308.11030.pdf | PDF (arXiv)]] |
| * [[https://github.com/CMU-SAFARI/ramulator2 | Repository (GitHub)]] |
| * Olgun, A., Hassan, H., Yağlıkçı, A. G., Tuğrul, Y. C., Orosa, L., Luo, H., Patel, M., Ergin, O., Mutlu, O. DRAM Bender: An Extensible and Versatile FPGA-based Infrastructure to Easily Test State-of-the-art DRAM Chips. IEEE CAD, 2023. |
| * [[https://arxiv.org/pdf/2211.05838.pdf | PDF (arXiv)]] |
| * [[https://github.com/CMU-SAFARI/DRAM-Bender | Repository (GitHub)]] |
| * Oliveira, G. F., Olgun, A., Yaglikci, A. G., Bostanci, N., Gomez-Luna, J., Ghose, S., Mutlu, O., MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Computing, in HPCA, 2024. |
| * [[https://arxiv.org/pdf/2402.19080.pdf | PDF (arXiv)]] |
| * [[https://github.com/CMU-SAFARI/MIMDRAM | Repository (GitHub)]] |
| * Hajinazar, N., Oliveira, G. F., Gregorio, S., Ferreira, J. D., Ghiasi, N. M., Patel, M., Alser, M., Ghose, S., Gomez-Luna, J., Mutlu. O., SIMDRAM: An End-to-End Framework for Bit-Serial SIMD Computing in DRAM, in ASPLOS, 2021. |
| * [[https://arxiv.org/pdf/2105.12839.pdf | PDF (arXiv)]] |
| * [[https://www.youtube.com/watch?v=lu3Br4-kySw | Full Talk Video]] |
| * Seshadri, V., Lee, D., Mullins, T., Hassan, H., Boroumand, A., Kim, J., Kozuch, M. A., Mutlu, O., Gibbons, P. B., Mowry, T. C., Ambit: In-Memory Accelerator for Bulk Bitwise Operations Using Commodity DRAM Technology, in MICRO, 2017. |
| * [[https://people.inf.ethz.ch/omutlu/pub/ambit-bulk-bitwise-dram_micro17.pdf | PDF]] |
| |
| |
=== More Learning Materials === | === More Learning Materials === |
* Mutlu O., Memory-Centric Computing (IMACAW Keynote Talk at DAC 2023), July 2023: | * Mutlu O., Memory-Centric Computing (IMACAW Keynote Talk at DAC 2023), July 2023: |
* [[https://people.inf.ethz.ch/omutlu/pub/onur-IMACAW-Keynote-MemoryCentricComputing-9-July-2023.pdf | PDF]] [[https://people.inf.ethz.ch/omutlu/pub/onur-IMACAW-Keynote-MemoryCentricComputing-9-July-2023.pdf | PPT]] [[https://www.youtube.com/watch?v=npr5zMEkANQ | Video]] | * [[https://people.inf.ethz.ch/omutlu/pub/onur-IMACAW-Keynote-MemoryCentricComputing-9-July-2023.pdf | PDF]] [[https://people.inf.ethz.ch/omutlu/pub/onur-IMACAW-Keynote-MemoryCentricComputing-9-July-2023.pdf | PPT]] [[https://www.youtube.com/watch?v=npr5zMEkANQ | Video]] |
* Processing-in-memory: A workload-driven perspective (summary paper about recent research in PIM): | * Processing-in-Memory: A Workload-Driven Perspective (summary paper about recent research in PIM): |
* [[https://people.inf.ethz.ch/omutlu/pub/processing-in-memory_workload-driven-perspective_IBMjrd19.pdf | PDF]] | * [[https://people.inf.ethz.ch/omutlu/pub/processing-in-memory_workload-driven-perspective_IBMjrd19.pdf | PDF]] |
* Processing Data Where It Makes Sense: Enabling In-Memory Computation (summary paper about recent research in PIM): | * Processing Data Where It Makes Sense: Enabling In-Memory Computation (summary paper about recent research in PIM): |
* Processing-in-Memory course (Spring 2022): | * Processing-in-Memory course (Spring 2022): |
* [[https://safari.ethz.ch/projects_and_seminars/spring2022/doku.php?id=processing_in_memory | Course website]] | * [[https://safari.ethz.ch/projects_and_seminars/spring2022/doku.php?id=processing_in_memory | Course website]] |
* [[https://sdk.upmem.com/2023.1.0/ | UPMEM SDK documentation: The first real-world PIM architecture]] | * Gómez-Luna, J., and Mutlu, O., Data-Centric Architectures: Fundamentally Improving Performance and Energy (227-0085-37L), ETH Zürich, Fall 2022. |
| * [[https://safari.ethz.ch/projects_and_seminars/fall2022/doku.php?id=processing_in_memory | Course Website]] |
| * [[https://youtube.com/playlist?list=PL5Q2soXY2Zi8KzG2CQYRNQOVD0GOBrnKy | Lecture Playlist]] |
| * [[https://arxiv.org/pdf/2012.03112.pdf | PDF (arXiv)]] |
| |