A case for three‐dimensional stacking of tightly coupled data memories over multi‐core clusters using low‐latency interconnects. Issue 5 (1st September 2013)
- Record Type:
- Journal Article
- Title:
- A case for three‐dimensional stacking of tightly coupled data memories over multi‐core clusters using low‐latency interconnects. Issue 5 (1st September 2013)
- Main Title:
- A case for three‐dimensional stacking of tightly coupled data memories over multi‐core clusters using low‐latency interconnects
- Authors:
- Azarkhish, Erfan
Loi, Igor
Benini, Luca - Abstract:
- Abstract : Shared tightly coupled data memories are key architectural elements for building multi‐core clusters in programmable accelerators and embedded systems, as they provide a convenient shared memory abstraction while avoiding cache coherence overheads. The performance of these memories largely depends on the architecture of the interconnect used between processing elements (PEs) and memory banks. The advent of three‐dimensional (3D) technology has provided new opportunities to increase design modularity and reduce latency and manufacturing cost. In this study, the authors propose two 3D network architectures: C‐logarithmic interconnect (LIN) and Distributed logarithmic interconnect (D‐LIN) (designed in synthesisable RTL), which allow modular stacking of multiple L1 memory dies over a multi‐core cluster with a limited number of PEs. The authors have used two through‐silicon‐via technologies: the state‐of‐the‐art micro‐bumps and the promising and dense Cu–Cu direct bonding. The overhead of electrostatic discharge protection circuits has been considered, as well. Architectural simulation results demonstrate that, in processor‐to‐L1‐memory context, C‐LIN and D‐LIN perform significantly better than traditional network‐on‐chips and simple time‐division multiplexing buses. Furthermore, post‐layout results show that the proposed 3D architectures achieve comparable speed against their 2D counterparts, whereas enabling modularity: from 256 kB to 2 MB L1 memory configurationsAbstract : Shared tightly coupled data memories are key architectural elements for building multi‐core clusters in programmable accelerators and embedded systems, as they provide a convenient shared memory abstraction while avoiding cache coherence overheads. The performance of these memories largely depends on the architecture of the interconnect used between processing elements (PEs) and memory banks. The advent of three‐dimensional (3D) technology has provided new opportunities to increase design modularity and reduce latency and manufacturing cost. In this study, the authors propose two 3D network architectures: C‐logarithmic interconnect (LIN) and Distributed logarithmic interconnect (D‐LIN) (designed in synthesisable RTL), which allow modular stacking of multiple L1 memory dies over a multi‐core cluster with a limited number of PEs. The authors have used two through‐silicon‐via technologies: the state‐of‐the‐art micro‐bumps and the promising and dense Cu–Cu direct bonding. The overhead of electrostatic discharge protection circuits has been considered, as well. Architectural simulation results demonstrate that, in processor‐to‐L1‐memory context, C‐LIN and D‐LIN perform significantly better than traditional network‐on‐chips and simple time‐division multiplexing buses. Furthermore, post‐layout results show that the proposed 3D architectures achieve comparable speed against their 2D counterparts, whereas enabling modularity: from 256 kB to 2 MB L1 memory configurations with a single mask set. … (more)
- Is Part Of:
- IET computers & digital techniques. Volume 7:Issue 5(2013)
- Journal:
- IET computers & digital techniques
- Issue:
- Volume 7:Issue 5(2013)
- Issue Display:
- Volume 7, Issue 5 (2013)
- Year:
- 2013
- Volume:
- 7
- Issue:
- 5
- Issue Sort Value:
- 2013-0007-0005-0000
- Page Start:
- 191
- Page End:
- 199
- Publication Date:
- 2013-09-01
- Subjects:
- cost reduction -- electrostatic discharge -- embedded systems -- network‐on‐chip -- shared memory systems -- time division multiplexing
state‐of‐the‐art microbumps -- 3D architectures -- time‐division multiplexing bus -- network‐on‐chip -- C‐LIN -- processor‐to‐L1‐memory context -- electrostatic discharge protection circuits -- dense Cu‐Cu direct bonding -- modular stacking -- D‐LIN -- C‐logarithmic interconnect -- 3D network architectures -- manufacturing cost reduction -- design modularity -- PE -- memory banks -- processing elements -- shared memory abstraction -- embedded systems -- programmable accelerators -- multicore clusters -- tightly coupled data memories -- three‐dimensional stacking
Computers -- Periodicals
Digital electronics -- Periodicals
Computer engineering -- Periodicals
Computer architecture -- Periodicals
Computer organization -- Periodicals
621.39 - Journal URLs:
- http://digital-library.theiet.org/content/journals/iet-cdt ↗
http://ieeexplore.ieee.org/servlet/opac?punumber=4117424 ↗
http://www.ietdl.org/IET-CDT ↗
https://ietresearch.onlinelibrary.wiley.com/journal/1751861x ↗
http://www.theiet.org/ ↗ - DOI:
- 10.1049/iet-cdt.2013.0031 ↗
- Languages:
- English
- ISSNs:
- 1751-8601
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4363.252300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17061.xml