A barrier optimization framework for NUMA multi‐core system. (21st October 2019)
- Record Type:
- Journal Article
- Title:
- A barrier optimization framework for NUMA multi‐core system. (21st October 2019)
- Main Title:
- A barrier optimization framework for NUMA multi‐core system
- Authors:
- Yi, ZhengMing
Chen, Fei
Yao, YiPing - Abstract:
- Summary: Parallel program performance often critically depends on barrier performance. In modern NUMA multi‐core machines, barrier synchronization performance is significantly affected by cache‐coherence communication between cores, especially when the scale of NUMA systems is large, complex interconnected networks, memory hierarchies, and cache‐coherence protocols make optimization of barrier algorithm hard. We propose a general barrier optimization framework on NUMA multi‐core machines. The framework splits the barrier into three stages: the barrier arrival within a NUMA node, the barrier arrival across the NUMA nodes, and the wakeup, providing an opportunity to optimize the communication pattern and the cache‐line placement in each stage. To reduce remote communication traffic, we introduce a coordinator per NUMA node. In addition, we implement two barrier algorithms based on the framework. Finally, we show the superiority of the barrier algorithms within our framework over other barrier algorithms and show how to translate a barrier algorithm into a performance model to help make an optimal tradeoff design. Experiments were conducted on three NUMA multi‐core platforms and the results show that the barrier algorithm optimized within our framework is sufficient to deliver as good or better performance than state‐of‐art approaches on NUMA multi‐core machines.
- Is Part Of:
- Concurrency and computation. Volume 32:Number 5(2020)
- Journal:
- Concurrency and computation
- Issue:
- Volume 32:Number 5(2020)
- Issue Display:
- Volume 32, Issue 5 (2020)
- Year:
- 2020
- Volume:
- 32
- Issue:
- 5
- Issue Sort Value:
- 2020-0032-0005-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2019-10-21
- Subjects:
- barrier algorithm -- cache coherence -- NUMA multi‐core -- synchronization
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.5527 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 12742.xml