A distributed model for sampling large scale social networks. (30th December 2021)
- Record Type:
- Journal Article
- Title:
- A distributed model for sampling large scale social networks. (30th December 2021)
- Main Title:
- A distributed model for sampling large scale social networks
- Authors:
- Jaouadi, Myriam
Ben Romdhane, Lotfi - Abstract:
- Abstract: Social networks content analysis has become more challenging over the years due to the rapidly increasing amount of data. Real social networks are omnipresent in everyday life, which makes the structure of the generated data more complex. A key task in social networks analysis is to reduce the network' s size and to produce an approximate representation that preserves the original network' s properties. This task is known as graph' s reduction and is gaining increasing attention in the scientific community. A review of literature reveals diverse methods to address this task. Some of them are based on graph coarsening and are developed to cope with the problem of communities detection. Others are part of graph sampling and are designed to reduce the graph' s size while preserving its structure, which is our purpose. In this paper, we put forth a distributed model called DGS "Distributed Graph Sampling" to generate a sample in a distributed way. The idea behind distributing our model is to cope with large scale social networks. In effect, our model is based on the MapReduce framework that allows to access simultaneously to several data segments for the calculation during the sampling strategy. The main task of our model is to use a new centrality measure based on the degree centrality to sample the graph. We evaluate the performance and the scalability of our DGS model using real world social networks. In this paper, we will compare our proposed model to fourAbstract: Social networks content analysis has become more challenging over the years due to the rapidly increasing amount of data. Real social networks are omnipresent in everyday life, which makes the structure of the generated data more complex. A key task in social networks analysis is to reduce the network' s size and to produce an approximate representation that preserves the original network' s properties. This task is known as graph' s reduction and is gaining increasing attention in the scientific community. A review of literature reveals diverse methods to address this task. Some of them are based on graph coarsening and are developed to cope with the problem of communities detection. Others are part of graph sampling and are designed to reduce the graph' s size while preserving its structure, which is our purpose. In this paper, we put forth a distributed model called DGS "Distributed Graph Sampling" to generate a sample in a distributed way. The idea behind distributing our model is to cope with large scale social networks. In effect, our model is based on the MapReduce framework that allows to access simultaneously to several data segments for the calculation during the sampling strategy. The main task of our model is to use a new centrality measure based on the degree centrality to sample the graph. We evaluate the performance and the scalability of our DGS model using real world social networks. In this paper, we will compare our proposed model to four well-known sampling strategies in order to demonstrate its efficiency to preserve the original network' s structure. Highlights: We investigate the effects of graph sampling on reducing the graph's size. We produce an approximate representation that retains the graph's structure. Our distributed method DGS is based on the MapReduce paradigm. DGS is able to cope with large scale social networks. For real world social networks our model demonstrates its efficiency. … (more)
- Is Part Of:
- Expert systems with applications. Volume 186(2021)
- Journal:
- Expert systems with applications
- Issue:
- Volume 186(2021)
- Issue Display:
- Volume 186, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 186
- Issue:
- 2021
- Issue Sort Value:
- 2021-0186-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-12-30
- Subjects:
- Social networks -- Graph sampling -- MapReduce paradigm -- Degree centrality
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2021.115773 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19606.xml