Big data preprocessing: methods and prospects. Issue 1 (December 2016)
- Record Type:
- Journal Article
- Title:
- Big data preprocessing: methods and prospects. Issue 1 (December 2016)
- Main Title:
- Big data preprocessing: methods and prospects
- Authors:
- García, Salvador
Ramírez-Gallego, Sergio
Luengo, Julián
Benítez, José
Herrera, Francisco - Abstract:
- Abstract The massive growth in the scale of data has been observed in recent years being a key factor of the Big Data scenario. Big Data can be defined as high volume, velocity and variety of data that require a new high-performance processing. Addressing big data is a challenging and time-demanding task that requires a large computational infrastructure to ensure successful data processing and analysis. The presence of data preprocessing methods for data mining in big data is reviewed in this paper. The definition, characteristics, and categorization of data preprocessing approaches in big data are introduced. The connection between big data and data preprocessing throughout all families of methods and big data technologies are also examined, including a review of the state-of-the-art. In addition, research challenges are discussed, with focus on developments on different big data framework, such as Hadoop, Spark and Flink and the encouragement in devoting substantial research efforts in some families of data preprocessing methods and applications on new big data learning paradigms.
- Is Part Of:
- Big data analytics. Volume 1:Issue 1(2016)
- Journal:
- Big data analytics
- Issue:
- Volume 1:Issue 1(2016)
- Issue Display:
- Volume 1, Issue 1 (2016)
- Year:
- 2016
- Volume:
- 1
- Issue:
- 1
- Issue Sort Value:
- 2016-0001-0001-0000
- Page Start:
- 1
- Page End:
- 22
- Publication Date:
- 2016-12
- Subjects:
- Big data -- Data mining -- Data preprocessing -- Hadoop -- Spark -- Imperfect data -- Data transformation -- Feature selection -- Instance reduction
Big data -- Periodicals
Biology -- Data processing -- Periodicals
570.28557 - Journal URLs:
- https://bdataanalytics.biomedcentral.com/ ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s41044-016-0014-0 ↗
- Languages:
- English
- ISSNs:
- 2058-6345
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9958.xml