Practical big data analytics : hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R /: hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R. (2018)
- Record Type:
- Book
- Title:
- Practical big data analytics : hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R /: hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R. (2018)
- Main Title:
- Practical big data analytics : hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R
- Further Information:
- Note: Nataraj Dasgupta.
- Authors:
- Dasgupta, Nataraj
- Contents:
- Cover; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Too Big or Not Too Big; What is big data?; A brief history of data; Dawn of the information age; Dr. Alan Turing and modern computing; The advent of the stored-program computer; From magnetic devices to SSDs; Why we are talking about big data now if data has always existed; Definition of big data; Building blocks of big data analytics; Types of Big Data; Structured; Unstructured; Semi-structured; Sources of big data; The 4Vs of big data When do you know you have a big data problem and where do you start your search for the big data solution?Summary; Chapter 2: Big Data Mining for the Masses; What is big data mining?; Big data mining in the enterprise; Building the case for a Big Data strategy; Implementation life cycle; Stakeholders of the solution; Implementing the solution; Technical elements of the big data platform; Selection of the hardware stack; Selection of the software stack; Summary; Chapter 3: The Analytics Toolkit; Components of the Analytics Toolkit; System recommendations; Installing on a laptop or workstation Installing on the cloudInstalling Hadoop; Installing Oracle VirtualBox; Installing CDH in other environments; Installing Packt Data Science Box; Installing Spark; Installing R; Steps for downloading and installing Microsoft R Open; Installing RStudio; Installing Python; Summary; Chapter 4: Big Data With Hadoop; The fundamentals of Hadoop; The fundamentalCover; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Too Big or Not Too Big; What is big data?; A brief history of data; Dawn of the information age; Dr. Alan Turing and modern computing; The advent of the stored-program computer; From magnetic devices to SSDs; Why we are talking about big data now if data has always existed; Definition of big data; Building blocks of big data analytics; Types of Big Data; Structured; Unstructured; Semi-structured; Sources of big data; The 4Vs of big data When do you know you have a big data problem and where do you start your search for the big data solution?Summary; Chapter 2: Big Data Mining for the Masses; What is big data mining?; Big data mining in the enterprise; Building the case for a Big Data strategy; Implementation life cycle; Stakeholders of the solution; Implementing the solution; Technical elements of the big data platform; Selection of the hardware stack; Selection of the software stack; Summary; Chapter 3: The Analytics Toolkit; Components of the Analytics Toolkit; System recommendations; Installing on a laptop or workstation Installing on the cloudInstalling Hadoop; Installing Oracle VirtualBox; Installing CDH in other environments; Installing Packt Data Science Box; Installing Spark; Installing R; Steps for downloading and installing Microsoft R Open; Installing RStudio; Installing Python; Summary; Chapter 4: Big Data With Hadoop; The fundamentals of Hadoop; The fundamental premise of Hadoop; The core modules of Hadoop; Hadoop Distributed File System -- HDFS; Data storage process in HDFS; Hadoop MapReduce; An intuitive introduction to MapReduce; A technical understanding of MapReduce Block size and number of mappers and reducersHadoop YARN; Job scheduling in YARN; Other topics in Hadoop; Encryption; User authentication; Hadoop data storage formats; New features expected in Hadoop 3; The Hadoop ecosystem; Hands-on with CDH; WordCount using Hadoop MapReduce; Analyzing oil import prices with Hive; Joining tables in Hive; Summary; Chapter 5: Big Data Mining with NoSQL; Why NoSQL?; The ACID, BASE, and CAP properties; ACID and SQL; The BASE property of NoSQL; The CAP theorem; The need for NoSQL technologies; Google Bigtable; Amazon Dynamo; NoSQL databases; In-memory databases Columnar databasesDocument-oriented databases; Key-value databases; Graph databases; Other NoSQL types and summary of other types of databases ; Analyzing Nobel Laureates data with MongoDB; JSON format; Installing and using MongoDB; Tracking physician payments with real-world data; Installing kdb+, R, and RStudio; Installing kdb+; Installing R; Installing RStudio; The CMS Open Payments Portal; Downloading the CMS Open Payments data; Creating the Q application; Loading the data; The backend code; Creating the frontend web portal; R Shiny platform for developers Putting it all together -- The CMS Open Payments application … (more)
- Publisher Details:
- Birmingham, UK : Packt Publishing
- Publication Date:
- 2018
- Extent:
- 1 online resource (1 volume), illustrations
- Subjects:
- 004.6782
Computers -- Data Modeling & Design
Big data
Cloud computing
Machine learning
Database design & theory
Cloud computing
Information architecture
Computers -- Data Processing
Data capture & analysis
Big data
Cloud computing
Machine learning
COMPUTERS / Computer Literacy
COMPUTERS / Computer Science
COMPUTERS / Data Processing
COMPUTERS / Hardware / General
COMPUTERS / Information Technology
COMPUTERS / Machine Theory
COMPUTERS / Reference
Electronic books - Languages:
- English
- ISBNs:
- 9781783554409
1783554401 - Related ISBNs:
- 9781783554393
- Notes:
- Note: Description based on online resource; title from title page (viewed February 5, 2018).
- Access Rights:
- Legal Deposit; Only available on premises controlled by the deposit library and to one user at any one time; The Legal Deposit Libraries (Non-Print Works) Regulations (UK).
- Access Usage:
- Restricted: Printing from this resource is governed by The Legal Deposit Libraries (Non-Print Works) Regulations (UK) and UK copyright law currently in force.
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD.DS.265276
- Ingest File:
- 01_175.xml