Scalable big data architecture : a practitioner's guide to choosing relevant big data architecture. (2016)
- Record Type:
- Book
- Title:
- Scalable big data architecture : a practitioner's guide to choosing relevant big data architecture. (2016)
- Main Title:
- Scalable big data architecture : a practitioner's guide to choosing relevant big data architecture
- Further Information:
- Note: Bahaaldine Azarmi.
- Authors:
- Azarmi, Bahaaldine
- Contents:
- Chapter 1: I think I have a Big (data) Problem (20 pages). Chapter Goal: This chapter introduces the topology of common limitations encountered when dealing with large amounts of data, and the common solutions to those problems. The goal is to lay the foundation of a heterogeneous architecture that is described in the following chapters. 1- Identifying Big Data symptoms; 2- Understanding the Big Data projects ecosystem; 3- Creating the foundation of a long term Big Data architecture.
Chapter 2: Early Big Data with No-SQL (30 pages). Chapter Goal: This chapter describes how a No-SQL database can be a starting point for your Big Data project, how it can deal with large amounts of data, what the limits of this model are, and how it can be scaled to a full-fledged Big Data project. 1- Choosing the right No-SQL database; 2- Introduction to Couchbase; 3- Introduction to Elasticsearch; 4- Using No-SQL cache in a SQL based architecture.
Chapter 3: Big Data processing jobs topology (30 pages). Chapter Goal: The more data you get, the more important it is to split the processing into different jobs depending on the topology of the processing. 1- Big Data Job processing strategy; 2- Smart data extraction from No-SQL database; 3- Short term processing jobs; 4- Long term processing jobs.
Chapter 4: Big Data Streaming Pattern (30 pages). Chapter Goal: This chapter helps readers understand their options when dealing with high-throughput streaming data. 1- Identifying streaming data sources; 2- Streaming with Big Data projects (Flume) versus Enterprise Service Bus; 3- Processing architecture for stream data.
Chapter 5: Querying and Analysing Patterns (30 pages). Chapter Goal: In this chapter, readers will understand how to leverage the processing work through long term and real-time data querying. 1- "Process then Query" strategy versus real-time querying; 2- Process, store and query data in Elasticsearch; 3- Real-Time querying using Spark.
Chapter 6: How About Learning from your Data? (30 pages). Chapter Goal: This chapter introduces the concept of machine learning at different levels of the previously described patterns and through different related methodologies. 1- Introduction to machine learning; 2- Supervised and Unsupervised learning; 3- A simple example of Machine learning; 4- Using MLlib for machine learning.
Chapter 7: Governance Considerations (20 pages). Chapter Goal: Monitoring, and more generally governance, is extremely important when dealing with an architecture that involves all the previous patterns. This chapter aims to safeguard the reader from major issues and to help gain visibility and control over the architecture. 1- Data Quality; 2- Architecture Scalability; 3- Security; 4- Monitoring.
- Publisher Details:
- Berkeley, CA : Apress
- Publication Date:
- 2016
- Copyright Date:
- 2016
- Extent:
- 1 online resource (xiii, 141 pages), illustrations
- Subjects:
- 005.7
Computer science
Big data
Databases
COMPUTERS -- Database Management -- General
Big data
Databases
Computers -- Data Processing
Public administration
Databases
Information systems
Database management
Electronic books
Electronic books
- Languages:
- English
- ISBNs:
- 9781484213261
1484213262
1484213270
9781484213278
- Related ISBNs:
- 9781484213278
- Notes:
- Note: Includes bibliographical references.
Note: Online resource; title from PDF title page (SpringerLink, viewed January 4, 2016).
- Access Rights:
- Legal Deposit; Only available on premises controlled by the deposit library and to one user at any one time; The Legal Deposit Libraries (Non-Print Works) Regulations (UK).
- Access Usage:
- Restricted: Printing from this resource is governed by The Legal Deposit Libraries (Non-Print Works) Regulations (UK) and UK copyright law currently in force.
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library HMNTS - ELD.DS.353454
- Ingest File:
- 01_312.xml