Practical data analysis. (2013)
- Record Type:
- Book
- Title:
- Practical data analysis. (2013)
- Main Title:
- Practical data analysis
- Further Information:
- Note: Hector Cuesta.
- Other Names:
- Cuesta, Hector
- Contents:
- Cover; Copyright; Credits; Foreword; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1:Getting Started; Computer science; Artificial intelligence (AI); Machine Learning (ML); Statistics; Mathematics; Knowledge domain; Data, information, and knowledge; The nature of data; The data analysis process; The problem; Data preparation; Data exploration; Predictive modeling; Visualization of results; Quantitative versus qualitative data analysis; Importance of data visualization; What about big data?; Sensors and cameras. Social networks analysisTools and toys for this book; Why Python?; Why mlpy?; Why D3.js?; Why MongoDB?; Summary; Chapter 2:Working with Data; Data sources; Open data; Text files; Excel files; SQL databases; NoSQL databases; Multimedia; Web scraping; Data scrubbing; Statistical methods; Text parsing; Data transformation; Data formats; CSV; Parsing a CSV file with the csv module; Parsing a CSV file using NumPy; JSON; Parsing a JSON file using json module; XML; Parsing an XML file in Python using xml module; YAML; Getting started with OpenRefine; Text facet; Clustering; Text filters. Numeric facetsTransforming data; Exporting data; Operation history; Summary; Chapter 3:Data Visualization; Data-Driven Documents (D3); HTML; DOM; CSS; JavaScript; SVG; Getting started with D3.js; Bar chart; Pie chart; Scatter plot; Single line chart; Multi-line chart; Interaction and animation; Summary; Chapter 4:TextCover; Copyright; Credits; Foreword; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1:Getting Started; Computer science; Artificial intelligence (AI); Machine Learning (ML); Statistics; Mathematics; Knowledge domain; Data, information, and knowledge; The nature of data; The data analysis process; The problem; Data preparation; Data exploration; Predictive modeling; Visualization of results; Quantitative versus qualitative data analysis; Importance of data visualization; What about big data?; Sensors and cameras. Social networks analysisTools and toys for this book; Why Python?; Why mlpy?; Why D3.js?; Why MongoDB?; Summary; Chapter 2:Working with Data; Data sources; Open data; Text files; Excel files; SQL databases; NoSQL databases; Multimedia; Web scraping; Data scrubbing; Statistical methods; Text parsing; Data transformation; Data formats; CSV; Parsing a CSV file with the csv module; Parsing a CSV file using NumPy; JSON; Parsing a JSON file using json module; XML; Parsing an XML file in Python using xml module; YAML; Getting started with OpenRefine; Text facet; Clustering; Text filters. Numeric facetsTransforming data; Exporting data; Operation history; Summary; Chapter 3:Data Visualization; Data-Driven Documents (D3); HTML; DOM; CSS; JavaScript; SVG; Getting started with D3.js; Bar chart; Pie chart; Scatter plot; Single line chart; Multi-line chart; Interaction and animation; Summary; Chapter 4:Text Classification; Learning and classification; Bayesian classification; Naïve Bayes algorithm; E-mail subject line tester; The algorithm; Classifier accuracy; Summary; Chapter 5:Similarity-based Image Retrieval; Image similarity search; Dynamic time warping (DTW). Processing the image datasetImplementing DTW; Analyzing the results; Summary; Chapter 6:Simulation of Stock Prices; Financial time series; Random walk simulation; Monte Carlo methods; Generating random numbers; Implementation in D3.js; Summary; Chapter 7:Predicting Gold Prices; Working with the time series data; Components of a time series; Smoothing the time series; The data -- historical gold prices; Nonlinear regression; Kernel ridge regression; Smoothing the gold prices time series; Predicting in the smoothed time series; Contrasting the predicted value; Summary. Chapter 8:Working with Support Vector MachinesUnderstanding the multivariate dataset; Dimensionality reduction; Linear Discriminant Analysis; Principal Component Analysis; Getting started with support vector machine; Kernel functions; Double spiral problem; SVM implemented on mlpy; Summary; Chapter 9:Modeling Infectious Disease with Cellular Automata; Introduction to epidemiology; The epidemiology triangle; The epidemic models; The SIR model; Solving ordinary differential equation for the SIR model with SciPy; The SIRS model; Modelling with cellular automata cell, state, grid, and neighborhood. … (more)
- Publisher Details:
- Birmingham, UK : Packt Publishing
- Publication Date:
- 2013
- Extent:
- 1 online resource (360 pages), illustrations
- Subjects:
- 005.7
COMPUTERS -- Databases -- General
Electronic data processing
Databases
Data structures (Computer science)
System design
System analysis
COMPUTERS / Data Processing
COMPUTERS / Databases / General
Data structures (Computer science)
Databases
System analysis
System design
COMPUTERS -- Data Transmission Systems -- General
Electronic books - Languages:
- English
- ISBNs:
- 9781680153613
1680153617
9781783281008
1783281006 - Related ISBNs:
- 9781783280995
1783280999 - Notes:
- Note: Print version record.
- Access Rights:
- Legal Deposit; Only available on premises controlled by the deposit library and to one user at any one time; The Legal Deposit Libraries (Non-Print Works) Regulations (UK).
- Access Usage:
- Restricted: Printing from this resource is governed by The Legal Deposit Libraries (Non-Print Works) Regulations (UK) and UK copyright law currently in force.
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD.DS.87365
- Ingest File:
- 01_030.xml