Java Data Science Cookbook. (2017)
- Record Type:
- Book
- Title:
- Java Data Science Cookbook. (2017)
- Main Title:
- Java Data Science Cookbook
- Other Names:
- Shams, Rushdi
- Contents:
- Cover; Credits; About the Author; About the Reviewer; www.PacktPub.com; Customer Feedback; Table of Contents; Preface; Chapter 1: Obtaining and Cleaning Data; Introduction; Retrieving all filenames from hierarchical directories using Java; Getting ready; How to do it…; Retrieving all filenames from hierarchical directories using Apache Commons IO; Getting ready; How to do it…; Reading contents from text files all at once using Java 8; How to do it…; Reading contents from text files all at once using Apache Commons IO; Getting ready; How to do it…; Extracting PDF text using Apache Tika Getting readyHow to do it…; Cleaning ASCII text files using Regular Expressions; How to do it…; Parsing Comma Separated Value (CSV) Files using Univocity; Getting ready; How to do it…; Parsing Tab Separated Value (TSV) file using Univocity; Getting ready; How to do it…; Parsing XML files using JDOM; Getting ready; How to do it…; Writing JSON files using JSON.simple; Getting ready; How to do it…; Reading JSON files using JSON.simple; Getting ready; How to do it …; Extracting web data from a URL using JSoup; Getting ready; How to do it… Extracting web data from a website using Selenium WebdriverGetting ready; How to do it…; Reading table data from a MySQL database; Getting ready; How to do it…; Chapter 2: Indexing and Searching Data; Introduction; Indexing data with Apache Lucene; Getting ready; How to do it…; How it works…; Searching indexed data with Apache Lucene; Getting ready; How to do it…;Cover; Credits; About the Author; About the Reviewer; www.PacktPub.com; Customer Feedback; Table of Contents; Preface; Chapter 1: Obtaining and Cleaning Data; Introduction; Retrieving all filenames from hierarchical directories using Java; Getting ready; How to do it…; Retrieving all filenames from hierarchical directories using Apache Commons IO; Getting ready; How to do it…; Reading contents from text files all at once using Java 8; How to do it…; Reading contents from text files all at once using Apache Commons IO; Getting ready; How to do it…; Extracting PDF text using Apache Tika Getting readyHow to do it…; Cleaning ASCII text files using Regular Expressions; How to do it…; Parsing Comma Separated Value (CSV) Files using Univocity; Getting ready; How to do it…; Parsing Tab Separated Value (TSV) file using Univocity; Getting ready; How to do it…; Parsing XML files using JDOM; Getting ready; How to do it…; Writing JSON files using JSON.simple; Getting ready; How to do it…; Reading JSON files using JSON.simple; Getting ready; How to do it …; Extracting web data from a URL using JSoup; Getting ready; How to do it… Extracting web data from a website using Selenium WebdriverGetting ready; How to do it…; Reading table data from a MySQL database; Getting ready; How to do it…; Chapter 2: Indexing and Searching Data; Introduction; Indexing data with Apache Lucene; Getting ready; How to do it…; How it works…; Searching indexed data with Apache Lucene; Getting ready; How to do it…; Chapter 3: Analyzing Data Statistically; Introduction; Generating descriptive statistics; How to do it…; Generating summary statistics; How to do it…; Generating summary statistics from multiple distributions; How to do it… There's more…Computing frequency distribution; How to do it…; Counting word frequency in a string; How to do it…; How it works…; Counting word frequency in a string using Java 8; How to do it…; Computing simple regression; How to do it…; Computing ordinary least squares regression; How to do it…; Computing generalized least squares regression; How to do it…; Calculating covariance of two sets of data points; How to do it…; Calculating Pearson's correlation of two sets of data points; How to do it…; Conducting a paired t-test; How to do it…; Conducting a Chi-square test; How to do it… Conducting the one-way ANOVA testHow to do it…; Conducting a Kolmogorov-Smirnov test; How to do it…; Chapter 4: Learning from Data -- Part 1; Introduction; Creating and saving an Attribute-Relation File Format (ARFF) file; How to do it…; Cross-validating a machine learning model; How to do it…; Classifying unseen test data; Getting ready; How to do it…; Classifying unseen test data with a filtered classifier; How to do it…; Generating linear regression models; How to do it…; Generating logistic regression models; How to do it…; Clustering data points using the KMeans algorithm; How to do it… … (more)
- Publisher Details:
- Place of publication not identified : Packt Publishing
- Publication Date:
- 2017
- Extent:
- 1 online resource ()
- Subjects:
- 005.1
COMPUTERS -- Data Processing
Java
COMPUTERS -- Intelligence (AI) & Semantics
COMPUTERS -- Programming Languages -- Java
Electronic books
Electronic books - Languages:
- English
- ISBNs:
- 1787127656
9781787127654 - Related ISBNs:
- 1787122530
- Notes:
- Note: Description based on print version record.
- Access Rights:
- Legal Deposit; Only available on premises controlled by the deposit library and to one user at any one time; The Legal Deposit Libraries (Non-Print Works) Regulations (UK).
- Access Usage:
- Restricted: Printing from this resource is governed by The Legal Deposit Libraries (Non-Print Works) Regulations (UK) and UK copyright law currently in force.
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD.DS.134159
- Ingest File:
- 01_006.xml