Название: Big Data Analytics: A Guide to Data Science Practitioners Making the Transition to Big Data Автор: Ulrich Matter Издательство: CRC Press Серия: Chapman & Hall/CRC Data Science Series Год: 2024 Страниц: 328 Язык: английский Формат: pdf (true) Размер: 18.2 MB
Successfully navigating the data-driven economy presupposes a certain understanding of the technologies and methods to gain insights from Big Data. This book aims to help data science practitioners to successfully manage the transition to Big Data.
Building on familiar content from applied econometrics and business analytics, this book introduces the reader to the basic concepts of Big Data Analytics. The focus of the book is on how to productively apply econometric and machine learning techniques with large, complex data sets, as well as on all the steps involved before analysing the data (data storage, data import, data preparation). The book combines conceptual and theoretical material with the practical application of the concepts using R and SQL. The reader will thus acquire the skills to analyse large data sets, both locally and in the cloud. Various code examples and tutorials, focused on empirical economic and business research, illustrate practical techniques to handle and analyse Big Data.
Key Features:
- Includes many code examples in R and SQL, with R/SQL scripts freely provided online. - Extensive use of real datasets from empirical economic research and business analytics, with data files freely provided online. - Leads students and practitioners to think critically about where the bottlenecks are in practical data analysis tasks with large data sets, and how to address them.
The book is a valuable resource for Data Science practitioners, graduate students and researchers who aim to gain insights from Big Data in the context of research questions in business, economics, and the social sciences.
Contents:
Part 1. Setting the Scene: Analyzing Big Data 1. What is Big in "Big Data"? 2. Approaches to Analyzing Big Data 3. The Two Domains of Big Data Analytics Part 2. Platform: Software and Computing Resources 4. Software: Programming with (Big) Data 5. Hardware: Computing Resources 6. Distributed Systems 7. Cloud Computing Part 3. Components of Big Data Analytics 8. Data Collection and Data Storage 9. Big Data Cleaning and Transformation 10. Descriptive Statistics and Aggregation 11. (Big) Data Visualization Part 4. Application: Topics in Big Data Econometrics 12. Bottlenecks in Everyday Data Analytics Tasks 13. Econometrics with GPUs 14. Regression Analysis and Categorization with Spark and R 15. Large-scale Text Analysis with sparklyr Part 5. Appendices Appendix A. GitHub Appendix B. R Basics Appendix C. Install Hadoop
Скачать Big Data Analytics: A Guide to Data Science Practitioners Making the Transition to Big Data
|