Название: Big Data Management and Analytics: Concepts, Tools, and Applications Автор: Rajesh Jugulum, David J. Fogarty, Chris Heien, Surya Putchala Издательство: CRC Press Год: 2025 Страниц: 187 Язык: английский Формат: pdf (true), epub Размер: 24.1 MB
As more companies go digital and conduct their business online, this book provides practical examples of how they can better manage their data and use it to generate maximum value. It offers an integrated approach by treating data as an asset and discusses how to preserve and protect it just like any other corporate asset.
Big Data Management and Analytics: Concepts, Tools, and Applications illustrates effective strategies for managing, governing, and analyzing Big Data to gain a competitive edge for companies utilizing Big Data and analytics. It offers a comprehensive guide on methods, tools, and concepts to efficiently manage and analyze Big Data in order to make informed decisions. Additionally, this book explores the significance of Artificial Intelligence (AI) and Machine Learning in leveraging Big Data and how they can be optimized in a well-structured environment. This book also emphasizes treating Big Data as a valuable asset and outlines strategies for preserving and safeguarding it like any other corporate asset. The inclusion of case studies ensures that the methodologies and concepts presented can be easily implemented in day-to-day operations.
Hadoop was invented in 2002, when Big Data started to emerge, and Apache software created a new type of database to handle large unstructured data, which was called Hadoop. The name “Hadoop” comes from an Indian cartoon character, an elephant. The developer of Hadoop had a son who watched this cartoon, which is where the inspiration for the name came from. Unlike structured databases, Hadoop can handle and process large amounts of unstructured data, making it ideal for the data storage and processing requirements of the digital age. Many firms are now uploading data to the cloud but will keep their most sensitive data on premises. This is where Hadoop comes into play. With on premise infrastructure is not as powerful as cloud technology, Hadoop can process large amounts of unstructured data more efficiently and effectively. With such large amounts of data being processed, firms must take care to make it easier for analysts to understand which data is available.
There are specific databases for unstructured data. One particular is the NoSQL database. As the name suggests, NSQL database do not depend on structured data and do not use SQL to enable access. If one desired a similar setup to a structured database, tools like Grok can be used. Grok is a filter tool within Logstash that is used to parse unstructured data into something structured and query able. It is used extensively for working with log files.
Given the current significance of Big Data in the business world, this book equips readers with the necessary skills to effectively manage this valuable asset. It is tailored for practitioners, students, and professionals working in data mining, Big Data, and Machine Learning across various industries, including manufacturing.
Скачать Big Data Management and Analytics: Concepts, Tools, and Applications
|